slide1 n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
BIG DATA PowerPoint Presentation
Download Presentation
BIG DATA

Loading in 2 Seconds...

play fullscreen
1 / 25

BIG DATA - PowerPoint PPT Presentation


  • 138 Views
  • Uploaded on

BIG DATA. MBUS 626-01 - G4. Zoe Mayhook Bailee Neyland Crystal Side Michael Stuber. Big Data. Finding that Diamond in the Rough. Most common interpretation of big data is the systematic analysis of huge volumes of data to find patterns and behaviors that are not readily apparent.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'BIG DATA' - nikita


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
slide1

BIG DATA

MBUS 626-01 - G4

Zoe Mayhook Bailee Neyland Crystal Side Michael Stuber

finding that diamond in the rough

Big Data

Finding that Diamond in the Rough
  • Most common interpretation of big data is the systematic analysis of huge volumes of data to find patterns and behaviors that are not readily apparent.
  • Has rapidly created an entire sub-industry that generated $11.59 billion in 2012, according to the research community Wikibon.
  • By 2017, they predict the big data market will be worth $47 billion.
slide3

Big Data Defined

“A massive volume of both structured and unstructured data that is so large that it’s difficult to process using traditional databases and software techniques.”

3-V Model

slide4

Big Data

Continued…

  • What makes data big?
  • Origin
  • Growth
slide5

Major Sources

of Big Data

  • Social Media
  • Server Logs
  • Web/clickstream
  • Machine/sensor
  • Geolocation
important factors to consider
Important factors to consider
  • Big data needs to be mediated by the human touch and common sense
  • Human beings and human-oriented decisions must play a fundamental role in any big data strategy or companies risk alienating their customers and damaging their brands.
slide7

Who is using Big Data?

LEADERS:

  • Amazon
    • Uses big data to drive innovation through data, with scalable services for data collection, storage, integration, analytics and collaboration
    • Handles millions of back-end operations every day, as well as queries from more than half a million third-party sellers
  • Walmart
    • Handles more than one million customer transactions every hour
    • Uses big data to reach customers, or friends of customers who have mentioned something online to inform them about that exact product and include a discount
  • Netflix
    • Uses big data to more-accurately predict the consumer behaviors of their subscribers and potential subscribers
who else is using big data
Who else is using Big Data?

And who should?

Start ups

Healthcare

slide9

Big Data Companies

  • Cloudera- leader in Apache Hadoop-based software and services and offers a powerful new data platform that enables enterprises and organizations to look at all their data and ask bigger questions for unprecedented insight at the speed of thought.
  • MapR - delivers on the promise of Hadoop with a proven, enterprise-grade platform that supports a broad set of mission-critical and real-time production uses.
  • Splunk- founded to pursue a disruptive new vision: make machine data accessible, usable and valuable to everyone. By monitoring and analyzing everything from customer clickstreams and transactions to network activity and call records - Splunk turns machine data into valuable insights no matter the business.
  • Palantir- Delivers big data technology to improve crisis response
slide10

Up-and-Coming

Big Data Companies

slide11

Big Data Technologies

Wide-scale digitization of information has created many new sources of data

Traditional approaches to managing data don’t support volume, velocity, and variety

New approaches are needed:

  • NoSQL Databases
  • MapReduce & Hadoop
nosql
NoSQL
  • Not Only Structure Query Language
  • Data can be unstructured
  • Data is typically organized in key-value pairs
  • Values can be anything from images, songs, and documents, to lists or traditional data types
  • Examples include Cassandra & Redis
mapreduce hadoop
MapReduce & Hadoop
  • All processing is done on key/value pairs
  • Basic approach is to organize very large sets of data (map) and then crunch them (reduce)
  • Many algorithms can be implemented within MapReduce architecture
  • Hadoop & MapReduce systems provide task management & file systems to distribute jobs across hundreds (or thousands) of commodity servers
mini case discussion
Mini Case Discussion

The San Leandro California Police Department uses mounted squad car camera’s to routinely photograph license plates while patrolling the area. Millions of these pictures are passed on to the Northern California Regional Intelligence Center, and are analyzed using big data software developed by Palantir.

What are the benefits to photographing, saving, and analyzing license plate information?

2. What do you find most concerning?

ethics for big data
Ethics for Big Data
  • Ethically Neutral
  • Might not align with how we feel, but should align with core values
  • Ethical inquiry should take place due to the sheer volume, variety and velocity of big data
framework for big data ethics
Framework for Big Data Ethics
  • Identity
    • Relationship between offline and online identity
  • Privacy
    • Who should control access to data?
  • Ownership
    • Who owns data, can rights be transferred?
  • Reputation
    • Can we determine what data is trustworthy?
slide22

Alignment of Methodology

  • Inquiry
    • Discussion of core values
  • Analysis
    • Review current practices, and assess how well they align with core values
  • Articulation
    • Explicit, written expression of alignment and misalignment between values and practices
  • Action
    • Tactical plan to close alignment gaps
slide23

Ethical Guidelines - Proposals

  • Radical Transparencies
    • explain what data is being collected and how it will be used
  • Simplicity by Design
    • Allow users to adjust any privacy settings to determine what they want shared or now
    • Privacy policies should be simple and understandable
  • Preparation and Security
    • Define what information and data you need, and what information you can do without
    • Develop crisis strategy if company system gets hacked
  • Make Privacy Part of the DNA
    • Hire a chief privacy officer or chief data officer
    • Address privacy in all levels of the organization
benefits of adopting
Benefits of Adopting

Big Data Ethics

  • Reduction in risk of unintended consequences
  • Faster consumer adoption (reducing fear of unknown)
  • Increased pace of innovation
  • Reduced friction from legislation
slide25

Big data is about “building new analytic applications based on new types of data, in order to better serve your customers and drive a better competitive advantage.”

…Thank you