Saturday, January 24, 2015

Big Data

  • Complex genomics data.
  • Remote sensing data. 
  • Station derived data.
  • GPS and transport data.
  • Vehicle, sensor, and radar data.
  • Hotel reservations, prices, and hotel market indicators.
  • Browsing history log data.
  • Sentiment analysis in business reviews.
  • Search engine cookie sorting.
  • Movie box office data.
  • Tool : Hadoop
  • Haul truck sensor data for preventive maintenance and mine route optimisation.
  • Banking and brokerage portal access data.
  • Tool : Splunk
  • Correlation of real time financial activity events with multiple threat intelligence feeds.
  • Genetic sequence data from cancer
  • Analytics data: user profiles, actions, and usage
  • Network security monitoring data for threat analysis
  • Security information event management (SIEM) data
  • Financial data time series
  • Farm data from sensors and ERP
  • Stock options data.
  • EOD tick data.
  • Tool : MongoDB
  • Twitter hashtag relationships
  • Twitter influencers and hashtag relationships
  • Tool : Twitter public API
  • Government population data
  • Human and machine-generated structured event streams
  • Tool : Snowplow
  • Data : GitHub event stream archive
  • Click stream logs for recommendation systems.
  • Tool : Scraping libraries
  • Tool : API
  • Generate your own data, for example metro train arrivals and departures at stations.
  • Retail transaction and loyalty data.
  • Governmental health records.
  • Governmental survey data.
  • Offline retail sales data
  • Poll response data
  • NYC taxi data
  • High-resolution fMRI data
  • DNA sequences and protein data
  • Raw radio data
  • Twitter social graph (connections between people)
  • Tool : Hadoop Distributed File System
  • Tool : Spark, Spark SQL, Tachyon, GraphX, MLlib, Hive, Impala, Drill, MapReduce, Kafka
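
The "generate your own data" item in the list above could look roughly like the sketch below, which produces synthetic metro train arrival and departure records and writes them as CSV. The station names, the 10-minute headway, and the dwell and travel time ranges are all made-up assumptions for illustration, not real timetable data.

```python
import csv
import io
import random
from datetime import datetime, timedelta

# Hypothetical station names, chosen for illustration only.
STATIONS = ["Central", "Riverside", "Market", "University", "Airport"]


def generate_trips(num_trains, start, headway_min=10, seed=42):
    """Yield one record per train per station with arrival and departure times."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same data
    for train in range(num_trains):
        # Each train leaves the first station one headway after the previous one.
        clock = start + timedelta(minutes=train * headway_min)
        for station in STATIONS:
            arrival = clock
            # Assumed dwell time at the platform: 20 to 60 seconds.
            departure = arrival + timedelta(seconds=rng.randint(20, 60))
            yield {
                "train_id": f"T{train:03d}",
                "station": station,
                "arrival": arrival.isoformat(),
                "departure": departure.isoformat(),
            }
            # Assumed travel time to the next station: 2 to 5 minutes.
            clock = departure + timedelta(seconds=rng.randint(120, 300))


def to_csv(records):
    """Serialise the records to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["train_id", "station", "arrival", "departure"]
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


records = list(generate_trips(num_trains=3, start=datetime(2015, 1, 24, 6, 0)))
csv_text = to_csv(records)
print(csv_text)
```

Raising num_trains or extending STATIONS scales the output, and the resulting CSV can be loaded into any of the tools listed above.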
