Hadoop World 2010 Tweet Analysis

A fun project using Hadoop and the Twitter Streaming API:

During the keynote, I quickly created an Amazon Micro EC2 instance, tapped into the Twitter Streaming API, and began downloading tweets containing the hashtag #hw2010.

After filtering out a few Halloween tweets (get it? #hw2010?), about 1,500 tweets remained, respectable for a one-day event.
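The filtering step described above might look roughly like the sketch below. It assumes the tweets were already saved as one JSON object per line with a "text" field; the keyword list and the function names are hypothetical, since the original post does not say how the Halloween tweets were identified.

```python
import json

HASHTAG = "#hw2010"
# Hypothetical keyword list: the source does not say how Halloween
# tweets were recognized, so this is only an illustrative guess.
HALLOWEEN_WORDS = ("halloween", "costume", "pumpkin")


def is_conference_tweet(text):
    """Return True if the text carries the hashtag and no Halloween keyword."""
    lowered = text.lower()
    if HASHTAG not in lowered:
        return False
    return not any(word in lowered for word in HALLOWEEN_WORDS)


def filter_tweets(lines):
    """Parse JSON lines and keep only tweets that look conference-related."""
    kept = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        tweet = json.loads(line)
        if is_conference_tweet(tweet.get("text", "")):
            kept.append(tweet)
    return kept


# Made-up sample records, for illustration only.
sample = [
    '{"text": "Great keynote at Hadoop World #hw2010"}',
    '{"text": "My Halloween costume is ready #hw2010"}',
    '{"text": "Unrelated tweet about lunch"}',
]

kept = filter_tweets(sample)
print(len(kept))
```

A keyword filter like this is crude; at roughly 1,500 tweets, a quick manual review of the rejected records would be a cheap way to check it.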

For my (real-time) Hadoop World in Tweets, I used ☞ Storify and my own eyes. Not as scalable as Hadoop, though.

Original title and link: Hadoop World 2010 Tweet Analysis (NoSQL databases © myNoSQL)

via: http://www.cloudera.com/blog/2010/12/hadoop-world-2010-tweet-analysis/