PyCascading: All content tagged as PyCascading in NoSQL databases and polyglot persistence
Tuesday, 5 February 2013
Twitter and Their Cascading Libraries for Dealing With Different Scenarios
This is the only interesting paragraph from InfoWorld’s article “Twitter’s programmers speed Hadoop development“:
Three Twitter teams are using Cascading in combination with programming languages: The revenue team uses Scala, the publisher analytics team uses Clojure, and the analytics team uses Jython.
Each of these combinations led to new projects:
- Scala + Cascading => Scalding
- Clojure + Cascading => Cascalog
- Jython + Cascading => PyCascading
An interesting question I couldn’t answer is why each team prefers a different language. My hypothesis:
- Scala with its strong typing for handling clear models generating numbers that must always be correct.
- Clojure for designing new analysis models
- Jython enables quick experimentation with data.
Your thoughts?
Original title and link: Twitter and Their Cascading Libraries for Dealing With Different Scenarios (©myNoSQL)