Cascading 2.0 Released
Cascading the Java framework offering data processing, data flow, data integration, and process scheduling APIs for Hadoop has reached version 2.0. The most interesting points in this release summarized on the Cascading blog:
- Apache 2.0 Licensing
- Support for Hadoop 1.0.2
- Local and Hadoop planner modes, where local runs in memory without Hadoop dependencies
- HashJoin pipe for “map side joins”
- Merge pipe for “map side merges”
- Simple Checkpointing for capturing intermediate data as a file
- Improved Tap and Scheme APIs
Congrats!
Original title and link: Cascading 2.0 Released (©myNoSQL)