Treat your dataset like a: * stream of lines when it's efficient to process by lines * stream of field arrays when it's efficient to deal directly with fields * stream of lightweight objects when it's efficient to deal with objects Wukong is friends with Hadoop the elephant, Pig the query language, and the cat on your command line.
Required Ruby Version
None
Authors
Infochimps, Philip (flip) Kromer, Travis Dempsey
Versions
- 4.0.0 March 19, 2014 (833 KB)
- 3.0.1 March 07, 2013 (821 KB)
- 3.0.0 February 20, 2013 (821 KB)
- 3.0.0.pre3 December 17, 2012 (739 KB)
- 3.0.0.pre2 December 01, 2012 (550 KB)