2011/6/24 James Kosin <[email protected]>: > Olivier, > > No main() in the classes. So, how does one get the collection of > articles started?
It's meant to be used as a library. For instance, it is used by the following custom pig Loader: https://github.com/ogrisel/pignlproc/blob/master/src/main/java/pignlproc/storage/ParsingWikipediaLoader.java which is in turn called in pig scripts such as: https://github.com/ogrisel/pignlproc/blob/master/examples/extract_links.pig Apache Pig is scripting language and runtime environment to perform distributed data analysis on an Apache Hadoop (HDFS + MapReduce) cluster. http://pig.apache.org/ -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel
