2011/6/24 James Kosin <[email protected]>:
> Olivier,
>
> No main() in the classes.  So, how does one get the collection of
> articles started?

It's meant to be used as a library. For instance, it is used by the
following custom pig Loader:

  
https://github.com/ogrisel/pignlproc/blob/master/src/main/java/pignlproc/storage/ParsingWikipediaLoader.java

which is in turn called in pig scripts such as:

 https://github.com/ogrisel/pignlproc/blob/master/examples/extract_links.pig

Apache Pig is scripting language and runtime environment to perform
distributed data analysis on an Apache Hadoop (HDFS + MapReduce)
cluster.

  http://pig.apache.org/

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

Reply via email to