Hi Have you looked at http://manifoldcf.apache.org? Might be a better fit for what you are describing. Not sure it does parsing though.
On 23 May 2014 11:08, Bayu Widyasanyata <[email protected]> wrote: > Hi, > > Anyone could pointing me on documentation how to pull in (fetching) data > from database (e.g. common RDBMS such MySQL, etc.) with nutch? > While the rest of process are nutch commons: parse and index them. > > Thanks in advance. > > -- > wassalam, > [bayu] > -- Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

