Before Nutch generates Segments, it wants to draw Web page first in fact. But I as long as gain information content and URL of Web page. Web page information content is saved to the MySQL/Oracle database, I self carries out an index by lucene.
I extend Processor in Heritrix,but Nutch i do not know how to do? xingjian wrote: > > hi,ervery > > How to get the content of successful fetcher,in order to will this > content write Oracle database? > I do not need the function of Nutch indexing. > > thanks > -- View this message in context: http://www.nabble.com/How-to-writes-the-results-of-successful-fetcher-to-database.-tf4788594.html#a13717899 Sent from the Nutch - User mailing list archive at Nabble.com.
