Re: Crawl Anywhere -

SUJIT PAL Sun, 10 Feb 2013 21:58:17 -0800

Hi Siva,

You will probably get a better answer if you ask on the Nutch mailing list
[http://nutch.apache.org/mailing_lists.html].

Nutch 2.1 may be what you are looking for (it stores pages in a NoSQL database).

Regards,
Sujit

On Feb 10, 2013, at 9:16 PM, SivaKarthik wrote:

> Dear Erick,
>   Thanks for your reply.
>   Yes, Nutch can meet my requirement,
>   but the problem is that I want to store the crawled documents in HTML or XML
> format instead of the MapReduce format.
>   I am not sure whether Nutch plugins are available to convert them into XML files.
>   Please share any ideas you have.
> 
> Thank you
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/ANNOUNCE-Web-Crawler-tp2607831p4039619.html
> Sent from the Solr - User mailing list archive at Nabble.com.
