Hi Siva,

You will probably get a better reply if you head over to the Nutch mailing list [http://nutch.apache.org/mailing_lists.html] and ask there.
Nutch 2.1 may be what you are looking for (it stores pages in a NoSQL database).

Regards,
Sujit

On Feb 10, 2013, at 9:16 PM, SivaKarthik wrote:

> Dear Erick,
> Thanks for your reply.
> Yes, Nutch can meet my requirement,
> but the problem is that I want to store the crawled documents in HTML or XML
> format instead of MapReduce format.
> I am not sure whether Nutch plugins are available to convert them into XML files.
> Please share any ideas you have.
>
> Thank you
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/ANNOUNCE-Web-Crawler-tp2607831p4039619.html
> Sent from the Solr - User mailing list archive at Nabble.com.