One trick would be to search on a URL, explain link shows what segments
it belongs to, say 1200604211450.

Then using segread command (this works for 0.7.2)

bin/nutch segread -dumpsort -nocontent  segments/1200604211450   

That shows text, parse data for a URL.

Thanks
P




-----Original Message-----
From: Dennis Kubes [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, April 26, 2006 1:42 AM
To: nutch-user@lucene.apache.org
Subject: How to get Text and Parse data for URL

Can somebody direct me on how to get the stored text and parse metadata 
for a given url?

Dennis

Reply via email to