://old.nabble.com/PRUNE-%3A-need-some-help-on-pruning-syntax.-tp26268447p26268447.html
Sent from the Nutch - User mailing list archive at Nabble.com.
--
Subhojit Roy
Profound Technologies
(Search Solutions based on Open Source)
email: s...@profound.in
http://www.profound.in
will be included in all the pages. So i need to restrict my search not to
search with the content of a perticular div
ex : div class=menu /div.
Ho do i remove the content between a div from a search
--
View this message in context:
http://old.nabble.com/PRUNE-%3A-need-some-help
one option is to extend the html parser and look for these things and
ignore them.
you might also want to look at this forum posting:
http://www.mail-archive.com/nutch-user@lucene.apache.org/msg13969.html
On Mon, 2009-11-09 at 07:39 -0800, Annappa wrote:
Hi,
I am unsing Nutch-0.9 for