Philip Brown wrote:
Is it possible on some pages to crawl only between tags or have it not crawl between tags.

ie.

<nocrawl>blah blah blah</nocrawl>
<crawlhere>the content only that I want to crawl</crawlhere>
<nocrawl>blah blah blah</nocrawl>

appreciate any input
kind regards

You can modify DOMContentUtils.java (found in parse-html plugin) to implement this restriction.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to