You can do this within a custom HtmlParser and IndexFilter. In
the HtmlParser, look at the page and decide whether you want it
or not, then insert a metadata property called "index" and
set it to "true" or "false".
In the filter method of the index filter, look up the "index"
metadata, and if it's false, just return without indexing anything.
Howie
I normally "tag" such pages and exclude them in a search by manipulating
the search string.
Not the best solution but it works.
Stefan
Am 24.01.2006 um 10:12 schrieb Franz Werfel:
Hello,
Is it possible not to index certain pages based on their content or on
their size (and not on their url)? If so, how?
Thanks,
F.