Hello
On our homepage (http://www.aramis-research.ch/) we present research
projects from a database. We therefore have a lot of template content (i.e.
field names).
We use ht://Dig as search engine. To make sure that this (to searches
irrelevant) template content is not indexed we have made heavy use of
[noindex_start] and [noindex_end] -- in our case we used '<!--
htdig_noindex_end -->' and '<!-- htdig_noindex_start -->'.
Now I found out that htdig does not seem to consider these tags as white
spaces forming separate words.
Example
[...snip....]
<!-- htdig_noindex_end -->SANDRINE : Biosensor tracing of endocrine
disrupting compounds in surface water, waste water and sludge for water
quality assessment<!-- htdig_noindex_start --><FONT></TD>
</TR><TR>
<TD ALIGN="LEFT" VALIGN="TOP"><BR>
<B>Key words</TD>
<TD ALIGN="LEFT" VALIGN="TOP"></TD>
<TD ALIGN="LEFT" VALIGN="TOP"><BR>
(English)<BR><!-- htdig_noindex_end -->Estrogenic activity; endocrine
disrupting compounds; biosensor; fluorescence techniques; estrogen
receptor<!-- htdig_noindex_start -->
[/...snip...]
(see: http://www.aramis-research.ch/e/6855.html)
In the index the term "assessmentestrogenic" appears which (to me) means
htdig does not consider the masking out of general (template) content as
equivalent to white spaces -- concatenation of the last word in the title
with the first word from 'key words'. Or am I wrong?
I understand that htdig 'completely' ignores everything between the
respective tags (http://www.htdig.org/attrs.html#noindex_start) -- still I
didn't expect this behaviour and I am not sure others do. Is there a work
around or something I can change in the configuration to make sure that the
last word before the beginning of the ignored section and the first word of
the next section are seen/indexed as separate words? Do I have to put a
(hard) white space before '<!-- htdig_noindex_start -->' (which I'd rather
not have to do because our internet system has been developed out of house)?
Thanks for any helpful comments
Andreas
-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100006ave/direct;at.asp_061203_01/01
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html