> Tasos Angelis wrote:
>
> I noticed that in the search results only html tags are removed.
> But, how can I remove some other unwanted words from the results of
> htsearch?
>
> For example in a site that htdig indexes there are some javascript
> functions.
> I want to selectively remove those functions.
> e.g. .... something something something .... findit();.... something
> to be:
> .... something something something .... .... something
> without findit(); in it.
> There are ten or so of those functions.
You should put anything inside <SCRIPT> tags in SGML comments.
This applies not only to Ht://Dig but also to most other robot
software, which would otherwise interpret the contents of <SCRIPT> as
normal text. Another way to keep the code out of the index is to put
the JavaScript in an external file (<SCRIPT SRC="file.js">). I don't
know of any crawler that follows the SRC attribute of the <SCRIPT> tag.
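
As a rough sketch of both approaches (findit() is taken from your example, its body is only a placeholder, and file.js is a hypothetical file name), the page markup could look something like this:

```html
<!-- Approach 1: wrap the inline code in an SGML comment so that
     robots which do not understand SCRIPT skip it as a comment. -->
<SCRIPT LANGUAGE="JavaScript">
<!--
function findit() {
    /* placeholder body; the real function goes here */
}
// -->
</SCRIPT>

<!-- Approach 2: move the code into an external file so that
     nothing indexable remains in the page itself. -->
<SCRIPT LANGUAGE="JavaScript" SRC="file.js"></SCRIPT>
```

The "// -->" line keeps the closing comment marker from being read as JavaScript by browsers that do run the script.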
You can also configure Ht://Dig to ignore anything inside <SCRIPT>
by setting the noindex_start and noindex_end configuration attributes.
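
A minimal sketch of what that might look like in the htdig configuration file, assuming your version supports these attributes and that your pages write the tag in upper case (check the attribute documentation for your release, since matching behaviour may differ between versions):

```
# Skip everything from the opening SCRIPT tag up to its closing tag.
# The start pattern omits the closing ">" so that tags carrying
# attributes, such as <SCRIPT LANGUAGE="JavaScript">, also match.
noindex_start: <SCRIPT
noindex_end:   </SCRIPT>
```

Note that this replaces whatever start and end markers were configured before, and the documents have to be re-indexed before the change shows up in search results.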
hth,
Torsten
--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14 Tel: +49-4101-403605
D-25474 Ellerbek Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED] Internet: http://www.inwise.de