According to Emma Jane Hogbin:
> re. noindex_start and _end tags:
> >No, right now there is support for only one value in each attribute.  We've
> >talked many times of extending it to support multiple values, but so far
> >no one has taken the time to implement it.
> 
> Yes, I realise this is not an insignificant idea ... but ... what about 
> having an option to set noindex_script: true.

Well, there's never any reason to index what's in between <script> and
</script>, because that's what caused the problem in the first place, so
there's no real reason to add an attribute to control this.  It comes down
to either extending noindex_start and noindex_end to allow multiple strings,
or adding extra code to handle <script> and <style> tags in a similar way.
Personally I think extending noindex_start and noindex_end wouldn't be
much harder, and would be more generally useful.

> >Ironically, I felt when getting
> >3.1.6 out that this was less of a priority now that the HTML parser had
> >built-in support for ignoring stuff between <script> and </script> tags.
> 
> Has anyone else been able to replicate this problem? (See below...)

Well, it seem Jim did, or at least he described it well enough that we
know there is a problem and how to test for it.  The key is to put your
JavaScript into an HTML comment like this:

<script language="JavaScript">
<!--

...

// -->
</script>

If you do that, the JavaScript code will still be run by your browser, but
won't cause problems for non-JavaScript-aware clients.  If you don't use
comment delimiters, or you use invalid ones (e.g. an odd number of dashes)
then htdig won't strip out the JavaScript as a comment, and will try to
parse it.

When we fix the <script> tag handling, this shouldn't matter because the
whole script will be stripped out like a comment, whether it's properly
enclosed in delimiters or not.  That doesn't mean other web clients won't
have problems with it, though.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to