Re: HTMLMetaProcessor a bug?

2006-01-10 Thread Jérôme Charron
the following code would fail in case the meta tags are in upper case Node nameNode = attrs.getNamedItem(name); Node equivNode = attrs.getNamedItem(http-equiv); Node contentNode = attrs.getNamedItem(content); This code works well, because Nutch HTML Parser uses Xerces

Re: HTMLMetaProcessor a bug?

2006-01-10 Thread Gal Nitzan
Thanks, I was checking something with the default from jdk... On Tue, 2006-01-10 at 11:06 +0100, Jérôme Charron wrote: the following code would fail in case the meta tags are in upper case Node nameNode = attrs.getNamedItem(name); Node equivNode =

Re: HTMLMetaProcessor a bug?

2006-01-10 Thread Gal Nitzan
Because I needed to add two more fields from the meta tags in the html page I have revised some of the code in HTMLMetaProcessor and in DOMContentUtils. I believe it to be a little more generic than the existing code (look at DOMContentUtils.GetMetaAttributes) and from the sample here from Jérôme

HTMLMetaProcessor a bug?

2006-01-09 Thread Gal Nitzan
Hi, I was going over the code and I noticed the following in class org.apache.nutch.parse.html.HTMLMetaProcessor method getMetaTagsHelper the following code would fail in case the meta tags are in upper case Node nameNode = attrs.getNamedItem(name); Node equivNode =