Hi!

I use CyberNeko in order to parse HTML pages. I now came across some that does 
makes me go nuts since the xpath expression /HTML/HEAD/TITLE returns null (so 
does everything except /HTML)  despite it is there - really. I even "dumped" 
the struture using dom4j's DocumentTreeModel and did so manually, too.

It seems that it does have something to do with the xmlns declaration:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
    "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd";>

<html xmlns="http://www.w3.org/1999/xhtml";>
  <head>
    <title>...

Well, but what does this have to do with my xpath?

Anybody having a clue about this?


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
dom4j-user mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dom4j-user

Reply via email to