Hi!
I use CyberNeko in order to parse HTML pages. I now came across some that does
makes me go nuts since the xpath expression /HTML/HEAD/TITLE returns null (so
does everything except /HTML) despite it is there - really. I even "dumped"
the struture using dom4j's DocumentTreeModel and did so manually, too.
It seems that it does have something to do with the xmlns declaration:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>...
Well, but what does this have to do with my xpath?
Anybody having a clue about this?
-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 -
digital self defense, top technical experts, no vendor pitches,
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
dom4j-user mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dom4j-user