On Mon, Dec 7, 2009 at 11:48 AM, Ethan Gruber <ewg4x...@gmail.com> wrote: > Ben, another problem with digestibility of the search results is that it's > not XHTML, and therefore not well-formed XML, making it impossible to > process with XPath.
What page did you find that wasn't valid XHTML? The JSON should be easier to process typically (no XPATH required), but I'm still curious. http://validator.w3.org/check?uri=http%3A%2F%2Fid.loc.gov%2Fauthorities%2Fsearch%2F%3Fq%3Dhtml&charset=%28detect+automatically%29&doctype=Inline&group=0 //Ed