[Haskell-cafe] Re: Lazy HTML parsing with HXT, HaXML/polyparse, what else?

2007-05-12 Thread apfelmus
Henning Thielemann wrote: I want to parse and process HTML lazily. I use HXT because the HTML parser is very liberal. However it uses Parsec and is thus strict. HaXML has a so called lazy parser, but it is not what I consider lazy: [...] Note that lazy parsing is inherently difficult and most

Re: [Haskell-cafe] Re: Lazy HTML parsing with HXT, HaXML/polyparse, what else?

2007-05-12 Thread Novák Zoltán
Hi, What results should a lazy parser return before emitting ⊥? At the time you read the html-tag, you cannot know whether a syntax error far down in the file makes it invalid. Thus, you may not return the top-most html-tag until you see the closing /html. But to return the top most html you

Re: [Haskell-cafe] Re: Lazy HTML parsing with HXT, HaXML/polyparse, what else?

2007-05-12 Thread neez
Hi, Hi, What results should a lazy parser return before emitting ⊥? At the time you read the html-tag, you cannot know whether a syntax error far down in the file makes it invalid. Thus, you may not return the top-most html-tag until you see the closing /html. But to return the top