Hi. While we are talking about redesigning the Parser, it would be nice to fix the bug with broken indexes (or whatever the cause is). It was discussed on this list about 14 days ago. The problem is that the Parser includes all referenced pages (I can see this on the console), but the references are broken. The Viewer says that the document was not retrieved; sometimes it gives the URL and sometimes the URL is blank (within the same document, and without the --no-urlinfo parameter). So my guess is that it sometimes stores a document under the wrong ID (when the URL is shown), or uses a wrong index in the database (when the URL is blank, because it can't find any information about that index).
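To make the two guesses concrete, here is a minimal sketch of a check that tells them apart. It assumes the parser output can be dumped as a plain mapping from record ID to the URL stored under that ID, plus a list of (target ID, expected URL) pairs taken from the hyperlinks. The names `check_links`, `records` and `links` are hypothetical and are not part of Plucker's actual code or API, and the sketch is written for a current Python rather than 1.5.2.

```python
def check_links(records, links):
    """Classify each hyperlink as 'ok', 'wrong ID' or 'missing index'.

    records: dict mapping record ID -> URL stored under that ID (hypothetical dump)
    links:   list of (target_id, expected_url) pairs taken from hyperlinks
    """
    report = []
    for target_id, expected_url in links:
        if target_id not in records:
            # Nothing is known about this index: the viewer would show a blank URL.
            status = "missing index"
        elif records[target_id] != expected_url:
            # The index exists but holds a different page: stored under the wrong ID.
            status = "wrong ID"
        else:
            status = "ok"
        report.append((target_id, expected_url, status))
    return report


if __name__ == "__main__":
    # Made-up example data, only to show the three outcomes.
    records = {2: "http://example.com/a", 3: "http://example.com/b"}
    links = [
        (2, "http://example.com/a"),
        (3, "http://example.com/a"),
        (7, "http://example.com/c"),
    ]
    for row in check_links(records, links):
        print(row)
```

If most failures come out as "missing index", the link targets are probably being numbered differently from the stored records; if they come out as "wrong ID", the records themselves are probably being written under the wrong number.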
We are starting a portal that serves customized page content, and we want to generate that content in Plucker format (with iSilo). This bug is critical for us: why should users use Plucker if it doesn't work correctly? I am halfway through writing my own parser. At this point it generates valid documents, but it does not yet parse the HTML tree. The documents are also longer, because I can't get compression implemented correctly. I am using Linux (RedHat 7.2) and Python 1.5.2. The Parser seems to have been broken from 1.1.13 up to the current CVS version.

Lami

--
program, n: A magic spell cast over a computer allowing it to turn one's input into error messages.
tr.v. To engage in a pastime similar to banging one's head against a wall, but with fewer opportunities for reward.
