Re: Plucking a huge set of local files

Petr Chyba Tue, 04 Jun 2002 05:41:39 -0700

Hi.

While we are talking about redesigning the Parser, it would be nice to
fix the bug with broken indexes (or whatever is causing it). It was
discussed on this list about 14 days ago. The problem is that the Parser
includes all referenced pages (I can see them on the console), but the
references are broken. The viewer says that the document was not
retrieved; sometimes it gives a URL, sometimes the URL is blank (within
the same document, built without the --no-urlinfo parameter). So I guess
that it sometimes stores a document under the wrong ID (the URL is
shown), or uses a wrong index into the database (blank URL: it can't
find any information about that index).
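
To make the two guesses concrete, here is a rough sketch in Python. It
is only a toy model: the names viewer_message, documents and url_info
are made up for illustration and are not Plucker's real data structures
or API. It assumes the viewer looks a link up by record ID, first among
the stored documents and then in the URL information.

```python
def viewer_message(documents, url_info, link_id):
    """Return what a (hypothetical) viewer would report for a link."""
    if link_id in documents:
        return "ok"
    # The document is missing; show its URL if the ID is known at all.
    url = url_info.get(link_id, "")
    return "not retrieved: " + (url if url else "(blank URL)")


# URL information recorded for IDs 1 and 2 (made-up example data).
url_info = {1: "http://example.com/", 2: "http://example.com/a.html"}

# Guess A: page 2 was fetched, but stored under the wrong ID (5).
documents_a = {1: "home", 5: "page a"}
print(viewer_message(documents_a, url_info, 2))

# Guess B: the link uses a wrong index (9) that nothing knows about.
documents_b = {1: "home", 2: "page a"}
print(viewer_message(documents_b, url_info, 9))
```

In this model, guess A gives a "not retrieved" message with a URL and
guess B gives one with a blank URL, which matches the two symptoms I see
in the same document.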


We are starting a portal that serves customized page content, and we
want to generate the content in Plucker format (with iSilo). This bug is
critical for that: why should users use Plucker if it doesn't work
correctly?

I am halfway through writing my own parser. At this point it generates
valid documents, but it still doesn't parse the HTML tree. The documents
are longer than they should be, because I can't get the compression
implemented correctly.

I am using Linux (RedHat 7.2) with Python 1.5.2. The Parser seems to be
broken from 1.1.13 up to the current CVS version.

Lami

-- 
program, n:
  A magic spell cast over a computer allowing it to turn one's input
  into error messages.  tr.v. To engage in a pastime similar to banging
  one's head against a wall, but with fewer opportunities for reward.
