Re: Antwort: Re: parse-plugins.xml

2006-08-04 Thread Sami Siren
Jérôme Charron wrote: What you probably mean is something equivalent to Unix strings(1). I have a plugin that implements this, which I could contribute if there's interest. +1 hmm.. strings on couple of randomply selected pdf gives me content I wouldn't wanna search against. -- Sami Siren

Re: Antwort: Re: parse-plugins.xml

2006-08-04 Thread Jérôme Charron
What you probably mean is something equivalent to Unix strings(1). I have a plugin that implements this, which I could contribute if there's interest. +1 Jérôme -- http://motrech.free.fr/ http://www.frutch.org/

Re: Antwort: Re: parse-plugins.xml

2006-08-04 Thread Andrzej Bialecki
[EMAIL PROTECTED] wrote: to index, then we may think of either (a) removing it from the default +1 -1 This is not the right way. Better keep parse-text as default parser. But do not fall back to parse-text automatically, when the custom parser fails. The custom parser (PDF in this

Antwort: Re: parse-plugins.xml

2006-08-04 Thread marcel . schnippe
Andrzej Bialecki <[EMAIL PROTECTED]> schrieb am 03.08.2006 17:19:02: > Chris Mattmann wrote: > > Hi Marko, > > > >Thanks for your question. Basically it was set up as a sort of "last > > result" of getting at least * some * information from the PDF file, albeit > > littered with garbage. If i