Jérôme Charron wrote:
What you probably mean is something equivalent to Unix strings(1). I
have a plugin that implements this, which I could contribute if there's
interest.
+1
Hmm... running strings on a couple of randomly selected PDFs gives me
content I wouldn't want to search against.
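
For readers who have not used strings(1): it scans a binary file and prints every run of printable characters above a minimum length. The following is only a minimal Python sketch of that idea, not the plugin mentioned above; the function name `extract_strings` and the sample bytes are made up for illustration. It also shows why the output can be noisy for PDFs: structural tokens such as `/Type /Page` come out alongside any readable text.

```python
import re
from typing import List


def extract_strings(data: bytes, min_len: int = 4) -> List[str]:
    """Return runs of at least min_len printable ASCII characters (plus tab).

    Hypothetical helper, roughly mimicking the default behaviour of strings(1).
    """
    # Printable ASCII is 0x20..0x7e; tab is accepted as well.
    pattern = re.compile(rb"[\x20-\x7e\t]{%d,}" % min_len)
    return [match.decode("ascii") for match in pattern.findall(data)]


if __name__ == "__main__":
    # Made-up bytes standing in for a fragment of a binary file.
    sample = b"\x00\x01Hello PDF\x00ab\x02/Type /Page\xff"
    for text in extract_strings(sample):
        print(text)
```

Runs shorter than `min_len` (here `ab`) are dropped, which removes some noise but not the document's own markup.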
--
Sami Siren
What you probably mean is something equivalent to Unix strings(1). I
have a plugin that implements this, which I could contribute if there's
interest.
+1
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/
[EMAIL PROTECTED] wrote:
to index, then we may think of either (a) removing it from the default
+1
-1
This is not the right way. Better to keep parse-text as the default parser,
but do not fall back to parse-text automatically when the custom parser
fails. The custom parser (PDF in this
Andrzej Bialecki <[EMAIL PROTECTED]> schrieb am 03.08.2006 17:19:02:
> Chris Mattmann wrote:
> > Hi Marko,
> >
> > Thanks for your question. Basically it was set up as a sort of "last
> > result" of getting at least * some * information from the PDF file, albeit
> > littered with garbage. If i