Robert Goff wrote:
> 
> > indexer -am -u http://servername/path/to/pdf/list/
> 
> Ok, I was using -a but not -m -- that seems to have convinced it to
> scan pdf files (or the overnight delay made the timestamps different
> enough).

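-m prevents indexer from sending the If-Modified-Since HTTP header, so
the server returns each document in full instead of answering
"304 Not Modified" for files it considers unchanged.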
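
To illustrate why leaving the header out forces a re-fetch, here is a
minimal sketch of the decision a web server typically makes for a
conditional GET. It is not indexer or server code: the function name
response_status and the example dates are made up for illustration.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime


def response_status(last_modified, request_headers):
    """Return the status a server would typically send for a GET.

    last_modified must be a timezone-aware datetime (hypothetical helper,
    not part of indexer).
    """
    since = request_headers.get("If-Modified-Since")
    if since is None:
        # No conditional header (the situation -m creates): send the full body.
        return 200
    try:
        since_dt = parsedate_to_datetime(since)
    except (TypeError, ValueError):
        # An unparseable date is ignored, so the request counts as unconditional.
        return 200
    # HTTP dates have one-second resolution, so drop microseconds first.
    if last_modified.replace(microsecond=0) <= since_dt:
        return 304  # Not Modified: no body, so nothing gets re-parsed
    return 200


# Invented example values: a PDF last changed before the previous crawl.
pdf_mtime = datetime(2002, 5, 1, 12, 0, 0, tzinfo=timezone.utc)
last_indexed = datetime(2002, 5, 2, 3, 0, 0, tzinfo=timezone.utc)

conditional = {"If-Modified-Since": format_datetime(last_indexed, usegmt=True)}
print(response_status(pdf_mtime, conditional))  # header sent, file unchanged
print(response_status(pdf_mtime, {}))           # header omitted, as with -m
```

With the header present and the file unchanged, the sketch answers 304
and no document body is returned; without the header it always answers
200.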


> 
> Now the pdftotext parser is throwing errors when it's run by the
> indexer, but not when it's run at a command line:
> 
> Indexer[4210]: [1] Starting external parser: '/usr/local/bin/pdftotext
> /var/tmp/tmp.97.ruwthE.in -'
> Error (0): PDF file is damaged - attempting to reconstruct xref
> table...
> Error: Couldn't find trailer dictionary
> Error: Couldn't read xref table
> 
> Indexer[4210]: [1] Starting external parser: '/usr/local/bin/pdftotext
> /var/tmp/tmp.98.tfEX3g.in -'
> Error (0): PDF file is damaged - attempting to reconstruct xref
> table...
> Error: Top-level pages object is wrong type (null)
> Error: Couldn't read page catalog
> 
> etc.
> 
> Can I get it to keep the temporary file so I can try to see if it's bad?
> 


Remove all unlink() calls from parser.c and recompile.
The temporary files will then no longer be deleted after parsing.
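
Once a temporary file has been kept, a quick structural check can show
whether it is truncated or not a PDF at all (for example an HTML error
page). The following is only a rough sketch, not a PDF validator and
not part of indexer: the names pdf_sanity and check_file are
hypothetical, and the 1024-byte windows are a leniency assumption
modelled on how tolerant PDF readers look for the header and the
end-of-file marker.

```python
def pdf_sanity(data):
    """Return a list of problems found in the raw bytes of a supposed PDF."""
    if not data:
        return ["file is empty"]
    problems = []
    # A PDF starts with a "%PDF-" header; allow some leading junk before it.
    if b"%PDF-" not in data[:1024]:
        problems.append("no %PDF- header in the first 1024 bytes")
    tail = data[-1024:]
    # The file should end with "startxref", an offset, and "%%EOF".
    if b"startxref" not in tail:
        problems.append("no startxref keyword in the last 1024 bytes")
    if b"%%EOF" not in tail:
        problems.append("no %%EOF marker in the last 1024 bytes (truncated?)")
    return problems


def check_file(path):
    """Read a kept temporary file and report what pdf_sanity finds."""
    with open(path, "rb") as fh:
        return pdf_sanity(fh.read())
```

For example, check_file("/var/tmp/tmp.97.ruwthE.in") returns an empty
list when the header and trailer markers are both present. A missing
%%EOF or startxref would be consistent with the "Couldn't find trailer
dictionary" errors and would suggest the file was cut short before
pdftotext saw it.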