Sure, that's a good idea.
Here's the original PDF:
http://courtlistener.com/pdf/2008/05/28/united_states_v._ups_customhouse_brokerage_inc..pdf
If you download that, then run:
convert -depth 4 -density 300 united_states_v._ups_customhouse_brokerage_inc..pdf united_states_v._ups_customhouse_brokerage_inc..pd.tiff
You'll have the same tiff as me, I think. Curious to see what your
results are. Thanks for the help.
Mike
On 02/03/2013 02:00 PM, zdenko podobny wrote:
Are you able to generate just one page or a small example? Or can you
provide the steps you used to create it (so I can create it myself)?
Tiff can be tricky. E.g. libtiff-4 does not work for me...
Zdenko
On Sun, Feb 3, 2013 at 10:29 PM, Mike Lissner <[email protected]> wrote:
It's about 300MB, unfortunately, but I generate it
programmatically using imagemagick in a way that's worked in the
past, so I don't think the tiff file itself is the issue.
If you're willing to download this monster, I'll post it to
dropbox. I'd love the help, but I don't think it's the right problem.
On Sun, Feb 3, 2013 at 1:16 PM, zdenko podobny <[email protected]> wrote:
Can you send an example of your tif file?
Zdenko
On Sun, Feb 3, 2013 at 10:08 PM, Michael Lissner <[email protected]> wrote:
I have Ubuntu 12.04, which has tesseract 3.02 and
leptonica version 1.69.
I've installed these, and also installed libtiff4 using
apt-get.
When I try to process a document, I get:
↪ sudo tesseract united_states_v._ups_customhouse_brokerage_inc.tif united_states_v._ups_customhouse_brokerage_inc -l eng
Tesseract Open Source OCR Engine v3.02 with Leptonica
Error in pixReadFromTiffStream: spp not in set {1,3,4}
Error in pixReadStreamTiff: pix not read
Error in pixReadStream: tiff: no pix returned
Error in pixRead: pix not read
Unsupported image type.
Which seems baffling to me. I've tried reinstalling
leptonica, reinstalling the tiff libraries, and
reinstalling tesseract in the hope that they'd support
tiffs once reinstalled. So far, nothing is helping.
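For anyone following along: the "spp" in the first error line is the TIFF
SamplesPerPixel value, and the message says the reader only accepts 1, 3,
or 4. A quick way to see what a given file declares is to read that tag
straight from the TIFF header. Below is a minimal sketch in Python, not
anything from tesseract or leptonica. The function name samples_per_pixel
and the constant names are made up for illustration, and it assumes a
classic (non-BigTIFF) file and only looks at the first page. A value of 2
would mean, for example, grayscale plus an alpha channel, which is one
possible cause here but is not confirmed by anything in this thread.

```python
import struct

SAMPLES_PER_PIXEL_TAG = 277      # TIFF 6.0 tag number for SamplesPerPixel
ACCEPTED_BY_READER = {1, 3, 4}   # taken from the error message above


def samples_per_pixel(fh):
    """Return SamplesPerPixel from the first IFD of a classic TIFF.

    fh is a binary file object positioned at the start of the file.
    Hypothetical helper for illustration; BigTIFF is not handled.
    """
    header = fh.read(8)
    if header[:2] == b"II":
        endian = "<"             # little-endian file
    elif header[:2] == b"MM":
        endian = ">"             # big-endian file
    else:
        raise ValueError("not a TIFF: bad byte-order mark")

    magic, ifd_offset = struct.unpack(endian + "HI", header[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF")

    # The IFD starts with a 2-byte entry count, then 12-byte entries.
    fh.seek(ifd_offset)
    (entry_count,) = struct.unpack(endian + "H", fh.read(2))
    for _ in range(entry_count):
        tag, _field_type, _count, value = struct.unpack(
            endian + "HHI4s", fh.read(12)
        )
        if tag == SAMPLES_PER_PIXEL_TAG:
            # A single SHORT sits in the first two bytes of the value field.
            return struct.unpack(endian + "H", value[:2])[0]

    return 1                     # TIFF 6.0 default when the tag is absent
```

To try it, open the tiff with open(path, "rb"), pass the file object to
samples_per_pixel, and check whether the result is in ACCEPTED_BY_READER.
Only the header and the first directory are read, so the size of the file
should not matter.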
I was hoping that Ubuntu 12.04 would support everything I
needed it to without having to compile from source, but so
far I've had bad luck. Is there a way to make this work?
Thanks,
Mike
--
--
You received this message because you are subscribed to
the Google
Groups "tesseract-ocr" group.
To post to this group, send email to
[email protected]
<mailto:[email protected]>
To unsubscribe from this group, send email to
[email protected]
<mailto:tesseract-ocr%[email protected]>
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
---
You received this message because you are subscribed to
the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails
from it, send an email to
[email protected]
<mailto:tesseract-ocr%[email protected]>.
For more options, visit
https://groups.google.com/groups/opt_out.