Sure, that's a good idea.

Here's the original PDF: http://courtlistener.com/pdf/2008/05/28/united_states_v._ups_customhouse_brokerage_inc..pdf

If you download that, then run:

convert -depth 4 -density 300 united_states_v._ups_customhouse_brokerage_inc..pdf united_states_v._ups_customhouse_brokerage_inc..pd.tiff

You'll have the same tiff as me, I think. Curious to see what your results are. Thanks for the help.

Mike



On 02/03/2013 02:00 PM, zdenko podobny wrote:
Are you able to generate just one page or small example? Or can you provide step how you create it (so I can create it)?
Tiff could be tricky. E.g. libtiff-4 do not work for me...

Zdenko


On Sun, Feb 3, 2013 at 10:29 PM, Mike Lissner <[email protected] <mailto:[email protected]>> wrote:

    It's about 300MB, unfortunately, but I generate it
    programmatically using imagemagick in a way that's worked in the
    past, so I don't think the tiff file itself is the issue.

    If you're willing to download this monster, I'll post it to
    dropbox. I'd love the help, but I don't think it's the right problem.


    On Sun, Feb 3, 2013 at 1:16 PM, zdenko podobny <[email protected]
    <mailto:[email protected]>> wrote:

        Can you send and example of you tif file?

        Zdenko


        On Sun, Feb 3, 2013 at 10:08 PM, Michael Lissner
        <[email protected]
        <mailto:[email protected]>> wrote:

            I have Ubuntu 12.04, which has tesseract 3.02 and
            leptonica version 1.69.

            I've installed these, and also installed libtiff4 using
            apt-get.

            When I try to process a document, I get:

            ↪ sudo tesseract
            united_states_v._ups_customhouse_brokerage_inc.tif
            united_states_v._ups_customhouse_brokerage_inc -l eng
            Tesseract Open Source OCR Engine v3.02 with Leptonica
            Error in pixReadFromTiffStream: spp not in set {1,3,4}
            Error in pixReadStreamTiff: pix not read
            Error in pixReadStream: tiff: no pix returned
            Error in pixRead: pix not read
            Unsupported image type.


            Which seems baffling to me. I've tried reinstalling
            leptonica, reininstalling the tiff libraries, and
            reinstalling tesseract in the hope that they'd support
            tiffs once reinstalled. So far, nothing is helping.

            I was hoping that Ubuntu 12.04 would support everything i
            needed it to without having to compile from source, but so
            far I've had bad luck. Is there a way to make this work?

            Thanks,

            Mike
-- -- You received this message because you are subscribed to
            the Google
            Groups "tesseract-ocr" group.
            To post to this group, send email to
            [email protected]
            <mailto:[email protected]>
            To unsubscribe from this group, send email to
            [email protected]
            <mailto:tesseract-ocr%[email protected]>
            For more options, visit this group at
            http://groups.google.com/group/tesseract-ocr?hl=en

            ---
            You received this message because you are subscribed to
            the Google Groups "tesseract-ocr" group.
            To unsubscribe from this group and stop receiving emails
            from it, send an email to
            [email protected]
            <mailto:tesseract-ocr%[email protected]>.
            For more options, visit
            https://groups.google.com/groups/opt_out.



-- -- You received this message because you are subscribed to the Google
        Groups "tesseract-ocr" group.
        To post to this group, send email to
        [email protected]
        <mailto:[email protected]>
        To unsubscribe from this group, send email to
        [email protected]
        <mailto:tesseract-ocr%[email protected]>
        For more options, visit this group at
        http://groups.google.com/group/tesseract-ocr?hl=en

        ---
        You received this message because you are subscribed to the
        Google Groups "tesseract-ocr" group.
        To unsubscribe from this group and stop receiving emails from
        it, send an email to
        [email protected]
        <mailto:tesseract-ocr%[email protected]>.
        For more options, visit https://groups.google.com/groups/opt_out.



-- -- You received this message because you are subscribed to the Google
    Groups "tesseract-ocr" group.
    To post to this group, send email to
    [email protected] <mailto:[email protected]>
    To unsubscribe from this group, send email to
    [email protected]
    <mailto:tesseract-ocr%[email protected]>
    For more options, visit this group at
    http://groups.google.com/group/tesseract-ocr?hl=en

    ---
    You received this message because you are subscribed to the Google
    Groups "tesseract-ocr" group.
    To unsubscribe from this group and stop receiving emails from it,
    send an email to [email protected]
    <mailto:tesseract-ocr%[email protected]>.
    For more options, visit https://groups.google.com/groups/opt_out.



--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

---
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.



--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to