Have you tried using Ghostscript to convert the PDF to TIFF files instead?
Example commands:

gs -r600x600 -sDEVICE=tiffg4 -dFirstPage=106 -dLastPage=109 -o ./tulasi/tulasikrishna%00d.tif "TulasiPuja.pdf"

This produces one TIFF per page.
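
As a rough sketch of the next step, each per-page TIFF could then be passed to the tesseract command line in a loop. The helper name ocr_tiffs and the OCR_CMD override are hypothetical, introduced only for illustration; the only tesseract behaviour assumed is the basic "tesseract <image> <outputbase>" form, which writes <outputbase>.txt.

```shell
#!/bin/sh
# Hypothetical helper (not part of Ghostscript or Tesseract):
# run an OCR command on every .tif file in a directory.
# OCR_CMD is an assumed override for dry runs; it defaults to the tesseract CLI.
ocr_tiffs() {
  dir=$1
  for tif in "$dir"/*.tif; do
    [ -e "$tif" ] || continue   # skip if the glob matched nothing
    # tesseract <image> <outputbase> writes <outputbase>.txt
    "${OCR_CMD:-tesseract}" "$tif" "${tif%.tif}"
  done
}

# Usage, once the gs command has written TIFFs into ./tulasi:
#   ocr_tiffs ./tulasi
```

Setting OCR_CMD=echo prints the commands instead of running them, which is a cheap way to check the file list first.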

gs -r600x600 -sDEVICE=tiffg4 -dFirstPage=126 -dLastPage=131 -o ./tulasi/tulasIviShNupUjA.tif "TulasiPuja.pdf"

This produces a single multipage TIFF.

You can reduce the resolution to -r300x300.
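
To give a feel for why the resolution matters when the error complains about image size, here is a small arithmetic sketch. It assumes a US Letter page (612 x 792 points, 72 points per inch); your PDF's page size may differ, and page_pixels is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: pixel size of a rendered page.
# pixels = size in points / 72 * dpi (integer arithmetic)
# $1 = width in points, $2 = height in points, $3 = dpi
page_pixels() {
  echo "$(( $1 * $3 / 72 ))x$(( $2 * $3 / 72 ))"
}

# Assumed page size: US Letter, 612 x 792 points
page_pixels 612 792 600   # prints 5100x6600
page_pixels 612 792 300   # prints 2550x3300
```

Halving the resolution cuts the pixel count to a quarter, so a page with an unusually large media box is much less likely to produce an oversized image.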

ShreeDevi
____________________________________________________________
Bhajan - Kirtan - Aarti @ http://bhajans.ramparivar.com

On Thu, Jun 8, 2017 at 7:25 PM, Hari.K <[email protected]> wrote:

> Hi There,
>
>     I sometimes receive an error - "Failed to create pix, this normally
> occurs because the requested image size is too large, please check Standard
> Error Output" when doing OCR on a bitmap image.
>
>
> The highlighted line below is where it breaks for me:
>
> Bitmap bitmap;
> Spire.Pdf.PdfDocument document = new Spire.Pdf.PdfDocument(pdfPath);
>
> // Page indexes are zero-based, so stop before Pages.Count
> for (int i = 0; i < document.Pages.Count; i++)
> {
>     // 200 is the DPI I am setting for the bitmap image
>     bitmap = (Bitmap)document.SaveAsImage(i, PdfImageType.Bitmap, 200, 200);
>     ...................
>     .................
> }
>
> More details on what I am trying to do here:
> 1) Upload a PDF document of barely 600 KB
> 2) Iterate through each PDF page and convert it into a bitmap image
> 3) Pass this bitmap image to Tesseract to perform OCR
>
> Please note that I don't get this error often. Any ideas on why it occurs,
> given that I do not see it every time?
>
> Looking forward to some input on this.
>
> Thanks in Advance,
> Hari
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/dcfe7918-707b-4b56-9720-b3e39ae1a658%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>
