We are using hocr and pdf outputs as well.
On Thursday, April 6, 2017 at 8:06:27 PM UTC-5, shree wrote:
>
> Normally, for text output, the other config files should not impact.
>
>
>
> - excuse the brevity, sent from mobile
>
> On 07-Apr-2017 2:18 AM, "Mike Hall"
> wrote:
>
Normally, for text output, the other config files should not impact.
- excuse the brevity, sent from mobile
On 07-Apr-2017 2:18 AM, "Mike Hall" wrote:
> Yes, we are using the -psm 6 command line argument. And it was not
> working.
>
> But I figured out the issue.
>
>
Yes, we are using the -psm 6 command line argument. And it was not working.
But I figured out the issue.
Tesseract has a set of config files. Inside several of these config files
(hocr, pdf, tsv, unlv) is the setting *tessedit_pageseg_mode*. This setting
was set to 1 in all the config
Have u tried --psm 6
- excuse the brevity, sent from mobile
On 06-Apr-2017 11:06 PM, "Mike Hall" wrote:
> We have a C# .Net app that is using Tesseract to do Optical Character
> Recognition (OCR) on .tiff files. I've attached a sample tiff file.
>
> We are then
We have a C# .Net app that is using Tesseract to do Optical Character
Recognition (OCR) on .tiff files. I've attached a sample tiff file.
We are then outputting the data to a text file. However, Tesseract is
reading the data in a Vertical fashion. In my example image, it is reading
the tiff
5 matches
Mail list logo