I was building without OpenCL, following the guide at [1]:
[1] https://code.google.com/p/tesseract-ocr/wiki/TesseractSvnInstallation

*Update:* I was able to fix this, so I'll detail my whole process in case 
other OS X users need it. I got Tesseract to build from SVN on OS X 10.9.4, 
and it works with some warnings (detailed below). The only difference I can 
think of is that I started from scratch: I removed everything (ports and 
formulas) I had installed with either MacPorts or Homebrew, as well as 
MacPorts and Homebrew themselves. (Having both installed at the same time 
was probably an inexperienced mistake.)

Everything built without errors this time, although I did get warnings.

I have tested Tesseract with single-page and multi-page TIFF files and it 
works well. It does print the warning "Warning in pixReadMemTiff: tiff page 
25 not found", where the page number is always the last page of the file, 
but this doesn't seem to cause a problem.

PNG files do not seem to work: Tesseract outputs two identically named 
files, one that can't be opened and one that contains only the first page.

PDF files produce the following error. I can't remember whether Leptonica 
is supposed to be able to read PDF files as input.
    

Error in fopenReadStream: file not found
Error in pixRead: image file not found: %PDF-1.2
Image file %PDF-1.2 cannot be read!
Error during processing.
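
Since the error above shows Tesseract choking on a file that begins with 
"%PDF", one possible workaround is to detect PDF inputs before handing them 
to tesseract. This is only a sketch: is_pdf is a helper name I made up, and 
it assumes a head that supports -c.

```shell
# Succeed if the file starts with the PDF magic bytes "%PDF".
# is_pdf is a hypothetical helper, not part of Tesseract or Leptonica.
is_pdf() {
    [ "$(head -c 4 "$1" 2>/dev/null)" = "%PDF" ]
}

# Usage (input.tiff is a placeholder name):
# is_pdf input.tiff && echo "skipping PDF input" >&2
```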


I can work on these if I find time, but since TIFF is working they aren't a 
priority. 

*So here is the process that worked for me.*

1. Open Terminal.
2. Install, update, and verify Homebrew by entering the following commands 
one at a time:

ruby -e "$(curl -fsSL https://raw.github.com/Homebrew/homebrew/go/install)"
brew update
brew doctor

 
3. Make sure brew doctor comes back clean

4. Install the Tesseract dependencies listed at [1] above by entering the 
following one at a time. (Note: I did not need to install aclocal or 
autoheader separately. They aren't Homebrew formulas: aclocal ships with 
automake and autoheader ships with autoconf.)
   

brew install autoconf
brew install automake
brew install libtool
brew install leptonica --with-libtiff
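
Before moving on, it may be worth confirming that the build tools really 
are on your PATH. A minimal sketch: missing_tools is a name I made up, and 
the tool list in the usage line assumes Homebrew's libtool formula installs 
its commands with a "g" prefix (glibtoolize).

```shell
# Print every command from the argument list that is not found on PATH.
# missing_tools is a hypothetical helper name.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Usage: empty output means everything was found.
# missing_tools autoconf automake aclocal autoheader glibtoolize
```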

 
5. Run the following commands (still in Terminal, entering them one at a 
time), again based on the instructions in [1]:

svn checkout http://tesseract-ocr.googlecode.com/svn/trunk/ tesseract-ocr
cd tesseract-ocr
./autogen.sh
./configure
make
sudo make install
sudo make install-langs
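
If you would rather not paste these one at a time, a small wrapper can run 
them in order and stop at the first failure, so a broken configure or make 
is never followed by an install. This is a sketch, not something from the 
guide at [1]; run_steps is a name I made up.

```shell
# Run each command string in order and stop at the first one that fails.
# run_steps is a hypothetical helper name.
run_steps() {
    for step in "$@"; do
        printf '==> %s\n' "$step"
        # eval runs the step in the current shell, so "cd" carries over
        # to the steps that follow it.
        eval "$step" || { printf 'FAILED: %s\n' "$step" >&2; return 1; }
    done
}

# Usage (run from the directory that contains the tesseract-ocr checkout):
# run_steps "cd tesseract-ocr" "./autogen.sh" "./configure" "make" \
#           "sudo make install" "sudo make install-langs"
```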


6. Assuming you don't get any failures or errors, you can then test with 
the following command in Terminal. Change the italicized parts to your 
document's filename and the output file type you want. (Note: Tesseract 
defaults to .txt output.)

tesseract *inputfilename*.tiff *outputfilename outputfiletype*

 
For example, "tesseract mytiff.tiff mysearchablepdf pdf" will turn 
"mytiff.tiff" into a searchable PDF named "mysearchablepdf.pdf" and save it 
in whatever directory you run the tesseract command from.
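
To convert a whole folder of TIFFs the same way, a loop along these lines 
should work. It is only a sketch: ocr_dir_to_pdf and the TESSERACT override 
variable are names I made up, and it assumes the files end in .tiff.

```shell
# OCR every .tiff file in a directory into a searchable PDF next to it.
# TESSERACT is a hypothetical override so the loop can be dry-run with
# "echo" instead of the real binary.
ocr_dir_to_pdf() {
    dir=$1
    for input in "$dir"/*.tiff; do
        [ -e "$input" ] || continue    # no matches: the glob stays literal
        base=${input%.tiff}            # output name without the extension
        "${TESSERACT:-tesseract}" "$input" "$base" pdf || return 1
    done
}

# Usage (scans is a placeholder directory name):
# ocr_dir_to_pdf scans
```

Setting TESSERACT=echo first prints the commands instead of running them, 
which is a cheap way to check the filenames before a long OCR run.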

Hopefully this helps someone else. It may be useful to repost it under a 
different, more searchable title.

Best, 

Cory




On Sunday, August 10, 2014 12:23:04 PM UTC-4, zdenop wrote:
>
> How are you building tesseract?
> According to the issue tracker [1], there is a problem only with OpenCL...
>
> [1] https://code.google.com/p/tesseract-ocr/issues/detail?id=1272
>
> Zdenko
>
>
> On Sat, Aug 9, 2014 at 10:28 PM, testing1234 <[email protected]> wrote:
>
>> When compiling and running "make" I get the following error:
>>
>> scanutils.cpp:38:14: error: typedef redefinition with different types 
>> ('long' vs '__darwin_off_t'
>>       (aka 'long long'))
>> typedef long off_t;
>>              ^
>> /usr/include/sys/_types/_off_t.h:30:25: note: previous definition is here
>> typedef __darwin_off_t          off_t;
>>                                 ^
>> 1 error generated.
>> make[2]: *** [scanutils.lo] Error 1
>> make[1]: *** [install-recursive] Error 1
>> make: *** [install-recursive] Error 1
>>
>>
>> Can anyone help me resolve this?
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/4295af62-3fb5-412f-8d23-878707d33af7%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
