Thanks, Cory.

Nick, it maybe helpful to add/update instructions in wiki.

Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


On Tue, Aug 12, 2014 at 4:31 AM, testing1234 <[email protected]> wrote:

> Note.. Step 5 above the last command should be
>
> "sudo make install-langs"
>
>
>
> On Sunday, August 10, 2014 4:32:55 PM UTC-4, testing1234 wrote:
>>
>> I was building based on the guide at - [1] https://code.google.com/p/
>> tesseract-ocr/wiki/TesseractSvnInstallation with no OpenCL
>>
>> *Update:* as I was able to fix this. I'll detail my whole process in
>> case other OSX users need it... I was able to get it to build on OSX 10.9.4
>> from SVN and it is working with some warnings (detailed below). The only
>> difference I can think of is I started from scratch by removing everything
>> (ports and formulas) I had installed using either MacPorts or Homebrew, as
>> well as MacPorts and Homebrew themselves. (This was probably a terribly
>> inexperienced mistake having them both installed at the same time.)
>>
>> Everything built well and without errors this time (Note: I did have
>> warnings, but no errors.).
>>
>> I have tested Tesseract with TIFF (single and multiple pages) and it is
>> working well. It gives me the following error "Warning in pixReadMemTiff:
>> tiff page 25 not found" in which the page # is always the last page of the
>> file, but it doesn't seem to be a problem.
>>
>> PNG files do not seem to work (it outputs two identically named files:
>> one that can't be opened and one that only has the first page)
>>
>> PDF files provide the following error and I can't remember if Leptonica
>> is  supposed to be able to input PDF files or not.
>>
>>
>> Error in fopenReadStream: file not found
>> Error in pixRead: image file not found: %PDF-1.2
>> Image file %PDF-1.2 cannot be read!
>> Error during processing.
>>
>>
>> I can work on these if I find time, but since TIFF is working they aren't
>> a priority.
>>
>> *So here is the process that worked for me.*
>>
>> 1. Open Terminal
>> 2. Install, update, and verify Homebrew by entering the following one at
>> a time:
>>
>> ruby -e "$(curl -fsSL https://raw.github.com/Homebrew/homebrew/go/install
>> )"
>> brew update
>> brew doctor
>>
>>
>> 3. Make sure brew doctor comes back clean
>>
>> 4. Install the tesseract dependencies listed at [1] above again by
>> entering one at a time (Note: I did not need to install aclocal or
>> autoheader from Homebrew as they aren't formulas.).
>>
>>
>> brew install autoconf
>> brew install automake
>> brew install libtool
>> brew install leptonica --with-libtiff
>>
>>
>> 5 .Run the following command (still in Terminal entering one at a time)
>> (again based on the instructions in [1]):
>>
>> svn checkout http://tesseract-ocr.googlecode.com/svn/trunk/ tesseract-ocr
>> cd tesseract-ocr
>> ./autogen.sh
>> ./configure
>> make
>> sudo make install
>> sudo make install-pangs
>>
>>
>> 6. Assuming you don't get any failures or errors, you can then test using
>> the following commands in Terminal (the italics should be change to your
>> docs specific filenames and the filetype you want to output) (Note:
>> Tesseract defaults its output to .TXT files).
>>
>> tesseract *inputfilename*.tiff *outputfilename outputfiletype*
>>
>>
>> For example: "tesseract mytiff.tiff mysearchablepdf pdf"    will make
>> "mytiff.tiff" a searchable pdf with the name "mysearchablepdf.pdf" and save
>> it into whatever location you run the tesseract command from.
>>
>> Hopefully this helps someone else and it may be useful to post it under a
>> different (more searchable post title).
>>
>> Best,
>>
>> Cory
>>
>>
>>
>>
>> On Sunday, August 10, 2014 12:23:04 PM UTC-4, zdenop wrote:
>>>
>>> How are you building tesseract?
>>> According issue tracker[1] there is problem only with OpenCL...
>>>
>>> [1] https://code.google.com/p/tesseract-ocr/issues/detail?id=1272
>>>
>>> Zdenko
>>>
>>>
>>> On Sat, Aug 9, 2014 at 10:28 PM, testing1234 <[email protected]> wrote:
>>>
>>>> When compiling and running "make" I get the following error:
>>>>
>>>> scanutils.cpp:38:14: error: typedef redefinition with different types
>>>> ('long' vs '__darwin_off_t'
>>>>       (aka 'long long'))
>>>> typedef long off_t;
>>>>              ^
>>>> /usr/include/sys/_types/_off_t.h:30:25: note: previous definition is
>>>> here
>>>> typedef __darwin_off_t          off_t;
>>>>                                 ^
>>>> 1 error generated.
>>>> make[2]: *** [scanutils.lo] Error 1
>>>> make[1]: *** [install-recursive] Error 1
>>>> make: *** [install-recursive] Error 1
>>>>
>>>>
>>>> Can anyone help me resolve this?
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>> msgid/tesseract-ocr/4295af62-3fb5-412f-8d23-878707d33af7%
>>>> 40googlegroups.com
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/4295af62-3fb5-412f-8d23-878707d33af7%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>  --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/4fadf9d3-8039-4955-b9f3-a635254c8caa%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/4fadf9d3-8039-4955-b9f3-a635254c8caa%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUXF60Aw-bhvDAn2frWYrCgxP3%2BgZ1_Q3peov9Xk_Hfxw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to