Re: [tesseract-ocr] VietOCR v6.3.0 & VietOCR.NET v6.3.0 Releases

2022-07-08 Thread JB Data31
 @*JB*Δ Le ven. 8 juil. 2022 à 20:08, Quan Nguyen a écrit : > A Java/.NET WPF GUI frontend for Tesseract OCR engine. The releases > include the following improvements: > >- Upgrade to Tesseract 5.2.0 > > http://vietocr.sf.net > > -- > You received this message

Re: [tesseract-ocr] Running Tesseract 5 on Linux

2022-04-05 Thread JB Data31
To avoid difficulties, compile from src is a proper way on ubuntu. @*JB*Δ Le mar. 5 avr. 2022 à 18:26, Zdenko Podobny a écrit : > Ubuntu has its policy for accepting the new version of packages. > If https://notesalexp.org/tesseract-ocr/html/ or >

Re: [tesseract-ocr] Large app size of a tesseract app on Android

2020-07-15 Thread JB Data31
To see details, you can *unzip -v* the *apk* file. @*JB*Δ Le mer. 15 juil. 2020 à 10:01, Kunal Singh a écrit : > Hi, > > I am using Tesseract 4 in an android/ios app. The OCR part is working > fine. But the app has a large installed size on the device

Re: [tesseract-ocr] Tesseract 4 + OpenCL?

2020-03-10 Thread JB Data31
A recent successful build with V5 . I think the documentation is more for V5 and less for V3. @*JB*Δ Le lun. 9 mars 2020 à 23:37, Matt Chapman

Re: [tesseract-ocr] Can tesseract be used to read a PDF and OCR it to text?

2020-01-14 Thread JB Data31
OCRmyPDF do the job. Linux native, but windows available : https://ocrmypdf.readthedocs.io/en/latest/installation.html#installing-on-windows. 2020-01-13 7:49 UTC+01:00, 'pjfarley3' via tesseract-ocr : > > > On Sunday, January 12, 2020 at 8:52:51 PM UTC-5, shree wrote: >> >> Tesseract reads only

Re: [tesseract-ocr] Re: Tesseract 4.1.0 released

2019-08-14 Thread JB Data31
3 août 2019 à 12:22, Alex Cohn a écrit : > Oh, I now understand the problem. You need git clone --recurse-submodules. > To add the missing submodules after *clone*, > > git submodule init > git submodule update > > > BR, > Alex > > On Tuesday, August 13, 2019 at

Re: [tesseract-ocr] Re: Tesseract 4.1.0 released

2019-08-13 Thread JB Data31
ss-two/obj': No such file or directory Branch : 4.1, no *tess-two/ob*j dir, created no changes. Le lundi 5 août 2019 07:00:22 UTC, Alex Cohn a écrit : > > On Monday, August 5, 2019 at 8:21:32 AM UTC+3, JB Data31 wrote: >> >> https://github.com/tesseract-ocr/tesseract/wiki/Compiling#and

Re: [tesseract-ocr] Re: Tesseract 4.1.0 released

2019-08-04 Thread JB Data31
', needed by > `obj/local/arm64-v8a/objs/tesseract/api/tesseractmain.o'. Stop.* > make: Leaving directory `.../tess-two-git/tess-two' > $ > ? @*JB*Δ <http://jbigdata.fr/jbweb2/> Le ven. 2 août 2019 à 13:05, Alex Cohn a écrit : > > On Friday, August 2, 2019 at 8:52:02

Re: [tesseract-ocr] Re: Tesseract 4.1.0 released

2019-08-01 Thread JB Data31
> > *$ git clone https://github.com/alexcohn/tess-two.git > tess-two-git* > Cloning into 'tess-two-git'... > ... > *$ ndk-build -C tess-two-git/tess-two tesseract APP_ABI=arm64-v8a > APP_PLATFORM=android-24* > Android NDK: WARNING: APP_PLATFORM android-24

Re: [tesseract-ocr] issue #1393: Android NDK: LOCAL_MODULE definition in jni/Android.mk must not contain space

2019-07-01 Thread JB Data31
that building static tesseract library including >> dependencies is much more difficult, so maybe documenting your progress >> could be useful for a lot of people. >> >> >> Zdenko >> >> >> pi 21. 6. 2019 o 16:21 JB Data31 napísal(a): >> >>&g

Re: [tesseract-ocr] open CV : To Convert Image PDF to Searchable PDF

2019-05-19 Thread JB Data31
OCRmyPDF is an all-in-one tool that do the job. @*JB*Δ Le ven. 17 mai 2019 à 10:56, Zdenko Podobny a écrit : > First you need to convert pdf to image(s). There are several tools for it. > Than you

Re: [tesseract-ocr] Re: configure: error: Required OpenCL library not found!

2019-05-01 Thread JB Data31
If my memories are goods you have to combine 2 options : -L -l <"the name" of the lib>, "the name" because -lxxx correspond to a file libxxx.a or libxxx.so, depend on static or dynamic file. -lOpenCL correspond to a file libopencl.[a,so] , did you do the case-insensitive command *find / -iname

Re: [tesseract-ocr] Bazaar & eng.user-patterns not doing anything?

2019-04-26 Thread JB Data31
Image processing can improve the result, but this typo is very particular, i.e. "unconnected" digit. I try morphological transformations to re-connect digit, better (*tesseract ocr_inv-8.png ocr-8 --psm 6*) , but yet far away a proper result. According to me, train *tesseract* with this typo is a