commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2020-03-27 21:57:30 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new.3160 (New) Package is "tesseract-ocr" Fri Mar 27 21:57:30 2020 rev:10 rq:788742 version:4.1.1 Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2020-01-03 17:39:02.251358902 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.3160/tesseract-ocr.changes 2020-03-27 21:57:54.958814340 +0100 @@ -1,0 +2,7 @@ +Thu Mar 26 20:33:55 UTC 2020 - Bernhard Wiedemann + +- Add tesseract-ocr-no-cpudetection.patch + to avoid crashing on older CPUs + and to make package build reproducible (boo#1159231) + +--- New: tesseract-ocr-no-cpudetection.patch Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.H1VtrM/_old 2020-03-27 21:57:56.686815345 +0100 +++ /var/tmp/diff_new_pack.H1VtrM/_new 2020-03-27 21:57:56.690815348 +0100 @@ -24,6 +24,8 @@ License:Apache-2.0 AND GPL-2.0-or-later URL:https://github.com/tesseract-ocr/tesseract Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz +# PATCH-FIX-OPENSUSE -- boo#1159231 +Patch0: tesseract-ocr-no-cpudetection.patch BuildRequires: asciidoc BuildRequires: autoconf BuildRequires: automake @@ -70,7 +72,7 @@ open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google. %prep -%autosetup -n tesseract-%{version} +%autosetup -n tesseract-%{version} -p1 %build autoreconf -fiv ++ tesseract-ocr-no-cpudetection.patch ++ Author: Bernhard M. Wiedemann Date: 2020-03-26 https://bugzilla.opensuse.org/show_bug.cgi?id=1159231 avoid crashes on machines that are older than the random build machine and make package builds reproducible. See https://reproducible-builds.org/ for why this is good. Index: tesseract-4.1.1/src/arch/Makefile.am === --- tesseract-4.1.1.orig/src/arch/Makefile.am +++ tesseract-4.1.1/src/arch/Makefile.am @@ -47,9 +47,6 @@ libtesseract_sse_la_CXXFLAGS = -msse4.1 endif libtesseract_native_la_CXXFLAGS = -O3 -ffast-math -if MARCH_NATIVE_OPT -libtesseract_native_la_CXXFLAGS += -march=native -mtune=native -endif libtesseract_native_la_SOURCES = dotproduct.cpp libtesseract_arch_la_SOURCES = intsimdmatrix.cpp simddetect.cpp
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2020-01-03 17:38:36 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675 (New) Package is "tesseract-ocr" Fri Jan 3 17:38:36 2020 rev:9 rq:760613 version:4.1.1 Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2019-12-27 14:01:07.568882878 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675/tesseract-ocr.changes 2020-01-03 17:39:02.251358902 +0100 @@ -1,0 +2,5 @@ +Fri Jan 3 10:38:08 UTC 2020 - Tomáš Chvátal + +- Require libarchive in the devel package + +--- Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.TIGZyd/_old 2020-01-03 17:39:02.951359262 +0100 +++ /var/tmp/diff_new_pack.TIGZyd/_new 2020-01-03 17:39:02.951359262 +0100 @@ -1,7 +1,7 @@ # # spec file for package tesseract-ocr # -# Copyright (c) 2019 SUSE LLC +# Copyright (c) 2020 SUSE LLC # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -22,7 +22,6 @@ Release:0 Summary:Open Source OCR Engine License:Apache-2.0 AND GPL-2.0-or-later -Group: Productivity/Graphics/Other URL:https://github.com/tesseract-ocr/tesseract Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz BuildRequires: asciidoc @@ -54,9 +53,9 @@ %package devel Summary:Tesseract Open Source OCR Engine Development files -Group: Development/Libraries/Other -Requires: liblept-devel Requires: libtesseract%{so_ver} = %{version} +Requires: pkgconfig(lept) >= 1.74 +Requires: pkgconfig(libarchive) %description devel This package contains development files for the Tesseract Open Source OCR @@ -64,7 +63,6 @@ %package -n libtesseract%{so_ver} Summary:Open Source OCR Engine -Group: System/Libraries %description -n libtesseract%{so_ver} A commercial quality OCR engine originally developed at HP between 1985 and
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2019-12-27 14:01:05 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675 (New) Package is "tesseract-ocr" Fri Dec 27 14:01:05 2019 rev:8 rq:759653 version:4.1.1 Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2019-12-13 12:06:05.133363375 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675/tesseract-ocr.changes 2019-12-27 14:01:07.568882878 +0100 @@ -1,0 +2,6 @@ +Fri Dec 27 08:59:17 UTC 2019 - Ismail Dönmez + +- Update to version 4.1.1 + * Bugfixes + +--- Old: tesseract-ocr-4.1.0.tar.gz New: tesseract-ocr-4.1.1.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.TvoV6Q/_old 2019-12-27 14:01:08.100883125 +0100 +++ /var/tmp/diff_new_pack.TvoV6Q/_new 2019-12-27 14:01:08.104883127 +0100 @@ -18,7 +18,7 @@ %define so_ver 4 Name: tesseract-ocr -Version:4.1.0 +Version:4.1.1 Release:0 Summary:Open Source OCR Engine License:Apache-2.0 AND GPL-2.0-or-later ++ tesseract-ocr-4.1.0.tar.gz -> tesseract-ocr-4.1.1.tar.gz ++ 59413 lines of diff (skipped)
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2019-12-13 12:05:10 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new.4691 (New) Package is "tesseract-ocr" Fri Dec 13 12:05:10 2019 rev:7 rq:756765 version:4.1.0 Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2018-02-21 14:11:47.061513395 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.4691/tesseract-ocr.changes 2019-12-13 12:06:05.133363375 +0100 @@ -1,0 +2,19 @@ +Fri Dec 13 09:03:19 UTC 2019 - Martin Pluskal + +- Packaging Cleanups +- Update dependencies and enable openCL + +--- +Fri Dec 13 03:01:02 UTC 2019 - hiwatari.se...@gmail.com + +- Update to 4.1.0 + * Added a new output option formatted in the ALTO standard + * SIMD optimization + * Bugfixes +- Update to 4.0.0 + * New OCR engine based on LSTMs + * Removed Cube OCR engine + * Updated build system + * Cleanups and fixes + +--- Old: tesseract-ocr-3.05.01.tar.gz New: tesseract-ocr-4.1.0.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.kBRd5Y/_old 2019-12-13 12:06:05.785363129 +0100 +++ /var/tmp/diff_new_pack.kBRd5Y/_new 2019-12-13 12:06:05.789363127 +0100 @@ -1,7 +1,7 @@ # # spec file for package tesseract-ocr # -# Copyright (c) 2018 SUSE LINUX GmbH, Nuernberg, Germany. +# Copyright (c) 2019 SUSE LLC # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -12,31 +12,39 @@ # license that conforms to the Open Source Definition (Version 1.9) # published by the Open Source Initiative. -# Please submit bugfixes or comments via http://bugs.opensuse.org/ +# Please submit bugfixes or comments via https://bugs.opensuse.org/ # -%define so_ver 3 +%define so_ver 4 Name: tesseract-ocr -Version:3.05.01 +Version:4.1.0 Release:0 Summary:Open Source OCR Engine -License:Apache-2.0 AND GPL-2.0+ +License:Apache-2.0 AND GPL-2.0-or-later Group: Productivity/Graphics/Other -Url:https://github.com/tesseract-ocr/tesseract +URL:https://github.com/tesseract-ocr/tesseract Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz +BuildRequires: asciidoc BuildRequires: autoconf BuildRequires: automake -BuildRequires: cairo-devel BuildRequires: doxygen BuildRequires: fdupes BuildRequires: gcc-c++ -BuildRequires: libicu-devel -BuildRequires: liblept-devel >= 1.74 BuildRequires: libtool -BuildRequires: m4 -BuildRequires: pango-devel -BuildRequires: pkgconfig +BuildRequires: libxslt-tools +BuildRequires: opencl-headers +BuildRequires: pkgconfig >= 0.9.0 +BuildRequires: pkgconfig(OpenCL) +BuildRequires: pkgconfig(cairo) +BuildRequires: pkgconfig(fontconfig) +BuildRequires: pkgconfig(icu-i18n) >= 52.1 +BuildRequires: pkgconfig(icu-uc) >= 52.1 +BuildRequires: pkgconfig(lept) >= 1.74 +BuildRequires: pkgconfig(libarchive) +BuildRequires: pkgconfig(pango) >= 1.22.0 +BuildRequires: pkgconfig(pangocairo) >= 1.22.0 +BuildRequires: pkgconfig(pangoft2) >= 1.22.0 Recommends: tesseract-ocr-traineddata-english %description @@ -64,22 +72,17 @@ open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google. %prep -%setup -q -n tesseract-%{version} -autoreconf -fi - -# Remove build time references so build-compare can do its work -echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile +%autosetup -n tesseract-%{version} %build -export CXXFLAGS="%{optflags} -fno-strict-aliasing -fPIC" -%configure --disable-static -make %{?_smp_mflags} -make training %{?_smp_mflags} -make doc %{?_smp_mflags} +autoreconf -fiv +%configure \ + --enable-opencl \ + --disable-static +%make_build all training doc %install -%make_install -make DESTDIR=%{buildroot} training-install +%make_install all training-install # Remove libtool config files rm -f %{buildroot}%{_libdir}/libtesseract.la @@ -90,6 +93,9 @@ # Fix rpmlint warning "doc-file-dependency" rm -f %{buildroot}%{_defaultdocdir}/%{name}-devel/html/installdox +# Fix rpmlint warning "non-executable-in-bin" +chmod 0755 %{buildroot}%{_bindir}/tesstrain_utils.sh + # Fix rpmlint warning "files-duplicate" %fdupes -s %{buildroot} @@ -97,14 +103,15 @@ %postun -n libtesseract%{so_ver} -p /sbin/ldconfig %files -%doc AUTHORS COPYING ChangeLog README.md +%doc AUTHORS ChangeLog README.md +%license LICENSE
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2018-02-21 14:11:44 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Wed Feb 21 14:11:44 2018 rev:6 rq:578379 version:3.05.01 Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2017-02-21 13:49:49.756754009 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2018-02-21 14:11:47.061513395 +0100 @@ -1,0 +2,10 @@ +Tue Feb 20 13:38:34 UTC 2018 - jweberho...@weberhofer.at + +- Update to 3.05.01 + * Fixed several build issues + * Fixed C-API + * Backport pdfrenderer changes + * Code clean up +- Spec file cleaned up + +--- Old: tesseract-ocr-3.05.00.tar.gz New: tesseract-ocr-3.05.01.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.Bdx8xc/_old 2018-02-21 14:11:48.249470614 +0100 +++ /var/tmp/diff_new_pack.Bdx8xc/_new 2018-02-21 14:11:48.253470470 +0100 @@ -1,7 +1,7 @@ # # spec file for package tesseract-ocr # -# Copyright (c) 2017 SUSE LINUX GmbH, Nuernberg, Germany. +# Copyright (c) 2018 SUSE LINUX GmbH, Nuernberg, Germany. # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -17,12 +17,11 @@ %define so_ver 3 - Name: tesseract-ocr -Version:3.05.00 +Version:3.05.01 Release:0 Summary:Open Source OCR Engine -License:Apache-2.0 and GPL-2.0+ +License:Apache-2.0 AND GPL-2.0+ Group: Productivity/Graphics/Other Url:https://github.com/tesseract-ocr/tesseract Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz @@ -37,9 +36,8 @@ BuildRequires: libtool BuildRequires: m4 BuildRequires: pango-devel -BuildRequires: pkg-config +BuildRequires: pkgconfig Recommends: tesseract-ocr-traineddata-english -BuildRoot: %{_tmppath}/%{name}-%{version}-build %description A commercial quality OCR engine originally developed at HP between 1985 and @@ -96,11 +94,9 @@ %fdupes -s %{buildroot} %post -n libtesseract%{so_ver} -p /sbin/ldconfig - %postun -n libtesseract%{so_ver} -p /sbin/ldconfig %files -%defattr(-,root,root,-) %doc AUTHORS COPYING ChangeLog README.md %{_bindir}/* %dir %{_datadir}/tessdata @@ -111,14 +107,12 @@ %{_mandir}/man5/*.5%{ext_man} %files devel -%defattr(-,root,root,-) %doc %{_defaultdocdir}/tesseract-ocr-devel/ %{_includedir}/tesseract/ %{_libdir}/libtesseract*.so %{_libdir}/pkgconfig/*.pc %files -n libtesseract%{so_ver} -%defattr(-,root,root,-) %{_libdir}/libtesseract.so.%{so_ver}* %changelog ++ tesseract-ocr-3.05.00.tar.gz -> tesseract-ocr-3.05.01.tar.gz ++ 5910 lines of diff (skipped)
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2017-02-21 13:49:48 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2016-02-22 08:58:05.0 +0100 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2017-02-21 13:49:49.756754009 +0100 @@ -1,0 +2,20 @@ +Fri Feb 17 14:39:58 UTC 2017 - idon...@suse.com + +- Update to 3.05.00 + * Made some fine tuning to the hOCR output. + * Added TSV as another optional output format. + * Fixed ABI break introduced in 3.04.00 with the AnalyseLayout() +method. + * text2image tool - Enable all OpenType ligatures available in +a font. This feature requires Pango 1.38 or newer. + * Training tools - Replaced asserts with tprintf() and exit(1). + * Improved multipage tiff processing. + * Improved the embedded pdf font (pdf.ttf). + * Enable selection of OCR engine mode from command line. + * Changed tesseract command line parameter '-psm' to '--psm'. + * Added new C API for orientation and script detection, removed +the old one. + * Fixed many compiler warning. + * Fixed memory and resource leaks. + +--- Old: 3.04.01.tar.gz New: tesseract-ocr-3.05.00.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.yGuLTK/_old 2017-02-21 13:49:50.512647626 +0100 +++ /var/tmp/diff_new_pack.yGuLTK/_new 2017-02-21 13:49:50.516647064 +0100 @@ -1,7 +1,7 @@ # # spec file for package tesseract-ocr # -# Copyright (c) 2016 SUSE LINUX GmbH, Nuernberg, Germany. +# Copyright (c) 2017 SUSE LINUX GmbH, Nuernberg, Germany. # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -19,19 +19,23 @@ %define so_ver 3 Name: tesseract-ocr -Version:3.04.01 +Version:3.05.00 Release:0 Summary:Open Source OCR Engine License:Apache-2.0 and GPL-2.0+ Group: Productivity/Graphics/Other Url:https://github.com/tesseract-ocr/tesseract -Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz +Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz +BuildRequires: autoconf +BuildRequires: automake BuildRequires: cairo-devel BuildRequires: doxygen BuildRequires: fdupes BuildRequires: gcc-c++ BuildRequires: libicu-devel -BuildRequires: liblept-devel >= 1.71 +BuildRequires: liblept-devel >= 1.74 +BuildRequires: libtool +BuildRequires: m4 BuildRequires: pango-devel BuildRequires: pkg-config Recommends: tesseract-ocr-traineddata-english @@ -63,6 +67,7 @@ %prep %setup -q -n tesseract-%{version} +autoreconf -fi # Remove build time references so build-compare can do its work echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile @@ -96,7 +101,7 @@ %files %defattr(-,root,root,-) -%doc AUTHORS COPYING ChangeLog README.md ReleaseNotes +%doc AUTHORS COPYING ChangeLog README.md %{_bindir}/* %dir %{_datadir}/tessdata %{_datadir}/tessdata/configs/
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2016-02-22 08:58:00 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2015-10-06 13:26:56.0 +0200 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2016-02-22 08:58:05.0 +0100 @@ -1,0 +2,6 @@ +Fri Feb 19 11:44:19 UTC 2016 - idon...@suse.com + +- Update to 3.04.01 + * No changelog upstream + +--- Old: 3.04.00.tar.gz New: 3.04.01.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.cCYemk/_old 2016-02-22 08:58:06.0 +0100 +++ /var/tmp/diff_new_pack.cCYemk/_new 2016-02-22 08:58:06.0 +0100 @@ -1,7 +1,7 @@ # # spec file for package tesseract-ocr # -# Copyright (c) 2015 SUSE LINUX GmbH, Nuernberg, Germany. +# Copyright (c) 2016 SUSE LINUX GmbH, Nuernberg, Germany. # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -19,19 +19,19 @@ %define so_ver 3 Name: tesseract-ocr -Version:3.04.00 +Version:3.04.01 Release:0 Summary:Open Source OCR Engine License:Apache-2.0 and GPL-2.0+ Group: Productivity/Graphics/Other -Url:http://code.google.com/p/tesseract-ocr/ +Url:https://github.com/tesseract-ocr/tesseract Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz BuildRequires: cairo-devel BuildRequires: doxygen BuildRequires: fdupes BuildRequires: gcc-c++ BuildRequires: libicu-devel -BuildRequires: liblept-devel +BuildRequires: liblept-devel >= 1.71 BuildRequires: pango-devel BuildRequires: pkg-config Recommends: tesseract-ocr-traineddata-english @@ -96,7 +96,7 @@ %files %defattr(-,root,root,-) -%doc AUTHORS COPYING ChangeLog README ReleaseNotes +%doc AUTHORS COPYING ChangeLog README.md ReleaseNotes %{_bindir}/* %dir %{_datadir}/tessdata %{_datadir}/tessdata/configs/ ++ 3.04.00.tar.gz -> 3.04.01.tar.gz ++ 26220 lines of diff (skipped)
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2015-10-06 13:26:54 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2015-09-19 06:55:04.0 +0200 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2015-10-06 13:26:56.0 +0200 @@ -1,0 +2,31 @@ +Fri Oct 2 21:46:07 UTC 2015 - asterios.dra...@gmail.com + +- Update to version 3.04.00: + * Added OpenCL support (experimental). + * Many bug fixes. + From version 3.03.00: + * Added new training tool text2image to generate box/tif file +pairs from text and truetype fonts. + * Added support for PDF output with searchable text. + * Removed entire IMAGE class and all code in image directory. + * Tesseract executable: support for output to stdout; limited +support for one page images from stdin (especially on Windows) + * Added Renderer to API to allow document-level processing and +output of document formats, like hOCR, PDF. + * Major refactor of word-level recognition, beam search, +eliminating dead code. + * Refactored classifier to make it easier to add new ones. + * Generalized feature extractor to allow feature extraction from +greyscale. + * Improved sub/superscript treatment. + * Improved baseline fit. + * Added set_unicharset_properties to training tools. + * Many bug fixes. + * More training source data included. +- Added new build requirements cairo-devel, doxygen, libicu-devel + and pango-devel. +- Recommend tesseract-ocr-traineddata-english instead of + tesseract-ocr-traineddata-american (based on new (3.04.00) + tesseract-ocr traineddata files). + +--- Old: tesseract-ocr-3.02.02-doc-html.tar.gz tesseract-ocr-3.02.02.tar.gz New: 3.04.00.tar.gz Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.zUGrrc/_old 2015-10-06 13:26:57.0 +0200 +++ /var/tmp/diff_new_pack.zUGrrc/_new 2015-10-06 13:26:57.0 +0200 @@ -19,19 +19,22 @@ %define so_ver 3 Name: tesseract-ocr -Version:3.02.02 +Version:3.04.00 Release:0 Summary:Open Source OCR Engine License:Apache-2.0 and GPL-2.0+ Group: Productivity/Graphics/Other Url:http://code.google.com/p/tesseract-ocr/ -Source0: http://tesseract-ocr.googlecode.com/files/%{name}-%{version}.tar.gz -Source1: http://tesseract-ocr.googlecode.com/files/%{name}-%{version}-doc-html.tar.gz +Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz +BuildRequires: cairo-devel +BuildRequires: doxygen BuildRequires: fdupes BuildRequires: gcc-c++ +BuildRequires: libicu-devel BuildRequires: liblept-devel +BuildRequires: pango-devel BuildRequires: pkg-config -Recommends: tesseract-ocr-traineddata-american +Recommends: tesseract-ocr-traineddata-english BuildRoot: %{_tmppath}/%{name}-%{version}-build %description @@ -59,15 +62,21 @@ open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google. %prep -%setup -q -b1 -n %{name} +%setup -q -n tesseract-%{version} + +# Remove build time references so build-compare can do its work +echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile %build export CXXFLAGS="%{optflags} -fno-strict-aliasing -fPIC" %configure --disable-static make %{?_smp_mflags} +make training %{?_smp_mflags} +make doc %{?_smp_mflags} %install %make_install +make DESTDIR=%{buildroot} training-install # Remove libtool config files rm -f %{buildroot}%{_libdir}/libtesseract.la @@ -92,6 +101,7 @@ %dir %{_datadir}/tessdata %{_datadir}/tessdata/configs/ %{_datadir}/tessdata/tessconfigs/ +%{_datadir}/tessdata/pdf.ttf %{_mandir}/man1/*.1%{ext_man} %{_mandir}/man5/*.5%{ext_man} ++ tesseract-ocr-3.02.02-doc-html.tar.gz -> 3.04.00.tar.gz ++ /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr-3.02.02-doc-html.tar.gz /work/SRC/openSUSE:Factory/.tesseract-ocr.new/3.04.00.tar.gz differ: char 5, line 1
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2015-09-19 06:55:03 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Changes: --- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2015-09-02 00:36:41.0 +0200 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2015-09-19 06:55:04.0 +0200 @@ -1,0 +2,5 @@ +Mon Sep 14 18:22:13 UTC 2015 - asterios.dra...@gmail.com + +- Fix Recommends: entry to tesseract-ocr-traineddata-american. + +--- Other differences: -- ++ tesseract-ocr.spec ++ --- /var/tmp/diff_new_pack.oInRqt/_old 2015-09-19 06:55:05.0 +0200 +++ /var/tmp/diff_new_pack.oInRqt/_new 2015-09-19 06:55:05.0 +0200 @@ -31,7 +31,7 @@ BuildRequires: gcc-c++ BuildRequires: liblept-devel BuildRequires: pkg-config -Recommends: tesseract-traineddata-american +Recommends: tesseract-ocr-traineddata-american BuildRoot: %{_tmppath}/%{name}-%{version}-build %description
commit tesseract-ocr for openSUSE:Factory
Hello community, here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2015-09-02 00:36:39 Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old) and /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New) Package is "tesseract-ocr" Changes: New Changes file: --- /dev/null 2015-08-24 19:43:32.284261900 +0200 +++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 2015-09-02 00:36:41.0 +0200 @@ -0,0 +1,108 @@ +--- +Sat Jun 20 10:27:29 UTC 2015 - mailaen...@opensuse.org + +- rename to match upstream tarball and fix boo#900303 + +--- +Sat Jun 22 20:00:58 UTC 2013 - asterios.dra...@gmail.com + +- Split library into separate package (libtesseract3). +- Removed debuginfo package (not needed). +- There is no need anymore to regenerate the build system (removed automake and + libtool build requirements). +- Added pkg-config build requirement (fix for rpmlint error + "no-pkg-config-provides"). Removed also not needed + "Provides: pkgconfig(%{name})" entry. + +--- +Mon May 6 11:33:54 UTC 2013 - idon...@suse.com + +- Update license, some files are GPL-2.0+ licensed + +--- +Mon Oct 29 11:36:22 UTC 2012 - j...@suse.com + +- Update to version 3.02.02 + * untested +- Notable features: + * Hebrew with BiDi support. + * More languages. +- removed upstreamed patch0 + +--- +Mon Jun 25 18:35:52 UTC 2012 - asterios.dra...@gmail.com + +- Update to version 3.01: + * Removed old/dead serialise/deserialze methods on *LISTIZED classes. + * Total rewrite of DENORM to better encapsulate operation and make +for potential to extract features from images. + * Thread-safety! Moved all critical globals and statics to +members of the appropriate class. Tesseract is now +thread-safe (multiple instances can be used in parallel +in multiple threads.) with the minor exception that some +control parameters are still global and affect all threads. + * Added Cube, a new recognizer for Arabic. Cube can also be +used in combination with normal Tesseract for other languages +with an improvement in accuracy at the cost of (much) lower speed. +There is no training module for Cube yet. + * OcrEngineMode in Init replaces AccuracyVSpeed to control cube. + * Greatly improved segmentation search with consequent accuracy and +speed improvements, especially for Chinese. + * Added PageIterator and ResultIterator as cleaner ways to get the +full results out of Tesseract, that are not currently provided +by any of the TessBaseAPI::Get* methods. +All other methods, such as the ETEXT_STRUCT in particular are +deprecated and will be deleted in the future. + * ApplyBoxes totally rewritten to make training easier. +It can now cope with touching/overlapping training characters, +and a new boxfile format allows word boxes instead of character +boxes, BUT to use that you have to have already boostrapped the +language with character boxes. "Cyclic dependency" on traineddata. + * Auto orientation and script detection added to page layout analysis. + * Deleted *lots* of dead code. + * Fixxht module replaced with scalable data-driven module. + * Output font characteristics accuracy improved. + * Removed the double conversion at each classification. + * Upgraded oldest structs to be classes and deprecated PBLOB. + * Removed non-deterministic baseline fit. + * Added fixed length dawgs for Chinese. + * Handling of vertical text improved. + * Handling of leader dots improved. + * Table detection greatly improved. +- Removed the various languages traineddata subpackages (to be included in a + separate package "tesseract-traineddata"). +- Changed License to Apache-2.0 (SPDX style). +- Removed libtiff-devel build dependency (not needed anymore). +- Added new build dependency liblept-devel, required now by the package. +- Added automake and libtool build dependencies in order to regenerate the + build system because of missing Makefile.in. +- Removed tesseract-traineddata-deu from recommended entries. +- Removed nonvoid.patch (fixed upstream). +- Added a patch (svutil.cpp_fix.patch) to fix compilation due to missing + includes (taken from upstream). +- Disabled compilation of static libraries. + +--- +Mon Oct 25 08:29:19 UTC 2010 - prus...@opensuse.org + +- fixed missing returns in nonvoid functions (nonvoid.patch) +- added missing post/postun scripts calling ldconfig +