commit tesseract-ocr for openSUSE:Factory

2020-03-27 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2020-03-27 21:57:30

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new.3160 (New)


Package is "tesseract-ocr"

Fri Mar 27 21:57:30 2020 rev:10 rq:788742 version:4.1.1

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2020-01-03 17:39:02.251358902 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.3160/tesseract-ocr.changes
2020-03-27 21:57:54.958814340 +0100
@@ -1,0 +2,7 @@
+Thu Mar 26 20:33:55 UTC 2020 - Bernhard Wiedemann 
+
+- Add tesseract-ocr-no-cpudetection.patch
+  to avoid crashing on older CPUs
+  and to make package build reproducible (boo#1159231)
+
+---

New:

  tesseract-ocr-no-cpudetection.patch



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.H1VtrM/_old  2020-03-27 21:57:56.686815345 +0100
+++ /var/tmp/diff_new_pack.H1VtrM/_new  2020-03-27 21:57:56.690815348 +0100
@@ -24,6 +24,8 @@
 License:Apache-2.0 AND GPL-2.0-or-later
 URL:https://github.com/tesseract-ocr/tesseract
 Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
+# PATCH-FIX-OPENSUSE -- boo#1159231
+Patch0: tesseract-ocr-no-cpudetection.patch
 BuildRequires:  asciidoc
 BuildRequires:  autoconf
 BuildRequires:  automake
@@ -70,7 +72,7 @@
 open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google.
 
 %prep
-%autosetup -n tesseract-%{version}
+%autosetup -n tesseract-%{version} -p1
 
 %build
 autoreconf -fiv

++ tesseract-ocr-no-cpudetection.patch ++
Author: Bernhard M. Wiedemann 
Date:   2020-03-26

https://bugzilla.opensuse.org/show_bug.cgi?id=1159231

avoid crashes on machines that are older than the random build machine
and make package builds reproducible.

See https://reproducible-builds.org/ for why this is good.

Index: tesseract-4.1.1/src/arch/Makefile.am
===
--- tesseract-4.1.1.orig/src/arch/Makefile.am
+++ tesseract-4.1.1/src/arch/Makefile.am
@@ -47,9 +47,6 @@ libtesseract_sse_la_CXXFLAGS = -msse4.1
 endif
 
 libtesseract_native_la_CXXFLAGS = -O3 -ffast-math
-if MARCH_NATIVE_OPT
-libtesseract_native_la_CXXFLAGS += -march=native -mtune=native
-endif
 libtesseract_native_la_SOURCES = dotproduct.cpp
 
 libtesseract_arch_la_SOURCES = intsimdmatrix.cpp simddetect.cpp



commit tesseract-ocr for openSUSE:Factory

2020-01-03 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2020-01-03 17:38:36

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675 (New)


Package is "tesseract-ocr"

Fri Jan  3 17:38:36 2020 rev:9 rq:760613 version:4.1.1

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2019-12-27 14:01:07.568882878 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675/tesseract-ocr.changes
2020-01-03 17:39:02.251358902 +0100
@@ -1,0 +2,5 @@
+Fri Jan  3 10:38:08 UTC 2020 - Tomáš Chvátal 
+
+- Require libarchive in the devel package
+
+---



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.TIGZyd/_old  2020-01-03 17:39:02.951359262 +0100
+++ /var/tmp/diff_new_pack.TIGZyd/_new  2020-01-03 17:39:02.951359262 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2019 SUSE LLC
+# Copyright (c) 2020 SUSE LLC
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -22,7 +22,6 @@
 Release:0
 Summary:Open Source OCR Engine
 License:Apache-2.0 AND GPL-2.0-or-later
-Group:  Productivity/Graphics/Other
 URL:https://github.com/tesseract-ocr/tesseract
 Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
 BuildRequires:  asciidoc
@@ -54,9 +53,9 @@
 
 %package devel
 Summary:Tesseract Open Source OCR Engine Development files
-Group:  Development/Libraries/Other
-Requires:   liblept-devel
 Requires:   libtesseract%{so_ver} = %{version}
+Requires:   pkgconfig(lept) >= 1.74
+Requires:   pkgconfig(libarchive)
 
 %description devel
 This package contains development files for the Tesseract Open Source OCR
@@ -64,7 +63,6 @@
 
 %package -n libtesseract%{so_ver}
 Summary:Open Source OCR Engine
-Group:  System/Libraries
 
 %description -n libtesseract%{so_ver}
 A commercial quality OCR engine originally developed at HP between 1985 and




commit tesseract-ocr for openSUSE:Factory

2019-12-27 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2019-12-27 14:01:05

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675 (New)


Package is "tesseract-ocr"

Fri Dec 27 14:01:05 2019 rev:8 rq:759653 version:4.1.1

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2019-12-13 12:06:05.133363375 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.6675/tesseract-ocr.changes
2019-12-27 14:01:07.568882878 +0100
@@ -1,0 +2,6 @@
+Fri Dec 27 08:59:17 UTC 2019 - Ismail Dönmez 
+
+- Update to version 4.1.1
+  * Bugfixes
+
+---

Old:

  tesseract-ocr-4.1.0.tar.gz

New:

  tesseract-ocr-4.1.1.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.TvoV6Q/_old  2019-12-27 14:01:08.100883125 +0100
+++ /var/tmp/diff_new_pack.TvoV6Q/_new  2019-12-27 14:01:08.104883127 +0100
@@ -18,7 +18,7 @@
 
 %define so_ver 4
 Name:   tesseract-ocr
-Version:4.1.0
+Version:4.1.1
 Release:0
 Summary:Open Source OCR Engine
 License:Apache-2.0 AND GPL-2.0-or-later

++ tesseract-ocr-4.1.0.tar.gz -> tesseract-ocr-4.1.1.tar.gz ++
 59413 lines of diff (skipped)




commit tesseract-ocr for openSUSE:Factory

2019-12-13 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2019-12-13 12:05:10

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new.4691 (New)


Package is "tesseract-ocr"

Fri Dec 13 12:05:10 2019 rev:7 rq:756765 version:4.1.0

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2018-02-21 14:11:47.061513395 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.4691/tesseract-ocr.changes
2019-12-13 12:06:05.133363375 +0100
@@ -1,0 +2,19 @@
+Fri Dec 13 09:03:19 UTC 2019 - Martin Pluskal 
+
+- Packaging Cleanups
+- Update dependencies and enable openCL
+
+---
+Fri Dec 13 03:01:02 UTC 2019 - hiwatari.se...@gmail.com
+
+- Update to 4.1.0
+  * Added a new output option formatted in the ALTO standard
+  * SIMD optimization
+  * Bugfixes
+- Update to 4.0.0
+  * New OCR engine based on LSTMs
+  * Removed Cube OCR engine
+  * Updated build system
+  * Cleanups and fixes
+
+---

Old:

  tesseract-ocr-3.05.01.tar.gz

New:

  tesseract-ocr-4.1.0.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.kBRd5Y/_old  2019-12-13 12:06:05.785363129 +0100
+++ /var/tmp/diff_new_pack.kBRd5Y/_new  2019-12-13 12:06:05.789363127 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2018 SUSE LINUX GmbH, Nuernberg, Germany.
+# Copyright (c) 2019 SUSE LLC
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -12,31 +12,39 @@
 # license that conforms to the Open Source Definition (Version 1.9)
 # published by the Open Source Initiative.
 
-# Please submit bugfixes or comments via http://bugs.opensuse.org/
+# Please submit bugfixes or comments via https://bugs.opensuse.org/
 #
 
 
-%define so_ver 3
+%define so_ver 4
 Name:   tesseract-ocr
-Version:3.05.01
+Version:4.1.0
 Release:0
 Summary:Open Source OCR Engine
-License:Apache-2.0 AND GPL-2.0+
+License:Apache-2.0 AND GPL-2.0-or-later
 Group:  Productivity/Graphics/Other
-Url:https://github.com/tesseract-ocr/tesseract
+URL:https://github.com/tesseract-ocr/tesseract
 Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
+BuildRequires:  asciidoc
 BuildRequires:  autoconf
 BuildRequires:  automake
-BuildRequires:  cairo-devel
 BuildRequires:  doxygen
 BuildRequires:  fdupes
 BuildRequires:  gcc-c++
-BuildRequires:  libicu-devel
-BuildRequires:  liblept-devel >= 1.74
 BuildRequires:  libtool
-BuildRequires:  m4
-BuildRequires:  pango-devel
-BuildRequires:  pkgconfig
+BuildRequires:  libxslt-tools
+BuildRequires:  opencl-headers
+BuildRequires:  pkgconfig >= 0.9.0
+BuildRequires:  pkgconfig(OpenCL)
+BuildRequires:  pkgconfig(cairo)
+BuildRequires:  pkgconfig(fontconfig)
+BuildRequires:  pkgconfig(icu-i18n) >= 52.1
+BuildRequires:  pkgconfig(icu-uc) >= 52.1
+BuildRequires:  pkgconfig(lept) >= 1.74
+BuildRequires:  pkgconfig(libarchive)
+BuildRequires:  pkgconfig(pango) >= 1.22.0
+BuildRequires:  pkgconfig(pangocairo) >= 1.22.0
+BuildRequires:  pkgconfig(pangoft2) >= 1.22.0
 Recommends: tesseract-ocr-traineddata-english
 
 %description
@@ -64,22 +72,17 @@
 open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google.
 
 %prep
-%setup -q -n tesseract-%{version}
-autoreconf -fi
-
-# Remove build time references so build-compare can do its work
-echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile
+%autosetup -n tesseract-%{version}
 
 %build
-export CXXFLAGS="%{optflags} -fno-strict-aliasing -fPIC"
-%configure --disable-static
-make %{?_smp_mflags}
-make training %{?_smp_mflags}
-make doc %{?_smp_mflags}
+autoreconf -fiv
+%configure \
+  --enable-opencl \
+   --disable-static
+%make_build all training doc
 
 %install
-%make_install
-make DESTDIR=%{buildroot} training-install
+%make_install all training-install
 
 # Remove libtool config files
 rm -f %{buildroot}%{_libdir}/libtesseract.la
@@ -90,6 +93,9 @@
 # Fix rpmlint warning "doc-file-dependency"
 rm -f %{buildroot}%{_defaultdocdir}/%{name}-devel/html/installdox
 
+# Fix rpmlint warning "non-executable-in-bin"
+chmod 0755 %{buildroot}%{_bindir}/tesstrain_utils.sh
+
 # Fix rpmlint warning "files-duplicate"
 %fdupes -s %{buildroot}
 
@@ -97,14 +103,15 @@
 %postun -n libtesseract%{so_ver} -p /sbin/ldconfig
 
 %files
-%doc AUTHORS COPYING ChangeLog README.md
+%doc AUTHORS ChangeLog README.md
+%license LICENSE
 

commit tesseract-ocr for openSUSE:Factory

2018-02-21 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2018-02-21 14:11:44

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Wed Feb 21 14:11:44 2018 rev:6 rq:578379 version:3.05.01

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2017-02-21 13:49:49.756754009 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2018-02-21 14:11:47.061513395 +0100
@@ -1,0 +2,10 @@
+Tue Feb 20 13:38:34 UTC 2018 - jweberho...@weberhofer.at
+
+- Update to 3.05.01
+  * Fixed several build issues
+  * Fixed C-API
+  * Backport pdfrenderer changes
+  * Code clean up
+- Spec file cleaned up
+
+---

Old:

  tesseract-ocr-3.05.00.tar.gz

New:

  tesseract-ocr-3.05.01.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.Bdx8xc/_old  2018-02-21 14:11:48.249470614 +0100
+++ /var/tmp/diff_new_pack.Bdx8xc/_new  2018-02-21 14:11:48.253470470 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2017 SUSE LINUX GmbH, Nuernberg, Germany.
+# Copyright (c) 2018 SUSE LINUX GmbH, Nuernberg, Germany.
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -17,12 +17,11 @@
 
 
 %define so_ver 3
-
 Name:   tesseract-ocr
-Version:3.05.00
+Version:3.05.01
 Release:0
 Summary:Open Source OCR Engine
-License:Apache-2.0 and GPL-2.0+
+License:Apache-2.0 AND GPL-2.0+
 Group:  Productivity/Graphics/Other
 Url:https://github.com/tesseract-ocr/tesseract
 Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
@@ -37,9 +36,8 @@
 BuildRequires:  libtool
 BuildRequires:  m4
 BuildRequires:  pango-devel
-BuildRequires:  pkg-config
+BuildRequires:  pkgconfig
 Recommends: tesseract-ocr-traineddata-english
-BuildRoot:  %{_tmppath}/%{name}-%{version}-build
 
 %description
 A commercial quality OCR engine originally developed at HP between 1985 and
@@ -96,11 +94,9 @@
 %fdupes -s %{buildroot}
 
 %post -n libtesseract%{so_ver} -p /sbin/ldconfig
-
 %postun -n libtesseract%{so_ver} -p /sbin/ldconfig
 
 %files
-%defattr(-,root,root,-)
 %doc AUTHORS COPYING ChangeLog README.md
 %{_bindir}/*
 %dir %{_datadir}/tessdata
@@ -111,14 +107,12 @@
 %{_mandir}/man5/*.5%{ext_man}
 
 %files devel
-%defattr(-,root,root,-)
 %doc %{_defaultdocdir}/tesseract-ocr-devel/
 %{_includedir}/tesseract/
 %{_libdir}/libtesseract*.so
 %{_libdir}/pkgconfig/*.pc
 
 %files -n libtesseract%{so_ver}
-%defattr(-,root,root,-)
 %{_libdir}/libtesseract.so.%{so_ver}*
 
 %changelog

++ tesseract-ocr-3.05.00.tar.gz -> tesseract-ocr-3.05.01.tar.gz ++
 5910 lines of diff (skipped)




commit tesseract-ocr for openSUSE:Factory

2017-02-21 Thread root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2017-02-21 13:49:48

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2016-02-22 08:58:05.0 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2017-02-21 13:49:49.756754009 +0100
@@ -1,0 +2,20 @@
+Fri Feb 17 14:39:58 UTC 2017 - idon...@suse.com
+
+- Update to 3.05.00
+  * Made some fine tuning to the hOCR output.
+  * Added TSV as another optional output format.
+  * Fixed ABI break introduced in 3.04.00 with the AnalyseLayout()
+method.
+  * text2image tool - Enable all OpenType ligatures available in
+a font. This feature requires Pango 1.38 or newer.
+  * Training tools - Replaced asserts with tprintf() and exit(1).
+  * Improved multipage tiff processing.
+  * Improved the embedded pdf font (pdf.ttf).
+  * Enable selection of OCR engine mode from command line.
+  * Changed tesseract command line parameter '-psm' to '--psm'.
+  * Added new C API for orientation and script detection, removed
+the old one.
+  * Fixed many compiler warning.
+  * Fixed memory and resource leaks.
+
+---

Old:

  3.04.01.tar.gz

New:

  tesseract-ocr-3.05.00.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.yGuLTK/_old  2017-02-21 13:49:50.512647626 +0100
+++ /var/tmp/diff_new_pack.yGuLTK/_new  2017-02-21 13:49:50.516647064 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2016 SUSE LINUX GmbH, Nuernberg, Germany.
+# Copyright (c) 2017 SUSE LINUX GmbH, Nuernberg, Germany.
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -19,19 +19,23 @@
 %define so_ver 3
 
 Name:   tesseract-ocr
-Version:3.04.01
+Version:3.05.00
 Release:0
 Summary:Open Source OCR Engine
 License:Apache-2.0 and GPL-2.0+
 Group:  Productivity/Graphics/Other
 Url:https://github.com/tesseract-ocr/tesseract
-Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz
+Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
+BuildRequires:  autoconf
+BuildRequires:  automake
 BuildRequires:  cairo-devel
 BuildRequires:  doxygen
 BuildRequires:  fdupes
 BuildRequires:  gcc-c++
 BuildRequires:  libicu-devel
-BuildRequires:  liblept-devel >= 1.71
+BuildRequires:  liblept-devel >= 1.74
+BuildRequires:  libtool
+BuildRequires:  m4
 BuildRequires:  pango-devel
 BuildRequires:  pkg-config
 Recommends: tesseract-ocr-traineddata-english
@@ -63,6 +67,7 @@
 
 %prep
 %setup -q -n tesseract-%{version}
+autoreconf -fi
 
 # Remove build time references so build-compare can do its work
 echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile
@@ -96,7 +101,7 @@
 
 %files
 %defattr(-,root,root,-)
-%doc AUTHORS COPYING ChangeLog README.md ReleaseNotes
+%doc AUTHORS COPYING ChangeLog README.md
 %{_bindir}/*
 %dir %{_datadir}/tessdata
 %{_datadir}/tessdata/configs/




commit tesseract-ocr for openSUSE:Factory

2016-02-21 Thread h_root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2016-02-22 08:58:00

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2015-10-06 13:26:56.0 +0200
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2016-02-22 08:58:05.0 +0100
@@ -1,0 +2,6 @@
+Fri Feb 19 11:44:19 UTC 2016 - idon...@suse.com
+
+- Update to 3.04.01
+  * No changelog upstream
+
+---

Old:

  3.04.00.tar.gz

New:

  3.04.01.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.cCYemk/_old  2016-02-22 08:58:06.0 +0100
+++ /var/tmp/diff_new_pack.cCYemk/_new  2016-02-22 08:58:06.0 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2015 SUSE LINUX GmbH, Nuernberg, Germany.
+# Copyright (c) 2016 SUSE LINUX GmbH, Nuernberg, Germany.
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -19,19 +19,19 @@
 %define so_ver 3
 
 Name:   tesseract-ocr
-Version:3.04.00
+Version:3.04.01
 Release:0
 Summary:Open Source OCR Engine
 License:Apache-2.0 and GPL-2.0+
 Group:  Productivity/Graphics/Other
-Url:http://code.google.com/p/tesseract-ocr/
+Url:https://github.com/tesseract-ocr/tesseract
 Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz
 BuildRequires:  cairo-devel
 BuildRequires:  doxygen
 BuildRequires:  fdupes
 BuildRequires:  gcc-c++
 BuildRequires:  libicu-devel
-BuildRequires:  liblept-devel
+BuildRequires:  liblept-devel >= 1.71
 BuildRequires:  pango-devel
 BuildRequires:  pkg-config
 Recommends: tesseract-ocr-traineddata-english
@@ -96,7 +96,7 @@
 
 %files
 %defattr(-,root,root,-)
-%doc AUTHORS COPYING ChangeLog README ReleaseNotes
+%doc AUTHORS COPYING ChangeLog README.md ReleaseNotes
 %{_bindir}/*
 %dir %{_datadir}/tessdata
 %{_datadir}/tessdata/configs/

++ 3.04.00.tar.gz -> 3.04.01.tar.gz ++
 26220 lines of diff (skipped)




commit tesseract-ocr for openSUSE:Factory

2015-10-06 Thread h_root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2015-10-06 13:26:54

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2015-09-19 06:55:04.0 +0200
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2015-10-06 13:26:56.0 +0200
@@ -1,0 +2,31 @@
+Fri Oct  2 21:46:07 UTC 2015 - asterios.dra...@gmail.com
+
+- Update to version 3.04.00:
+  * Added OpenCL support (experimental).
+  * Many bug fixes.
+  From version 3.03.00:
+  * Added new training tool text2image to generate box/tif file
+pairs from text and truetype fonts.
+  * Added support for PDF output with searchable text.
+  * Removed entire IMAGE class and all code in image directory.
+  * Tesseract executable: support for output to stdout; limited
+support for one page images from stdin  (especially on Windows)
+  * Added Renderer to API to allow document-level processing and
+output of document formats, like hOCR, PDF.
+  * Major refactor of word-level recognition, beam search,
+eliminating dead code.
+  * Refactored classifier to make it easier to add new ones.
+  * Generalized feature extractor to allow feature extraction from
+greyscale.
+  * Improved sub/superscript treatment.
+  * Improved baseline fit.
+  * Added set_unicharset_properties to training tools.
+  * Many bug fixes.
+  * More training source data included.
+- Added new build requirements cairo-devel, doxygen, libicu-devel
+  and pango-devel.
+- Recommend tesseract-ocr-traineddata-english instead of
+  tesseract-ocr-traineddata-american (based on new (3.04.00)
+  tesseract-ocr traineddata files).
+
+---

Old:

  tesseract-ocr-3.02.02-doc-html.tar.gz
  tesseract-ocr-3.02.02.tar.gz

New:

  3.04.00.tar.gz



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.zUGrrc/_old  2015-10-06 13:26:57.0 +0200
+++ /var/tmp/diff_new_pack.zUGrrc/_new  2015-10-06 13:26:57.0 +0200
@@ -19,19 +19,22 @@
 %define so_ver 3
 
 Name:   tesseract-ocr
-Version:3.02.02
+Version:3.04.00
 Release:0
 Summary:Open Source OCR Engine
 License:Apache-2.0 and GPL-2.0+
 Group:  Productivity/Graphics/Other
 Url:http://code.google.com/p/tesseract-ocr/
-Source0:
http://tesseract-ocr.googlecode.com/files/%{name}-%{version}.tar.gz
-Source1:
http://tesseract-ocr.googlecode.com/files/%{name}-%{version}-doc-html.tar.gz
+Source0:
https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz
+BuildRequires:  cairo-devel
+BuildRequires:  doxygen
 BuildRequires:  fdupes
 BuildRequires:  gcc-c++
+BuildRequires:  libicu-devel
 BuildRequires:  liblept-devel
+BuildRequires:  pango-devel
 BuildRequires:  pkg-config
-Recommends: tesseract-ocr-traineddata-american
+Recommends: tesseract-ocr-traineddata-english
 BuildRoot:  %{_tmppath}/%{name}-%{version}-build
 
 %description
@@ -59,15 +62,21 @@
 open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google.
 
 %prep
-%setup -q -b1 -n %{name}
+%setup -q -n tesseract-%{version}
+
+# Remove build time references so build-compare can do its work
+echo "HTML_TIMESTAMP = NO" >> doc/Doxyfile
 
 %build
 export CXXFLAGS="%{optflags} -fno-strict-aliasing -fPIC"
 %configure --disable-static
 make %{?_smp_mflags}
+make training %{?_smp_mflags}
+make doc %{?_smp_mflags}
 
 %install
 %make_install
+make DESTDIR=%{buildroot} training-install
 
 # Remove libtool config files
 rm -f %{buildroot}%{_libdir}/libtesseract.la
@@ -92,6 +101,7 @@
 %dir %{_datadir}/tessdata
 %{_datadir}/tessdata/configs/
 %{_datadir}/tessdata/tessconfigs/
+%{_datadir}/tessdata/pdf.ttf
 %{_mandir}/man1/*.1%{ext_man}
 %{_mandir}/man5/*.5%{ext_man}
 

++ tesseract-ocr-3.02.02-doc-html.tar.gz -> 3.04.00.tar.gz ++
/work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr-3.02.02-doc-html.tar.gz 
/work/SRC/openSUSE:Factory/.tesseract-ocr.new/3.04.00.tar.gz differ: char 5, 
line 1




commit tesseract-ocr for openSUSE:Factory

2015-09-18 Thread h_root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2015-09-19 06:55:03

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Changes:

--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes  
2015-09-02 00:36:41.0 +0200
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2015-09-19 06:55:04.0 +0200
@@ -1,0 +2,5 @@
+Mon Sep 14 18:22:13 UTC 2015 - asterios.dra...@gmail.com
+
+- Fix Recommends: entry to tesseract-ocr-traineddata-american.
+
+---



Other differences:
--
++ tesseract-ocr.spec ++
--- /var/tmp/diff_new_pack.oInRqt/_old  2015-09-19 06:55:05.0 +0200
+++ /var/tmp/diff_new_pack.oInRqt/_new  2015-09-19 06:55:05.0 +0200
@@ -31,7 +31,7 @@
 BuildRequires:  gcc-c++
 BuildRequires:  liblept-devel
 BuildRequires:  pkg-config
-Recommends: tesseract-traineddata-american
+Recommends: tesseract-ocr-traineddata-american
 BuildRoot:  %{_tmppath}/%{name}-%{version}-build
 
 %description




commit tesseract-ocr for openSUSE:Factory

2015-09-01 Thread h_root
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory 
checked in at 2015-09-02 00:36:39

Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and  /work/SRC/openSUSE:Factory/.tesseract-ocr.new (New)


Package is "tesseract-ocr"

Changes:

New Changes file:

--- /dev/null   2015-08-24 19:43:32.284261900 +0200
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new/tesseract-ocr.changes 
2015-09-02 00:36:41.0 +0200
@@ -0,0 +1,108 @@
+---
+Sat Jun 20 10:27:29 UTC 2015 - mailaen...@opensuse.org
+
+- rename to match upstream tarball and fix boo#900303
+
+---
+Sat Jun 22 20:00:58 UTC 2013 - asterios.dra...@gmail.com
+
+- Split library into separate package (libtesseract3).
+- Removed debuginfo package (not needed).
+- There is no need anymore to regenerate the build system (removed automake and
+  libtool build requirements).
+- Added pkg-config build requirement (fix for rpmlint error
+  "no-pkg-config-provides"). Removed also not needed
+  "Provides: pkgconfig(%{name})" entry.
+
+---
+Mon May  6 11:33:54 UTC 2013 - idon...@suse.com
+
+- Update license, some files are GPL-2.0+ licensed 
+
+---
+Mon Oct 29 11:36:22 UTC 2012 - j...@suse.com
+
+- Update to version 3.02.02
+  * untested
+- Notable features:
+  * Hebrew with BiDi support.
+  * More languages. 
+- removed upstreamed patch0
+
+---
+Mon Jun 25 18:35:52 UTC 2012 - asterios.dra...@gmail.com
+
+- Update to version 3.01:
+  * Removed old/dead serialise/deserialze methods on *LISTIZED classes.
+  * Total rewrite of DENORM to better encapsulate operation and make
+for potential to extract features from images.
+  * Thread-safety! Moved all critical globals and statics to
+members of the appropriate class. Tesseract is now
+thread-safe (multiple instances can be used in parallel
+in multiple threads.) with the minor exception that some
+control parameters are still global and affect all threads.
+  * Added Cube, a new recognizer for Arabic. Cube can also be
+used in combination with normal Tesseract for other languages
+with an improvement in accuracy at the cost of (much) lower speed.
+There is no training module for Cube yet.
+  * OcrEngineMode in Init replaces AccuracyVSpeed to control cube.
+  * Greatly improved segmentation search with consequent accuracy and
+speed improvements, especially for Chinese.
+  * Added PageIterator and ResultIterator as cleaner ways to get the
+full results out of Tesseract, that are not currently provided
+by any of the TessBaseAPI::Get* methods.
+All other methods, such as the ETEXT_STRUCT in particular are
+deprecated and will be deleted in the future.
+  * ApplyBoxes totally rewritten to make training easier.
+It can now cope with touching/overlapping training characters,
+and a new boxfile format allows word boxes instead of character
+boxes, BUT to use that you have to have already boostrapped the
+language with character boxes. "Cyclic dependency" on traineddata.
+  * Auto orientation and script detection added to page layout analysis.
+  * Deleted *lots* of dead code.
+  * Fixxht module replaced with scalable data-driven module.
+  * Output font characteristics accuracy improved.
+  * Removed the double conversion at each classification.
+  * Upgraded oldest structs to be classes and deprecated PBLOB.
+  * Removed non-deterministic baseline fit.
+  * Added fixed length dawgs for Chinese.
+  * Handling of vertical text improved.
+  * Handling of leader dots improved.
+  * Table detection greatly improved.
+- Removed the various languages traineddata subpackages (to be included in a
+  separate package "tesseract-traineddata").
+- Changed License to Apache-2.0 (SPDX style).
+- Removed libtiff-devel build dependency (not needed anymore).
+- Added new build dependency liblept-devel, required now by the package.
+- Added automake and libtool build dependencies in order to regenerate the
+  build system because of missing Makefile.in.
+- Removed tesseract-traineddata-deu from recommended entries.
+- Removed nonvoid.patch (fixed upstream).
+- Added a patch (svutil.cpp_fix.patch) to fix compilation due to missing
+  includes (taken from upstream).
+- Disabled compilation of static libraries.
+
+---
+Mon Oct 25 08:29:19 UTC 2010 - prus...@opensuse.org
+
+- fixed missing returns in nonvoid functions (nonvoid.patch)
+- added missing post/postun scripts calling ldconfig
+