https://bugs.kde.org/show_bug.cgi?id=488526
Bug ID: 488526
Summary: Exporting PDF with OCR text recognition based on
tesseract doesn't work anymore
Classification: Applications
Product: Skanpage
Version: 24.05.0
Platform: Neon
OS: Linux
Status: REPORTED
Severity: normal
Priority: NOR
Component: general
Assignee: [email protected]
Reporter: [email protected]
Target Milestone: ---
SUMMARY
The "Export PDF" functionality lets me create a PDF with text recognition
in different languages. Clicking the "Export PDF" button opens a window that
should list the languages available to the tesseract module for text
recognition. However, since the latest update of Skanpage (deb package from
the repository), the list of languages is no longer shown and OCR no longer
works.
With the previous version of Skanpage, OCR worked perfectly.
Maybe the "tesseract" dependency is missing from the latest deb package?
STEPS TO REPRODUCE
1. Scan a page with Skanpage
2. Click the "Export PDF" button
3. The list of languages for the OCR is missing
OBSERVED RESULT
The list of languages for the OCR functionality is missing, so the exported
PDF contains no recognized text.
EXPECTED RESULT
It should be possible to select the languages that I want to recognize. The PDF
output should contain the recognized text, and I should be able to copy it.
SOFTWARE/OS VERSIONS
Windows: n/a
macOS: n/a
Linux/KDE Plasma: KDE neon 6.0 (based on Ubuntu 22.04)
KDE Plasma Version: 6.0.5
KDE Frameworks Version: 6.2.0
Qt Version: 6.7.0
ADDITIONAL INFORMATION
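One way to test the missing-dependency guess from the summary is to ask
Tesseract itself which languages it can see. The script below is only a
sketch and rests on assumptions: it assumes the "tesseract" command-line tool
is installed (Skanpage may link the library directly without it), and the
helper names parse_langs and installed_langs are made up for illustration.
An empty result or a missing binary would hint at a packaging problem but
would not prove one.

```python
import shutil
import subprocess
from typing import List, Optional


def parse_langs(output: str) -> List[str]:
    """Extract language codes from `tesseract --list-langs` output."""
    langs = []
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header.
        if not line or line.startswith("List of available languages"):
            continue
        langs.append(line)
    return langs


def installed_langs() -> Optional[List[str]]:
    """Return the language codes Tesseract reports, or None if the
    `tesseract` executable is not on PATH (hypothetical helper)."""
    exe = shutil.which("tesseract")
    if exe is None:
        return None
    result = subprocess.run(
        [exe, "--list-langs"], capture_output=True, text=True, check=False
    )
    # Some Tesseract versions print the list on stderr instead of stdout.
    return parse_langs(result.stdout or result.stderr)


if __name__ == "__main__":
    langs = installed_langs()
    if langs is None:
        print("tesseract executable not found on PATH")
    elif not langs:
        print("tesseract found, but no language data reported")
    else:
        print("languages:", ", ".join(langs))
```

If the script reports no executable or no languages, the output of
"dpkg -l | grep -i tesseract" would be worth attaching here as well.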