user
Messages by Thread
Getting language of parsed text
Peter Kronenberg
Re: Getting language of parsed text
Tim Allison
RE: Getting language of parsed text
Peter Kronenberg
Re: Getting language of parsed text
Tim Allison
Re: Rotation script
Tim Allison
Re: [EXTERNAL] Re: Rotation script
Chris Mattmann
Re: Rotation script
Peter Kronenberg
Re: Rotation script
Tim Allison
Re: Rotation script
Tim Allison
Re: Rotation script
Tim Allison
Re: Rotation script
Tim Allison
RE: Rotation script
Peter Kronenberg
RE: Rotation script
Peter Kronenberg
RE: Rotation script
Peter Kronenberg
PDFs and detectAngles
Tim Allison
Re: PDFs and detectAngles
Tim Allison
Image processing timings
Peter Kronenberg
Re: Image processing timings
Tim Allison
RE: Image processing timings
Peter Kronenberg
RE: Image processing timings
Peter Kronenberg
Turning off ImageProcessing
Peter Kronenberg
Re: Turning off ImageProcessing
Tim Allison
RE: Turning off ImageProcessing
Peter Kronenberg
RE: Turning off ImageProcessing
Peter Kronenberg
RE: Turning off ImageProcessing
Peter Kronenberg
Re: Turning off ImageProcessing
Tim Allison
RE: Turning off ImageProcessing
Peter Kronenberg
Re: Turning off ImageProcessing
Tim Allison
RE: Turning off ImageProcessing
Peter Kronenberg
OCR of other than PDF files
Peter Kronenberg
Re: OCR of other than PDF files
Tim Allison
OCR_STRATEGY=AUTO
Peter Kronenberg
Re: OCR_STRATEGY=AUTO
Tim Allison
ApplyRotation default?
Peter Kronenberg
Re: ApplyRotation default?
Tim Allison
RE: ApplyRotation default?
Peter Kronenberg
TesseractOCRConfig which jar?
Peter Kronenberg
Re: TesseractOCRConfig which jar?
Tim Allison
Re: TesseractOCRConfig which jar?
Tim Allison
Re: TesseractOCRConfig which jar?
Peter Kronenberg
tesseract resize option
Tim Allison
RE: {EXTERNAL}tesseract resize option
Peter Kronenberg
RE: {EXTERNAL}tesseract resize option
Peter Kronenberg
Re: {EXTERNAL}tesseract resize option
Tim Allison
RE: {EXTERNAL}tesseract resize option
Peter Kronenberg
PDFBox's detectAngles
Tim Allison
Re: Problem parsing DOCX
Tim Allison
RE: Problem parsing DOCX
Peter Kronenberg
RE: Problem parsing DOCX
Peter Kronenberg
Tika on repository.apache.org
Peter Kronenberg
Re: Tika on repository.apache.org
Tim Allison
RE: Tika on repository.apache.org
Peter Kronenberg
Re: Tika on repository.apache.org
Tim Allison
RE: Tika on repository.apache.org
Peter Kronenberg
Re: Tika on repository.apache.org
Tim Allison
Re: Tika on repository.apache.org
Peter Kronenberg
Language detection
Peter Kronenberg
Re: Language detection
Tim Allison
RE: Language detection
Peter Kronenberg
ocr examples
Tim Allison
PDFParser.properties formatting
Peter Kronenberg
Re: PDFParser.properties formatting
Tim Allison
Setting parser options
Peter Kronenberg
Re: Setting parser options
Tim Allison
RE: Setting parser options
Peter Kronenberg
Page Segmentation Mode
Peter Kronenberg
Re: Page Segmentation Mode
Tim Allison
RE: Page Segmentation Mode
Peter Kronenberg
RE: Page Segmentation Mode
Peter Kronenberg
Re: Page Segmentation Mode
Luís Filipe Nassif
OCR on PDFs
Peter Kronenberg
Re: OCR on PDFs
Nick Burch
Re: OCR on PDFs
Tim Allison
RE: OCR on PDFs
Peter Kronenberg
Exceeding character limit on parse
Peter Kronenberg
Metadata
Peter Kronenberg
Re: Metadata
Nick Burch
Re: Apache Tika issue
Tim Allison
Re: Apache Tika issue
Tilman Hausherr
Re: Apache Tika issue
Tilman Hausherr
Mimetypes
Peter Kronenberg
Re: Mimetypes
Tim Allison
Re: Mimetypes
Tim Allison
RE: Mimetypes
Peter Kronenberg
Re: Mimetypes
Tamás Cservenák
RE: Mimetypes
Peter Kronenberg
Re: Mimetypes
Nick Burch
RE: Mimetypes
Peter Kronenberg
RE: Mimetypes
Nick Burch
RE: Mimetypes
Peter Kronenberg
RE: Mimetypes
Nick Burch
RE: Mimetypes
Peter Kronenberg
RE: Mimetypes
Nick Burch
[ANNOUNCE] Apache Tika 1.25 released
Tim Allison
[ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Tim Allison
Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Peter Lee
Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Chris Mattmann
Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Luís Filipe Nassif
Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Dave Meikle
Re: [ANNOUNCE] Welcome Peter Lee as Tika PMC member and committer
Furkan KAMACI
[VOTE] Release Apache Tika 1.25 Candidate #2
Tim Allison
Re: [VOTE] Release Apache Tika 1.25 Candidate #2
Dave Meikle
Re: [VOTE] Release Apache Tika 1.25 Candidate #2
Ken Krugler
Re: [VOTE] Release Apache Tika 1.25 Candidate #2
Oleg Tikhonov
Re: [VOTE] Release Apache Tika 1.25 Candidate #2
Sebastian Nagel
[RESULT] [VOTE] Release Apache Tika 1.25 Candidate #2
Tim Allison
Why does Tika offer a client-server option?
Robert Raines
Re: Why does Tika offer a client-server option?
Tim Allison
Re: Why does Tika offer a client-server option?
Ralph Soika
Re: Why does Tika offer a client-server option?
Slava G
Re: Why does Tika offer a client-server option?
Tucker B
Re: Why does Tika offer a client-server option?
Ken Krugler
Re: Why does Tika offer a client-server option?
Adam Rauch
[VOTE] Release Apache Tika 1.25 Candidate #1
Tim Allison
Re: [VOTE] Release Apache Tika 1.25 Candidate #1
David Pilato
Re: [VOTE] Release Apache Tika 1.25 Candidate #1
Tilman Hausherr
Re: [VOTE] Release Apache Tika 1.25 Candidate #1
Tim Allison
Re: [VOTE] Release Apache Tika 1.25 Candidate #1
David Meikle
[RESULT] [VOTE] Release Apache Tika 1.25 Candidate #1
Tim Allison
Re: [RESULT] [VOTE] Release Apache Tika 1.25 Candidate #1
Tim Allison
Getting font style and size out of PDFs
Bogdan Kostic
Re: Getting font style and size out of PDFs
Tim Allison
Extract only normal/OCR text from a document
nensick
Extract URLs from a document
nensick
Re: Extract URLs from a document
Nick Burch
Re: Extract URLs from a document
nensick
RE: Extract URLs from a document
Markus Jelsma
tika parser detecting "IBM500" for small files
Satinder Singh
Re: tika parser detecting "IBM500" for small files
John Patrick
Re: tika parser detecting "IBM500" for small files
Satinder Singh
Re: tika parser detecting "IBM500" for small files
Satinder Singh
Re: tika parser detecting "IBM500" for small files
John Patrick
Re: tika parser detecting "IBM500" for small files
Satinder Singh
Re: tika parser detecting "IBM500" for small files
John Patrick
Re: tika parser detecting "IBM500" for small files
John Patrick
Missing hyperlink after parsing .odt file
Robert Kaulbach
Re: Missing hyperlink after parsing .odt file
Tim Allison
Error when parsing of Excel files
Slava G
Re: Error when parsing of Excel files
Tim Allison
Re: Error when parsing of Excel files
Slava G
Re: Error when parsing of Excel files
Tim Allison
Re: Error when parsing of Excel files
Slava G
Rika, a Tika Wrapper for JRuby
Keith Bennett
Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Jim Garrison
Re: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Tilman Hausherr
Re: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Slava G
Re: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Tilman Hausherr
Re: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Jim Garrison
Re: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()
Tilman Hausherr
Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Jim Garrison
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Tim Allison
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Tilman Hausherr
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Tilman Hausherr
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Tilman Hausherr
Re: Test Failure building 1.25_SHAPSHOT [was: Tika App 1.24.1 NPE in AbstractPDF2XHTML.extractXMPXFA()]
Tilman Hausherr
Re: Getting white space between characters in PDF extraction.
Tim Allison
Re: Getting white space between characters in PDF extraction.
Tim Allison
Re: Getting white space between characters in PDF extraction.
Tilman Hausherr
Parsing OneNote on TIKA 1.24 makes entire JAVA process to crash
Slava G
Re: Parsing OneNote on TIKA 1.24 makes entire JAVA process to crash
Tim Allison
Re: Parsing OneNote on TIKA 1.24 makes entire JAVA process to crash
Slava G
Announcing ApacheCon @Home 2020
Rich Bowen
ExceptionInInitializationError - PDDocument
aravinth thangasami
Re: ExceptionInInitializationError - PDDocument
Tilman Hausherr
Inconsistent MIME type detection
Maloney, Patrick (ITS)
Re: Inconsistent MIME type detection
Tim Allison
TesseractOCRParser - As separate process - Clarification
aravinth thangasami
Re: TesseractOCRParser - As separate process - Clarification
Tim Allison
Missing XMP Metadata from PDF
Tucker B
Re: Missing XMP Metadata from PDF
Tim Allison
Re: Missing XMP Metadata from PDF
Tim Allison
[CVE-2020-9489] Denial of Service (DOS) Vulnerabilities in Some of Apache Tika's Parsers
Tim Allison
[ANNOUNCE] Apache Tika 1.24.1 released
Tim Allison
WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Bradley Beach
Re: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Nick Burch
Re: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Tim Allison
RE: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Bradley Beach
Re: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Tim Allison
Re: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Nick Burch
RE: WARNING: org.xerial's sqlite-jdbc is not loaded for 1.2.4
Bradley Beach
[VOTE] Release Apache Tika 1.24.1 Candidate #1
Tim Allison
Re: [VOTE] Release Apache Tika 1.24.1 Candidate #1
Sebastian Nagel
[RESULT][VOTE] Release Apache Tika 1.24.1 Candidate #1
Tim Allison
Clarification on Javax/* package inside tika-app-1.24 jar
aravinth thangasami
Re: Clarification on Javax/* package inside tika-app-1.24 jar
Tim Allison
Re: Clarification on Javax/* package inside tika-app-1.24 jar
Maxim Solodovnik
Re: Clarification on Javax/* package inside tika-app-1.24 jar
Konstantin Gribov
Re: Clarification on Javax/* package inside tika-app-1.24 jar
Maxim Solodovnik
Re: Clarification on Javax/* package inside tika-app-1.24 jar
aravinth thangasami
[CVE-2020-1951] Infinite Loop (DoS) vulnerability in Apache Tika's PSDParser
Tim Allison
[CVE-2020-1950] Excessive memory usage (DoS) vulnerability in Apache Tika's PSDParser
Tim Allison
Final CFP CodiEsp: Clinical Case Coding Task (eHealth CLEF 2020)
Martin Krallinger
[ANNOUNCE] Apache Tika 1.24 released
Tim Allison
[VOTE] Release Apache Tika 1.24 Candidate #3
Tim Allison
Re: [VOTE] Release Apache Tika 1.24 Candidate #3
Tilman Hausherr
Unable to parse PDF due to NoSuchFieldError: HAS_XMP
Markus Jelsma
Re: Unable to parse PDF due to NoSuchFieldError: HAS_XMP
Tim Allison
RE: Unable to parse PDF due to NoSuchFieldError: HAS_XMP
Markus Jelsma
Identifying Document Containing Images
aravinth thangasami
Apache Tika Server Warning
Toni Ojsteršek