Messages by Thread
-
[jira] [Created] (TIKA-4413) Update tika-eval-app's xlsx writing to Zip64Mode.AlwaysWithCompatibility
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4391) Detect inline images in msg files
Tim Allison (Jira)
-
[PR] TIKA-4374 -- embedded file path names [tika]
via GitHub
-
[PR] TIKA-4413 -- use Zip64Mode.AlwaysWithCompatibility [tika]
via GitHub
-
[jira] [Comment Edited] (TIKA-4411) Run the 3.2.0 release process
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4411) Run the 3.2.0 release process
Tim Allison (Jira)
-
[jira] [Updated] (TIKA-4411) Run the 3.2.0 release process
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4400) Consider simplifying the build with a sandbox profile
Tim Allison (Jira)
-
[PR] TIKA-4391 -- identify "inline" images within .msg files. [tika]
via GitHub
-
[jira] [Created] (TIKA-4412) Tika cannot parse XPS files that are split into "pieces"
Ruairidh Williamson (Jira)
-
[jira] [Resolved] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
Release schedule
Tim Allison
-
[jira] [Created] (TIKA-4411) Run the 3.2.0 release process
Tim Allison (Jira)
-
[ANNOUNCE] Apache Tika 2.9.4 released
Tim Allison
-
[PR] Bump aws.version from 1.12.782 to 1.12.783 [tika]
via GitHub
-
[PR] Bump junit5.version from 5.13.0-M2 to 5.13.0-M3 [tika]
via GitHub
-
[PR] Bump com.github.luben:zstd-jni from 1.5.7-2 to 1.5.7-3 [tika]
via GitHub
-
[PR] Bump com.google.cloud:google-cloud-storage from 2.51.0 to 2.52.1 [tika]
via GitHub
-
[PR] Bump org.jsoup:jsoup from 1.19.1 to 1.20.1 [tika]
via GitHub
-
[PR] Bump pdfbox.version from 3.0.4 to 3.0.5 [tika]
via GitHub
-
[PR] Bump org.apache.commons:commons-configuration2 from 2.11.0 to 2.12.0 [tika]
via GitHub
-
[jira] [Created] (TIKA-4410) Improve feature extraction from xlsx
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4383) General updates for 2.9.4
Tilman Hausherr (Jira)
-
[VOTE] Release Apache Tika 2.9.4 Candidate #1
Tim Allison
-
[jira] [Commented] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[jira] [Created] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[jira] [Updated] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[PR] Bump protobuf.version from 3.25.6 to 3.25.7 [tika]
via GitHub
-
Re: [PR] Tika-2820: detection of Unix dump files (includes test files) [tika]
via GitHub
-
[jira] [Created] (TIKA-4408) python file identified as application/x-sh under several circumstances
Carol Alexandru (Jira)
-
[jira] [Closed] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Comment Edited] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Updated] (TIKA-4407) Docker: Could not find or load main class org.apache.tika.server.core.TikaServerCli
Alexander Skwar (Jira)
-
[jira] [Created] (TIKA-4407) Docker: Could not find or load main class org.apache.tika.server.core.TikaServerCli
Alexander Skwar (Jira)
-
[PR] TIKA-4406: add missing backslash [tika-docker]
via GitHub
-
[jira] [Resolved] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
ASF GitHub Bot (Jira)
-
[jira] [Created] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4405) XWPFEventBasedWordExtractor does not support run text that is marked as capitalized
PJ Fanning (Jira)
-
[jira] [Commented] (TIKA-4244) Tika idenifies MIME type of ics files with html content as text/html
Andreas Hubold (Jira)
-
[jira] [Created] (TIKA-4405) XWPFEventBasedWordExtractor does not support run text that is marked as capitalized
PJ Fanning (Jira)
-
[PR] [MINOR] mark some fields as final [tika]
via GitHub
-
[jira] [Created] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4404) PDFX conformance is never used
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4404) PDFX conformance is never used
Hudson (Jira)
-
"TODO: find an example where basic.getThumbNail is not null"
Tilman Hausherr
-
[jira] [Created] (TIKA-4404) PDFX conformance is never used
Tilman Hausherr (Jira)
-
[jira] [Resolved] (TIKA-4401) Catch jempbox's NumberFormatException
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4403) Implement transferTo in BoundedInputStream
Tim Allison (Jira)
-
[PR] TIKA-4395 -- improve handling and logging in container detection [tika]
via GitHub
-
[jira] [Commented] (TIKA-4403) Implement transferTo in BoundedInputStream
Hudson (Jira)
-
[jira] [Created] (TIKA-4403) Implement transferTo in BoundedInputStream
Tim Allison (Jira)
-
[jira] [Comment Edited] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tilman Hausherr (Jira)
-
[jira] [Resolved] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Comment Edited] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[PR] TIKA-4401 -- catch jempbox's numberformat exception [tika]
via GitHub
-
[jira] [Commented] (TIKA-4401) Catch jempbox's NumberFormatException
ASF GitHub Bot (Jira)
-
[jira] [Resolved] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tim Allison (Jira)
-
[jira] [Created] (TIKA-4401) Catch jempbox's NumberFormatException
Tim Allison (Jira)
-
[PR] TIKA-4400 -- move some modules to the sandbox profile [tika]
via GitHub
-
[jira] [Commented] (TIKA-4400) Consider simplifying the build with a sandbox profile
ASF GitHub Bot (Jira)
-
[jira] [Created] (TIKA-4400) Consider simplifying the build with a sandbox profile
Tim Allison (Jira)
-
next releases?
Tim Allison
-
[jira] [Commented] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
ASF GitHub Bot (Jira)
-
[PR] TIKA-4399 -- require TikaInputStream for embedded documents [tika]
via GitHub
-
[jira] [Reopened] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tim Allison (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Resolved] (TIKA-3604) Upgrade pdfbox3
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
OpenJDK Quality Outreach: Java 24 Is Now Available
David Delabassee
-
EmbeddedDocumentExtractor or OCR module for extracting images?
Cristian Zamfir
-
[PR] Bump io.netty:netty-bom from 4.2.0.RC4 to 4.2.0.Final [tika]
via GitHub
-
[PR] Bump org.codehaus.plexus:plexus-classworlds from 2.8.0 to 2.9.0 [tika]
via GitHub
-
[PR] Bump org.ops4j.pax.url:pax-url-aether from 2.6.16 to 3.0.0 [tika]
via GitHub
-
[PR] Bump poi.version from 5.4.0 to 5.4.1 [tika]
via GitHub
-
[PR] Bump com.microsoft.graph:microsoft-graph from 6.33.0 to 6.34.0 [tika]
via GitHub