Messages by Thread
-
Attributes in XHTML output
Ken Krugler
-
[jira] Created: (TIKA-422) Wrong charset conversion in some RTF documents.
Piotr B. (JIRA)
-
[jira] Created: (TIKA-421) DOAP file to recognize Tika on projects.a.o
Chris A. Mattmann (JIRA)
-
[jira] Created: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages
JIRA
-
Re: [jira] Resolved: (TIKA-419) Allow parser lookup from a custom class loader
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-418) RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps file types
Rajiv Kumar (JIRA)
-
[jira] Created: (TIKA-417) Unable to parse the content for UCS2 Little Endian encoded file
Rajiv Kumar (JIRA)
-
[jira] Created: (TIKA-415) Findbugs: XHTMLDowngradeHandler equals() comparing different types
peter_lena...@ibi.com (JIRA)
-
[jira] Created: (TIKA-414) bug in CompositeParser.getParser function
Piotr B. (JIRA)
-
Re: [netcdfgroup] NetCDF jars=>Maven Central Repos?
Mattmann, Chris A (388J)
-
Apache Tika is a top-level project!
Mattmann, Chris A (388J)
-
TLP Status
Grant Ingersoll
-
[jira] Created: (TIKA-412) Exclude the xml-apis dependency
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-411) Generate list of supported and detected types automatically
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-410) textbox content extraction for word documents
Ali Oral (JIRA)
-
[jira] Created: (TIKA-409) Missing poi-ooxml-schemas-3.6.jar in tika-bundle
Jukka Zitting (JIRA)
-
[jira] Resolved: (TIKA-92) Image metadata extraction
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-408) Word 6.0/7.0 documents support in office parser
Dmitry Kuzmenko (JIRA)
-
[jira] Updated: (TIKA-92) Image metadata extraction
Dmitry Kuzmenko (JIRA)
-
[jira] Commented: (TIKA-92) Image metadata extraction
Dmitry Kuzmenko (JIRA)
-
Missing poi-ooxml-schemas-3.6.jar in tika-bundle
Timo Boehme
-
[jira] Created: (TIKA-407) Push NetCDF4 lib dependency to Maven Central and Update Tika POM
Chris A. Mattmann (JIRA)
-
[jira] Created: (TIKA-406) Push NetCDF4 lib dependency to Maven Central and Update Tika POM
Chris A. Mattmann (JIRA)
-
[jira] Created: (TIKA-405) Problems handling Hyperlinks and Tables in Word 97 Docs
Curtis Warner (JIRA)
-
NetCDF jars=>Maven Central Repos?
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-404) Media-type handling depends on the locale
Jukka Zitting (JIRA)
-
Build with Maven. OutOfMemoryError
Николай Ижиков
-
[jira] Created: (TIKA-403) Refactor log library usage in tika-parsers
JIRA
-
IRC channel created
Mattmann, Chris A (388J)
-
Fwd: [NOTICE] compromised jira passwords
Jukka Zitting
-
[jira] Created: (TIKA-402) Support for Keynote and Pages documents
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-401) Tika hangs on corrupt zip files
Tom De Leu (JIRA)
-
[jira] Created: (TIKA-400) netCDF Tika Parser
Chris A. Mattmann (JIRA)
-
[jira] Created: (TIKA-399) HDF4/5 Tika Parser
Chris A. Mattmann (JIRA)
-
[ANNOUNCE] Apache Tika 0.7 released
Chris Mattmann
-
[RESULT] [VOTE] Apache Tika 0.7 Release Candidate #1
Mattmann, Chris A (388J)
-
Student Project, Apache Tika
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-398) TestParsers fails when classpath contains special characters like spaces
Uwe Schindler (JIRA)
-
[jira] Created: (TIKA-397) Parser crashes on very simple file
Ross Keatinge (JIRA)
-
[VOTE] Apache Tika 0.7 Release Candidate #1
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-396) Parser Attachments from Outlook Messages
Dave Meikle (JIRA)
-
[jira] Created: (TIKA-395) Tika fails to extract Messages from Outlook 2007
Dave Meikle (JIRA)
-
[jira] Created: (TIKA-394) Missing spaces on html parsing
Andrey Barhatov (JIRA)
-
[jira] Created: (TIKA-393) Upgrade to PDFBOX 1.1.0
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-392) RTF parser smashes words together in subsequent table cells
Jukka Zitting (JIRA)
-
[jira] Created: (TIKA-391) Intermittent errors detecting xls files
Simon Tyler (JIRA)
-
[jira] Created: (TIKA-390) Missing Header/Footer text for ODT documents
JIRA
-
[VOTE] Apache Tika TLP Board Resolution
Mattmann, Chris A (388J)
-
OutOfMemory exception
sangri
-
[jira] Created: (TIKA-389) Garbled metadata when dealing with encrypted PDF files.
Gabriel Miklos (JIRA)
-
[PROPOSAL] Apache Tika TLP board resolution
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-388) Don't trust streams that claim mark support
Jukka Zitting (JIRA)
-
Detector results for Excel formats
Simon Tyler
-
[DISCUSS] Apache Tika as TLP
Mattmann, Chris A (388J)
-
[jira] Created: (TIKA-387) htmlparser throws IllegalCharsetNameException
Piotr B. (JIRA)
-
Streaming files directly to Tika
Wick2804
-
[jira] Created: (TIKA-386) Tika relies on X11
Kenny Neal (JIRA)
-
[jira] Created: (TIKA-385) Incorrect handling of hyperlinks in .docx
Liam O'Boyle (JIRA)
-
[jira] Created: (TIKA-384) incorrect mime type detection when Metadata.RESOURCE_NAME_KEY set
Jim Kay (JIRA)
-
[jira] Created: (TIKA-383) new option for TIKA CLI to get only the languages of a document
Markus Goldbach (JIRA)
-
[jira] Created: (TIKA-382) No text extraction in tika-app
Markus Goldbach (JIRA)