tika-dev
Messages by Thread
[RESULT] [VOTE] Apache Tika 0.5 release candidate #1
Mattmann, Chris A (388J)
Re: [RESULT] [VOTE] Apache Tika 0.5 release candidate #1
Julien Nioche
Re: [RESULT] [VOTE] Apache Tika 0.5 release candidate #1
Jukka Zitting
Re: [RESULT] [VOTE] Apache Tika 0.5 release candidate #1
Jukka Zitting
Re: [VOTE] Apache Tika 0.5 release candidate #1
Grant Ingersoll
[jira] Created: (TIKA-323) Make Tika site look like Lucene ecosystem Apache Forrest-built sites
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-323) Make Tika site look like Lucene ecosystem Apache Forrest-built sites
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-323) Make Tika web site have its own style and identity outside of the default Maven look
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-323) Make Tika web site have its own style and identity outside of the default Maven look
Chris A. Mattmann (JIRA)
Build failed in Hudson: Tika-trunk » Apache Tika parent #217
Apache Hudson Server
Hudson build is back to normal: Tika-trunk » Apache Tika parent #218
Apache Hudson Server
Build failed in Hudson: Tika-trunk #217
Apache Hudson Server
Hudson build is back to normal: Tika-trunk #218
Apache Hudson Server
Build Unstable
Mattmann, Chris A (388J)
Hudson build became unstable: Tika-trunk #213
Apache Hudson Server
Hudson build is still unstable: Tika-trunk #214
Apache Hudson Server
Hudson build is still unstable: Tika-trunk #215
Apache Hudson Server
Hudson build is back to stable: Tika-trunk #216
Apache Hudson Server
Hudson build became unstable: Tika-trunk » Apache Tika parsers #213
Apache Hudson Server
Hudson build is still unstable: Tika-trunk » Apache Tika parsers #214
Apache Hudson Server
Hudson build is still unstable: Tika-trunk » Apache Tika parsers #215
Apache Hudson Server
Hudson build is back to stable: Tika-trunk » Apache Tika parsers #216
Apache Hudson Server
[jira] Created: (TIKA-322) Improve encoding detection speed and accuracy
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-322) Improve encoding detection speed and accuracy
Luke Nezda (JIRA)
[jira] Created: (TIKA-321) Optimize type detection speed
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-321) Optimize type detection speed
Jukka Zitting (JIRA)
[jira] Created: (TIKA-320) Allow disabling language detection in AutoDetectParser
Erik Hetzner (JIRA)
[jira] Resolved: (TIKA-320) Allow disabling language detection in AutoDetectParser
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-320) Allow disabling language detection in AutoDetectParser
Erik Hetzner (JIRA)
[jira] Created: (TIKA-319) HtmlParser - use encoding hint only if charset is supported
Piotr B. (JIRA)
[jira] Resolved: (TIKA-319) HtmlParser - use encoding hint only if charset is supported
Jukka Zitting (JIRA)
Parse context - class or map?
Jukka Zitting
Re: Parse context - class or map?
Michael Wechner
Re: Parse context - class or map?
Jukka Zitting
Re: Parse context - class or map?
Mattmann, Chris A (388J)
RE: Parse context - class or map?
Uwe Schindler
Re: Parse context - class or map?
Jukka Zitting
Tika facade - static or not
Jukka Zitting
Re: Tika facade - static or not
Michael Wechner
Re: Tika facade - static or not
Jukka Zitting
Re: Tika facade - static or not
Mattmann, Chris A (388J)
Re: Tika facade - static or not
Jérôme Charron
Re: Tika facade - static or not
Jukka Zitting
Re: Tika facade - static or not
Mattmann, Chris A (388J)
Re: Tika facade - static or not
Jukka Zitting
Re: Tika facade - static or not
Mattmann, Chris A (388J)
[jira] Commented: (TIKA-94) Speech recognition
David Woollard (JIRA)
[jira] Created: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13
JIRA
[jira] Updated: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13
Benson Margulies (JIRA)
[jira] Resolved: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13
Jukka Zitting (JIRA)
Free live video streaming of ApacheCon US 2009
Michael McCandless
Re: Free live video streaming of ApacheCon US 2009
Israel Ekpo
[jira] Created: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Mike Hays (JIRA)
[jira] Updated: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Mike Hays (JIRA)
[jira] Updated: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Maxim Valyanskiy (JIRA)
[jira] Created: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document
Sanjeev Rao (JIRA)
[jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document
Sanjeev Rao (JIRA)
[jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document
Sanjeev Rao (JIRA)
[jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document
Jukka Zitting (JIRA)
[jira] Created: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Maxim Valyanskiy (JIRA)
[jira] Commented: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-314) Initial support for JPEG EXIF metadata extraction
Jukka Zitting (JIRA)
[jira] Created: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes
Bart Hanssens (JIRA)
[jira] Updated: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes
Bart Hanssens (JIRA)
[jira] Resolved: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes
Jukka Zitting (JIRA)
MarkUnsupportedException
mastcheshmi
Re: MarkUnsupportedException
Jukka Zitting
[jira] Created: (TIKA-312) TikaCLI can't print metadata
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-312) TikaCLI can't print metadata
Maxim Valyanskiy (JIRA)
[jira] Resolved: (TIKA-312) TikaCLI can't print metadata
Jukka Zitting (JIRA)
[jira] Created: (TIKA-311) Broken handling of <a name="..."/> tags
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-311) Broken handling of <a name="..."/> tags
Jukka Zitting (JIRA)
FYI: NekoHTML/Xerces dependency replaced with TagSoup
Jukka Zitting
Re: FYI: NekoHTML/Xerces dependency replaced with TagSoup
Ken Krugler
[jira] Created: (TIKA-310) Use TagSoup to parse HTML
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-310) Use TagSoup to parse HTML
Jukka Zitting (JIRA)
Eclipse formatter (Was: [jira] Commented: (TIKA-295) Rough cut of mbox parser)
Jukka Zitting
[jira] Created: (TIKA-309) Mime type application/rdf+xml not correctly detected
Yuan-Fang Li (JIRA)
[jira] Resolved: (TIKA-309) Mime type application/rdf+xml not correctly detected
Jukka Zitting (JIRA)
[jira] Reopened: (TIKA-309) Mime type application/rdf+xml not correctly detected
Yuan-Fang Li (JIRA)
[jira] Issue Comment Edited: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Issue Comment Edited: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Issue Comment Edited: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Reopened: (TIKA-309) Mime type application/rdf+xml not correctly detected
Yuan-Fang Li (JIRA)
[jira] Commented: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-309) Mime type application/rdf+xml not correctly detected
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-309) Mime type application/rdf+xml not correctly detected
Yuan-Fang Li (JIRA)
[jira] Commented: (TIKA-309) Mime type application/rdf+xml not correctly detected
Chris A. Mattmann (JIRA)
[jira] Created: (TIKA-308) Improve supertype handling in type registry
Ken Krugler (JIRA)
[jira] Created: (TIKA-307) Better handling of partial/truncated input data to parsers
Ken Krugler (JIRA)
[jira] Created: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser
Bart Hanssens (JIRA)
[jira] Updated: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser
Bart Hanssens (JIRA)
[jira] Resolved: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser
Jukka Zitting (JIRA)
[jira] Created: (TIKA-305) XHTML href attributes end up in the wrong namespace
Benson Margulies (JIRA)
[jira] Updated: (TIKA-305) XHTML href attributes end up in the wrong namespace
Benson Margulies (JIRA)
[jira] Resolved: (TIKA-305) XHTML href attributes end up in the wrong namespace
Jukka Zitting (JIRA)
[jira] Created: (TIKA-304) HtmlParser could be easier to subclass
Benson Margulies (JIRA)
[jira] Updated: (TIKA-304) HtmlParser could be easier to subclass
Benson Margulies (JIRA)
[jira] Updated: (TIKA-304) HtmlParser could be easier to subclass
Benson Margulies (JIRA)
[jira] Commented: (TIKA-304) HtmlParser could be easier to subclass
Ken Krugler (JIRA)
[jira] Commented: (TIKA-304) HtmlParser could be easier to subclass
Benson Margulies (JIRA)
[jira] Resolved: (TIKA-304) HtmlParser could be easier to subclass
Jukka Zitting (JIRA)
[jira] Created: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)
[jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)
[jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)
[jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)
[jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)
[jira] Resolved: (TIKA-303) XHTMLContentHandler mishandles headers
Jukka Zitting (JIRA)
Info from parser on handling partial input
Ken Krugler
RE: [bulk] Info from parser on handling partial input
Hanssens Bart
Re: [bulk] Info from parser on handling partial input
Jukka Zitting
Re: Info from parser on handling partial input
Jukka Zitting
Re: Info from parser on handling partial input
Ken Krugler
[jira] Created: (TIKA-302) patch: initial support for ePUB
Bart Hanssens (JIRA)
[jira] Updated: (TIKA-302) patch: initial support for ePUB
Bart Hanssens (JIRA)
[jira] Commented: (TIKA-302) patch: initial support for ePUB
Jukka Zitting (JIRA)
RE: [bulk] [jira] Commented: (TIKA-302) patch: initial support for ePUB
Hanssens Bart
[jira] Resolved: (TIKA-302) patch: initial support for ePUB
Jukka Zitting (JIRA)
[jira] Created: (TIKA-301) patch: embedded ODF and office:annotation
Bart Hanssens (JIRA)
[jira] Updated: (TIKA-301) patch: embedded ODF and office:annotation
Bart Hanssens (JIRA)
[jira] Resolved: (TIKA-301) patch: embedded ODF and office:annotation
Jukka Zitting (JIRA)
[jira] Created: (TIKA-300) rename openoffice.. parser classes to odf..
Bart Hanssens (JIRA)
[jira] Resolved: (TIKA-300) rename openoffice.. parser classes to odf..
Jukka Zitting (JIRA)
[jira] Created: (TIKA-299) Update Geronimo dependency in tika-parsers pom.xml to 1.0.1
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-299) Update Geronimo dependency in tika-parsers pom.xml to 1.0.1
Jukka Zitting (JIRA)
[jira] Created: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back
Ken Krugler (JIRA)
[jira] Commented: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back
Ken Krugler (JIRA)
[jira] Updated: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back
Chris A. Mattmann (JIRA)
General question about patches
Ken Krugler
Re: General question about patches
Jukka Zitting
Re: General question about patches
Ken Krugler
[jira] Created: (TIKA-297) The HtmlParser ignores <menu> tags, resulting in invalid XHTML
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-297) The HtmlParser ignores <menu> tags, resulting in invalid XHTML
Jukka Zitting (JIRA)
[jira] Created: (TIKA-296) Automatically set the supertype for "+xml" mimetypes
Ken Krugler (JIRA)
[jira] Updated: (TIKA-296) Automatically set the supertype for "+xml" mimetypes
Ken Krugler (JIRA)
[jira] Updated: (TIKA-296) Automatically set the supertype for "+xml" mimetypes
Ken Krugler (JIRA)
[jira] Updated: (TIKA-296) Automatically set the supertype for "+xml" mimetypes
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-296) Automatically set the supertype for "+xml" mimetypes
Jukka Zitting (JIRA)
[jira] Created: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
[jira] Updated: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-295) Rough cut of mbox parser
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Alex Baranov (JIRA)
[jira] Issue Comment Edited: (TIKA-295) Rough cut of mbox parser
Alex Baranov (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Thilo Goetz (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)
Fall-back parser in AutoDetectParser
Ken Krugler
Re: Fall-back parser in AutoDetectParser
Jukka Zitting
Re: Fall-back parser in AutoDetectParser
Ken Krugler
Re: Fall-back parser in AutoDetectParser
Jukka Zitting
[jira] Created: (TIKA-294) TikaCLI always uses System.in for input
Ken Krugler (JIRA)
[jira] Updated: (TIKA-294) TikaCLI always uses System.in for input
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-294) TikaCLI always uses System.in for input
Jukka Zitting (JIRA)
Super-types for text mime types
Ken Krugler
Re: Super-types for text mime types
Jukka Zitting
Re: Super-types for text mime types
Ken Krugler
Towards Tika 0.5
Jukka Zitting
Re: Towards Tika 0.5
Mattmann, Chris A (388J)
[jira] Resolved: (TIKA-61) Add namespaces to our metadata keys
Jukka Zitting (JIRA)
[jira] Created: (TIKA-293) XWPFWordExtractorDecorator does not extract bookmarks
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-293) XWPFWordExtractorDecorator does not extract bookmarks
Maxim Valyanskiy (JIRA)
[jira] Resolved: (TIKA-293) XWPFWordExtractorDecorator does not extract bookmarks
Jukka Zitting (JIRA)
[jira] Created: (TIKA-292) PDFBox is too verbose
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-292) PDFBox is too verbose
Jukka Zitting (JIRA)
[jira] Created: (TIKA-291) Adobe InDesign support
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-291) Adobe InDesign support
Jukka Zitting (JIRA)
Test failures from trunk
Ken Krugler
Re: Test failures from trunk
Jukka Zitting
[jira] Created: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
MRIT64 (JIRA)
[jira] Updated: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
MRIT64 (JIRA)
[jira] Updated: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
MRIT64 (JIRA)
[jira] Issue Comment Edited: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
MRIT64 (JIRA)
[jira] Resolved: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.txtpar...@6caf16
MRIT64 (JIRA)
[jira] Created: (TIKA-289) Add magic byte patterns from file(1)
Jukka Zitting (JIRA)
Error in Eclipse with ordering of libs
Ken Krugler
Re: Error in Eclipse with ordering of libs
Jukka Zitting
Re: Error in Eclipse with ordering of libs
Ken Krugler
Re: Error in Eclipse with ordering of libs
Ken Krugler
[jira] Created: (TIKA-288) Support override parsers in AutoDetectParser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Ken Krugler (JIRA)
[jira] Created: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements
Ken Krugler (JIRA)
[jira] Commented: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements
Uwe Schindler (JIRA)