tika-dev
Messages by Thread
[jira] Updated: (TIKA-354) ProfilingHandler should take a length-limiting parameter
Ken Krugler (JIRA)
[jira] Updated: (TIKA-354) ProfilingHandler should take a length-limiting parameter
Ken Krugler (JIRA)
The case of the unexpected error
Ken Krugler
Re: The case of the unexpected error
Jukka Zitting
Re: The case of the unexpected error
Luke Nezda
Re: The case of the unexpected error
Felix Meschberger
[jira] Created: (TIKA-353) Upgrade to POI 3.6
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-353) Upgrade to POI 3.6
Jukka Zitting (JIRA)
[jira] Created: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Ken Krugler (JIRA)
[jira] Updated: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Ken Krugler (JIRA)
[jira] Commented: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-352) Use MediaType.parse when extracting charset from content-type metadata in parsers
Ken Krugler (JIRA)
[jira] Created: (TIKA-351) MediaType.parse should be more forgiving of broken input
Ken Krugler (JIRA)
[jira] Updated: (TIKA-351) MediaType.parse should be more forgiving of broken input
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-351) MediaType.parse should be more forgiving of broken input
Jukka Zitting (JIRA)
[jira] Created: (TIKA-350) HtmlParser's content-type handling code needs to be more flexible
Ken Krugler (JIRA)
[jira] Updated: (TIKA-350) HtmlParser's content-type handling code needs to be more flexible
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-350) HtmlParser's content-type handling code needs to be more flexible
Jukka Zitting (JIRA)
[jira] Created: (TIKA-349) HtmlParser's http-equiv code needs to be more flexible
Ken Krugler (JIRA)
[jira] Updated: (TIKA-349) HtmlParser's http-equiv code needs to be more flexible
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-349) HtmlParser's http-equiv code needs to be more flexible
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-125) Pass Locale information to parsers
Jukka Zitting (JIRA)
[jira] Created: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Chris A. Mattmann (JIRA)
[jira] Assigned: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-348) Tika can't parse XLSX when build with latest POI trunk version
Jukka Zitting (JIRA)
HtmlMapper
Jukka Zitting
[jira] Created: (TIKA-347) Make HtmlParser customizable through ParseContext
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-347) Make HtmlParser customizable through ParseContext
Jukka Zitting (JIRA)
[jira] Created: (TIKA-346) ZipParser throws "invalid compression method" error for some archives
Robert Trickey (JIRA)
[jira] Updated: (TIKA-346) ZipParser throws "invalid compression method" error for some archives
Robert Trickey (JIRA)
[jira] Updated: (TIKA-346) ZipParser throws "invalid compression method" error for some archives
Jukka Zitting (JIRA)
Charset detection
Antoni Mylka
Re: Charset detection
Jérôme Charron
Re: Charset detection
Alex Ott
Re: [Aperture-devel] Charset detection
darren
Re: [Aperture-devel] Charset detection
Thilo Goetz
Re: [Aperture-devel] Charset detection
Christiaan Fluit
[jira] Created: (TIKA-345) Add application/vnd.wap.xhtml+xml to list of mimetypes handled by HtmlParser
Ken Krugler (JIRA)
[jira] Updated: (TIKA-345) Add application/vnd.wap.xhtml+xml to list of mimetypes handled by HtmlParser
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-345) Add application/vnd.wap.xhtml+xml to list of mimetypes handled by HtmlParser
Jukka Zitting (JIRA)
[jira] Created: (TIKA-344) Charset hint in metadata
Piotr B. (JIRA)
[jira] Commented: (TIKA-344) Charset hint in metadata
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-344) Charset hint in metadata
Jukka Zitting (JIRA)
[jira] Created: (TIKA-343) some parsers produces glued words
Piotr B. (JIRA)
[jira] Resolved: (TIKA-343) some parsers produces glued words
Jukka Zitting (JIRA)
HTML mime-types
Ken Krugler
Re: HTML mime-types
Jukka Zitting
source repository on Tika page
Julien Nioche
Re: source repository on Tika page
Karl Heinz Marbaise
Re: source repository on Tika page
Ken Krugler
Better Ohloh history for Tika
Jukka Zitting
Re: Better Ohloh history for Tika
Mattmann, Chris A (388J)
Re: Better Ohloh history for Tika
Jukka Zitting
New Tika committer
Jukka Zitting
Re: New Tika committer
Mattmann, Chris A (388J)
Re: New Tika committer
Ken Krugler
What to export from the tika-bundle ?
Felix Meschberger
[jira] Created: (TIKA-342) Improve OSGi bundling
Felix Meschberger (JIRA)
[jira] Updated: (TIKA-342) Improve OSGi bundling
Felix Meschberger (JIRA)
[jira] Resolved: (TIKA-342) Improve OSGi bundling
Jukka Zitting (JIRA)
[jira] Created: (TIKA-341) Use charset in CONTENT_TYPE metadata when detecting the character encoding
Ken Krugler (JIRA)
[jira] Updated: (TIKA-341) Use charset in CONTENT_TYPE metadata when detecting the character encoding
Ken Krugler (JIRA)
[jira] Updated: (TIKA-341) Use charset in CONTENT_TYPE metadata when detecting the character encoding
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-341) Use charset in CONTENT_TYPE metadata when detecting the character encoding
Jukka Zitting (JIRA)
Fwd: a 'lite' version of ooxml-schemas jar
Jukka Zitting
Tika 0.6 soon?
Jukka Zitting
Re: Tika 0.6 soon?
Mattmann, Chris A (388J)
Re: Tika 0.6 soon?
Jukka Zitting
RE: Tika 0.6 soon?
Jana, Kumar Raja
Re: Tika 0.6 soon?
Mattmann, Chris A (388J)
Re: Tika 0.6 soon?
Jukka Zitting
Re: Tika 0.6 soon?
Ken Krugler
[jira] Created: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Updated: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Updated: (TIKA-340) Provide full Tika bundle
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Updated: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Ken Krugler (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Andrzej Bialecki (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Updated: (TIKA-340) Provide full Tika bundle
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-340) Provide full Tika bundle
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Issue Comment Edited: (TIKA-340) Provide full Tika bundle
Felix Meschberger (JIRA)
[jira] Created: (TIKA-339) HtmlParser & TXTParser should not use language returned by CharsetDetector if language hint has been provided
Ken Krugler (JIRA)
[jira] Commented: (TIKA-339) HtmlParser & TXTParser should not use language returned by CharsetDetector if language hint has been provided
Ken Krugler (JIRA)
[jira] Updated: (TIKA-339) HtmlParser & TXTParser should not use language returned by CharsetDetector if language hint has been provided
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-339) HtmlParser & TXTParser should not use language returned by CharsetDetector if language hint has been provided
Jukka Zitting (JIRA)
[jira] Created: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception
Peter Wolanin (JIRA)
[jira] Closed: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception
Jukka Zitting (JIRA)
[jira] Created: (TIKA-337) SWF parser
Julien Nioche (JIRA)
[jira] Updated: (TIKA-337) SWF parser
Julien Nioche (JIRA)
[jira] Resolved: (TIKA-337) SWF parser
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-337) SWF parser
Julien Nioche (JIRA)
[jira] Created: (TIKA-336) More issues with RDF mime detection
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-336) More issues with RDF mime detection
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-336) More issues with RDF mime detection
Yuan-Fang Li (JIRA)
[jira] Created: (TIKA-335) TXTParser use of CharsetDetector has several bugs
Ken Krugler (JIRA)
[jira] Updated: (TIKA-335) TXTParser should use incoming charset
Ken Krugler (JIRA)
[jira] Updated: (TIKA-335) TXTParser should use incoming charset
Ken Krugler (JIRA)
[jira] Commented: (TIKA-335) TXTParser should use incoming charset
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-335) TXTParser should use incoming charset
Ken Krugler (JIRA)
[jira] Updated: (TIKA-335) TXTParser should use incoming charset
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-335) TXTParser should use incoming charset
Jukka Zitting (JIRA)
[jira] Created: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag
Ken Krugler (JIRA)
[jira] Updated: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag
Jukka Zitting (JIRA)
[jira] Created: (TIKA-333) Improve accuracy of charset detection for HTML pages
Ken Krugler (JIRA)
[jira] Closed: (TIKA-333) Improve accuracy of charset detection for HTML pages
Ken Krugler (JIRA)
[jira] Created: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Ken Krugler (JIRA)
[jira] Commented: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Ken Krugler (JIRA)
[jira] Updated: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Ken Krugler (JIRA)
[jira] Updated: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Ken Krugler (JIRA)
[jira] Updated: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents
Jukka Zitting (JIRA)
[jira] Created: (TIKA-331) Windings font recognition in Tika parsing + spacing issue
MRIT64 (JIRA)
[jira] Updated: (TIKA-331) Windings font recognition in Tika parsing + spacing issue
MRIT64 (JIRA)
[jira] Updated: (TIKA-331) Windings font recognition in Tika parsing + spacing issue
MRIT64 (JIRA)
[jira] Commented: (TIKA-331) Windings font recognition in Tika parsing + spacing issue
MRIT64 (JIRA)
[jira] Commented: (TIKA-331) Windings font recognition in Tika parsing + spacing issue
Ken Krugler (JIRA)
[jira] Created: (TIKA-330) Better HWP (Hangul Word Processor) detection pattern
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-330) Better HWP (Hangul Word Processor) detection pattern
Jukka Zitting (JIRA)
Build failed in Hudson: Tika-trunk #226
Apache Hudson Server
Build failed in Hudson: Tika-trunk #227
Apache Hudson Server
Build failed in Hudson: Tika-trunk #228
Apache Hudson Server
FW: Build failed in Hudson: Tika-trunk #228
Mattmann, Chris A (388J)
Re: FW: Build failed in Hudson: Tika-trunk #228
Jukka Zitting
Re: Build failed in Hudson: Tika-trunk #228
Mattmann, Chris A (388J)
RE: Build failed in Hudson: Tika-trunk #228
Gavin
Re: Build failed in Hudson: Tika-trunk #228
Mattmann, Chris A (388J)
Build failed in Hudson: Tika-trunk #229
Apache Hudson Server
Build failed in Hudson: Tika-trunk #230
Apache Hudson Server
Hudson build is back to normal: Tika-trunk #231
Apache Hudson Server
[ANNOUNCE] Apache Tika 0.5 Released
Mattmann, Chris A (388J)
Re: [ANNOUNCE] Apache Tika 0.5 Released
Steen Manniche
Re: [ANNOUNCE] Apache Tika 0.5 Released
Mattmann, Chris A (388J)
Re: [ANNOUNCE] Apache Tika 0.5 Released
Karl Heinz Marbaise
Re: [ANNOUNCE] Apache Tika 0.5 Released
Jukka Zitting
Re: [ANNOUNCE] Apache Tika 0.5 Released
Mattmann, Chris A (388J)
Re: [ANNOUNCE] Apache Tika 0.5 Released
Karl Heinz Marbaise
[jira] Created: (TIKA-329) secure-processing not supported by some JAXP implementations (2)
Julien Nioche (JIRA)
[jira] Updated: (TIKA-329) secure-processing not supported by some JAXP implementations (2)
Julien Nioche (JIRA)
[jira] Resolved: (TIKA-329) secure-processing not supported by some JAXP implementations (2)
Jukka Zitting (JIRA)
[jira] Created: (TIKA-328) Add parser for .flv videos
Sami Siren (JIRA)
[jira] Updated: (TIKA-328) Add parser for .flv videos
Sami Siren (JIRA)
[jira] Commented: (TIKA-328) Add parser for .flv videos
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-328) Add parser for .flv videos
Sami Siren (JIRA)
[jira] Updated: (TIKA-328) Add parser for .flv videos
Sami Siren (JIRA)
[jira] Resolved: (TIKA-328) Add parser for .flv videos
Jukka Zitting (JIRA)
[jira] Created: (TIKA-327) Parsing "HTML" as DcXML
Erik Hetzner (JIRA)
[jira] Updated: (TIKA-327) Parsing "HTML" as DcXML
Erik Hetzner (JIRA)
[jira] Commented: (TIKA-327) Parsing "HTML" as DcXML
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-327) Parsing "HTML" as DcXML
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-327) Parsing "HTML" as DcXML
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-327) Parsing "HTML" as DcXML
Erik Hetzner (JIRA)
[jira] Created: (TIKA-326) Map javax.imageio.IIOException to TikaException
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-326) Map javax.imageio.IIOException to TikaException
Jukka Zitting (JIRA)
[jira] Created: (TIKA-325) tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear>
Luke Nezda (JIRA)
[jira] Updated: (TIKA-325) tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear>
Luke Nezda (JIRA)
[jira] Updated: (TIKA-325) tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear>
Luke Nezda (JIRA)
[jira] Resolved: (TIKA-325) tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear>
Jukka Zitting (JIRA)
[jira] Created: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Issue Comment Edited: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Issue Comment Edited: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[jira] Resolved: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[jira] Issue Comment Edited: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Peter Wolanin (JIRA)
[VOTE] Apache Tika 0.5 release candidate #1
Mattmann, Chris A (388J)
Re: [VOTE] Apache Tika 0.5 release candidate #1
Karl Heinz Marbaise
Re: [VOTE] Apache Tika 0.5 release candidate #1
Jukka Zitting
Re: [VOTE] Apache Tika 0.5 release candidate #1
Karl Heinz Marbaise
Re: [VOTE] Apache Tika 0.5 release candidate #1
Jukka Zitting
Re: [VOTE] Apache Tika 0.5 release candidate #1
Mattmann, Chris A (388J)