tika-dev
Thread
Date
Earlier messages
Later messages
Messages by Thread
jempbox missing from Apache Maven repo?
Ken Krugler
Re: jempbox missing from Apache Maven repo?
Jukka Zitting
[jira] Created: (TIKA-381) HtmlParser should strip linefeeds out of links
Ken Krugler (JIRA)
[jira] Commented: (TIKA-381) HtmlParser should strip linefeeds out of links
Ken Krugler (JIRA)
[BUG ?] MimeType "IOException: Stream closed" with VFS streams
Ronan KERDUDOU - VirageGroup
Re: [BUG ?] MimeType "IOException: Stream closed" with VFS streams
Jukka Zitting
[jira] Resolved: (TIKA-317) Service provider -based Tika configuration
Jukka Zitting (JIRA)
Re: [jira] Resolved: (TIKA-317) Service provider -based Tika configuration
Sami Siren
[jira] Updated: (TIKA-317) Service provider -based Tika configuration
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-317) Annotation-based Tika configuration
Jukka Zitting (JIRA)
[jira] Created: (TIKA-380) Upgrade to PDFBox 1.0.0
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-380) Upgrade to PDFBox 1.0.0
Jukka Zitting (JIRA)
[jira] Created: (TIKA-379) Attribute on html tag not represented in XHTML
Julien Nioche (JIRA)
[jira] Updated: (TIKA-379) Lang attribute on html tag skipped
Julien Nioche (JIRA)
[jira] Commented: (TIKA-379) Lang attribute on html tag skipped
Ken Krugler (JIRA)
[jira] Updated: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Commented: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Issue Comment Edited: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Commented: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Commented: (TIKA-379) Html elements and attributes not available in XHTML representation
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Updated: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Updated: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Commented: (TIKA-379) Html elements and attributes not available in XHTML representation
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-379) Html elements and attributes not available in XHTML representation
Julien Nioche (JIRA)
[jira] Assigned: (TIKA-379) Html elements and attributes not available in XHTML representation
Chris A. Mattmann (JIRA)
[jira] Created: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Sami Siren (JIRA)
[jira] Commented: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Ken Krugler (JIRA)
[jira] Commented: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Sami Siren (JIRA)
[jira] Updated: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-378) TikaConfig should notify users if it cannot initialize some parser
Jukka Zitting (JIRA)
maven build depends on en locale
Timo Boehme
BAD pgp signature with release 0.6
Timo Boehme
Re: BAD pgp signature with release 0.6
Timo Boehme
[jira] Created: (TIKA-377) Error parsing HTML partial with AutoDetect parser
Brett S. (JIRA)
[jira] Updated: (TIKA-377) Error parsing HTML partial with AutoDetect parser
Brett S. (JIRA)
[jira] Resolved: (TIKA-377) Error parsing HTML partial with AutoDetect parser
Jukka Zitting (JIRA)
[jira] Created: (TIKA-376) Typo in parse-rtf spec in tika-config.xml
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-376) Typo in parse-rtf spec in tika-config.xml
Chris A. Mattmann (JIRA)
Bug in tika-config xml
Martin Gerhardy
Bug in tika-config xml
Martin Gerhardy
Re: Bug in tika-config xml
Mattmann, Chris A (388J)
Build failed in Hudson: Tika-trunk #265
Apache Hudson Server
Re: Build failed in Hudson: Tika-trunk #265
Jukka Zitting
Hudson build is back to normal: Tika-trunk #266
Apache Hudson Server
[jira] Created: (TIKA-375) Improve code quality metrics
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-375) Improve code quality metrics
Jeroen Reijn (JIRA)
[jira] Commented: (TIKA-375) Improve code quality metrics
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-375) Improve code quality metrics
Jeroen Reijn (JIRA)
Tika quality metrics
Jukka Zitting
[ANNOUNCE] Apache Tika 0.6 released
Mattmann, Chris A (388J)
Character encodings on the web
Jukka Zitting
Re: Character encodings on the web
Ken Krugler
[RESULT] [VOTE] Apache Tika 0.6 release candidate #1
Mattmann, Chris A (388J)
Fwd: [RESULT] [VOTE] Apache Tika 0.6 release candidate #1
Dave Meikle
[jira] Created: (TIKA-374) AutoDetectParser not thread-safe?
Adam Rauch (JIRA)
[jira] Resolved: (TIKA-374) AutoDetectParser not thread-safe?
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-141) Mime Content Type detection of a web document from its URL.
Jukka Zitting (JIRA)
Re: Timeout support with parsers
Jukka Zitting
Re: Timeout support with parsers
Ken Krugler
Re: Timeout support with parsers
Jukka Zitting
[jira] Created: (TIKA-373) Upgrade to POI 3.7 (or 4.0?)
Jukka Zitting (JIRA)
[jira] Created: (TIKA-372) Channel and SampleRate information for MP3 files
Nick Burch (JIRA)
[jira] Updated: (TIKA-372) Channel and SampleRate information for MP3 files
Nick Burch (JIRA)
[jira] Commented: (TIKA-372) Channel and SampleRate information for MP3 files
Jukka Zitting (JIRA)
[jira] Resolved: (TIKA-372) Channel and SampleRate information for MP3 files
Jukka Zitting (JIRA)
[jira] Created: (TIKA-371) Excel formatting depends on the default locale
Jukka Zitting (JIRA)
[jira] Updated: (TIKA-371) Excel formatting depends on the default locale
Jukka Zitting (JIRA)
[jira] Created: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox
Ken Krugler (JIRA)
[jira] Commented: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox
Ken Krugler (JIRA)
[jira] Issue Comment Edited: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox
Kenny Neal (JIRA)
[jira] Created: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Updated: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Issue Comment Edited: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Issue Comment Edited: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Commented: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Updated: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Updated: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Updated: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
[jira] Updated: (TIKA-369) Improve accuracy of language detection
Ken Krugler (JIRA)
Ogg vorbis metadata?
Nick Burch
Re: Ogg vorbis metadata?
Jukka Zitting
Re: Ogg vorbis metadata?
Nick Burch
[jira] Created: (TIKA-368) ID3v2 support for mp3 parser
Nick Burch (JIRA)
[jira] Updated: (TIKA-368) ID3v2 support for mp3 parser
Nick Burch (JIRA)
[jira] Updated: (TIKA-368) ID3v2 support for mp3 parser
Nick Burch (JIRA)
[jira] Updated: (TIKA-368) ID3v2 support for mp3 parser
Nick Burch (JIRA)
[jira] Resolved: (TIKA-368) ID3v2 support for mp3 parser
Jukka Zitting (JIRA)
[VOTE] Apache Tika 0.6 release candidate #1
Mattmann, Chris A (388J)
Re: [VOTE] Apache Tika 0.6 release candidate #1
Dave Meikle
Re: [VOTE] Apache Tika 0.6 release candidate #1
Mattmann, Chris A (388J)
Re: [VOTE] Apache Tika 0.6 release candidate #1
Dave Meikle
Re: [VOTE] Apache Tika 0.6 release candidate #1
Karl Heinz Marbaise
Re: [VOTE] Apache Tika 0.6 release candidate #1
Mattmann, Chris A (388J)
Re: [VOTE] Apache Tika 0.6 release candidate #1
Karl Heinz Marbaise
Re: [VOTE] Apache Tika 0.6 release candidate #1
Jukka Zitting
Re: [VOTE] Apache Tika 0.6 release candidate #1
Mattmann, Chris A (388J)
Re: [VOTE] Apache Tika 0.6 release candidate #1
Dave Meikle
Re: [VOTE] Apache Tika 0.6 release candidate #1
Grant Ingersoll
Re: [VOTE] Apache Tika 0.6 release candidate #1
Ted Dunning
[jira] Created: (TIKA-367) Mime type rootXML equality improvement
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-367) Mime type rootXML equality improvement
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-367) Mime type rootXML equality improvement
Chris A. Mattmann (JIRA)
[jira] Created: (TIKA-366) Increase buffer size for mime type sniffing
Chris A. Mattmann (JIRA)
[jira] Resolved: (TIKA-366) Increase buffer size for mime type sniffing
Chris A. Mattmann (JIRA)
Tika 0.5 API
Stefan Burger
Re: Tika 0.5 API
Jukka Zitting
[jira] Created: (TIKA-365) Extract more OpenDocument metadata
Nick Burch (JIRA)
[jira] Updated: (TIKA-365) Extract more OpenDocument metadata
Nick Burch (JIRA)
[jira] Updated: (TIKA-365) Extract more OpenDocument metadata
Nick Burch (JIRA)
[jira] Updated: (TIKA-365) Extract more OpenDocument metadata
Nick Burch (JIRA)
[jira] Updated: (TIKA-365) Extract more OpenDocument metadata
Nick Burch (JIRA)
[jira] Resolved: (TIKA-365) Extract more OpenDocument metadata
Jukka Zitting (JIRA)
[jira] Commented: (TIKA-365) Extract more OpenDocument metadata
Ingo Renner (JIRA)
[jira] Commented: (TIKA-365) Extract more OpenDocument metadata
Uwe Schindler (JIRA)
Extracting dublin core metadata in HtmlParser?
Nick Burch
Re: Extracting dublin core metadata in HtmlParser?
Ken Krugler
Hudson build became unstable: Tik a-trunk » Apache Tika parsers #252
Apache Hudson Server
Hudson build is still unstable: Ti ka-trunk » Apache Tika parsers #253
Apache Hudson Server
Hudson build is back to stable: Ti ka-trunk » Apache Tika parsers #254
Apache Hudson Server
Hudson build became unstable: Tika-trunk #252
Apache Hudson Server
Hudson build is still unstable: Tika-trunk #253
Apache Hudson Server
Hudson build is back to stable: Tika-trunk #254
Apache Hudson Server
Tika command line performance
Doug Carter
Re: Tika command line performance
Ken Krugler
Re: Tika command line performance
Doug Carter
Re: Tika command line performance
Ken Krugler
Re: Tika command line performance
Doug Carter
Re: Tika command line performance
Luke Nezda
[jira] Created: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets
Maxim Valyanskiy (JIRA)
[jira] Updated: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets
Maxim Valyanskiy (JIRA)
[jira] Resolved: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets
Jukka Zitting (JIRA)
[jira] Created: (TIKA-363) PDF Content Type seen as application/rdf+xml not appliction/pdf
Tim Reynolds (JIRA)
[jira] Resolved: (TIKA-363) PDF Content Type seen as application/rdf+xml not appliction/pdf
Jukka Zitting (JIRA)
PDF parser exception
Doug Carter
Re: PDF parser exception
Ken Krugler
Re: PDF parser exception
Doug Carter
Re: PDF parser exception
Ken Krugler
[jira] Created: (TIKA-362) Add publisher support
Nick Burch (JIRA)
[jira] Updated: (TIKA-362) Add publisher support
Nick Burch (JIRA)
[jira] Resolved: (TIKA-362) Add publisher support
Jukka Zitting (JIRA)
[jira] Created: (TIKA-361) Update OutlookExtractor to match new POI API
Nick Burch (JIRA)
[jira] Updated: (TIKA-361) Update OutlookExtractor to match new POI API
Nick Burch (JIRA)
Tika Dependency to bouncycastle lib..Tika 0.5 / Tika 0.6-SNAPSHOT...
Karl Heinz Marbaise
Re: Tika Dependency to bouncycastle lib..Tika 0.5 / Tika 0.6-SNAPSHOT...
Ken Krugler
TIKA-103 - Excel Number/Date Formatting.
Dave Meikle
Re: TIKA-103 - Excel Number/Date Formatting.
Mattmann, Chris A (388J)
Re: TIKA-103 - Excel Number/Date Formatting.
Dave Meikle
Re: TIKA-103 - Excel Number/Date Formatting.
Mattmann, Chris A (388J)
Re: TIKA-103 - Excel Number/Date Formatting.
Dave Meikle
[jira] Resolved: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
[jira] Created: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser
Dave Meikle (JIRA)
[jira] Commented: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser
Dave Meikle (JIRA)
[jira] Created: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names
Ken Krugler (JIRA)
[jira] Commented: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names
Ken Krugler (JIRA)
[jira] Updated: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names
Ken Krugler (JIRA)
[jira] Created: (TIKA-358) Auto-detection of HTML fails with common auto-generated template
Ken Krugler (JIRA)
[jira] Updated: (TIKA-358) Auto-detection of HTML fails with common auto-generated template
Ken Krugler (JIRA)
Another shutdown error thrown during parsing
Ken Krugler
Re: Another shutdown error thrown during parsing
Jukka Zitting
[jira] Assigned: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
[jira] Commented: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
[jira] Commented: (TIKA-103) Excel parsing ignores cell formating
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
[jira] Updated: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
[jira] Updated: (TIKA-103) Excel parsing ignores cell formating
Dave Meikle (JIRA)
PDFBox bug in 0.8-incubating
Ken Krugler
[jira] Created: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Updated: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Updated: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing
Jukka Zitting (JIRA)
[jira] Assigned: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
[jira] Issue Comment Edited: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
[jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
[jira] Updated: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-357) Increase buffer size for meta tag sniffing
Chris A. Mattmann (JIRA)
Committer questions
Ken Krugler
Re: Committer questions
Mattmann, Chris A (388J)
Re: Committer questions
Grant Ingersoll
Re: Committer questions
Andrzej Bialecki
Tika jar without dependencies
Jana, Kumar Raja
Re: Tika jar without dependencies
Mattmann, Chris A (388J)
[jira] Created: (TIKA-356) Wrong Repository URL on the Web-Site
Karl Heinz Marbaise (JIRA)
[jira] Commented: (TIKA-356) Wrong Repository URL on the Web-Site
Karl Heinz Marbaise (JIRA)
[jira] Commented: (TIKA-356) Wrong Repository URL on the Web-Site
Ken Krugler (JIRA)
[jira] Updated: (TIKA-356) Wrong Repository URL on the Web-Site
Ken Krugler (JIRA)
[jira] Resolved: (TIKA-356) Wrong Repository URL on the Web-Site
Jukka Zitting (JIRA)
[jira] Created: (TIKA-355) DublinCore constants should be prefixed with "dc."
Vivek Magotra (JIRA)
[jira] Created: (TIKA-354) ProfilingHandler should take a length-limiting parameter
Vivek Magotra (JIRA)
[jira] Assigned: (TIKA-354) ProfilingHandler should take a length-limiting parameter
Ken Krugler (JIRA)
[jira] Commented: (TIKA-354) ProfilingHandler should take a length-limiting parameter
Ken Krugler (JIRA)
Earlier messages
Later messages