[jira] [Resolved] (TIKA-1024) An MP3 with an UTF-16 ID3 tag containing only the BOM should produce empty string value for that tag

2012-11-18 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-1024. -- Resolution: Fixed An MP3 with an UTF-16 ID3 tag containing only the BOM should

[jira] [Resolved] (TIKA-1025) Powerpoint (.ppt) parser doesn't leave placeholder where documents are embedded

2012-11-18 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-1025. -- Resolution: Fixed Fix Version/s: 1.3 Powerpoint (.ppt) parser doesn't leave

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-11-18 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13499838#comment-13499838 ] Michael McCandless commented on TIKA-369: - +1 to cut over to

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-11-18 Thread Pander Musubi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13499847#comment-13499847 ] Pander Musubi commented on TIKA-369: language-detection uses a variable length n-grams.

Build failed in Jenkins: Tika-trunk #943

2012-11-18 Thread Apache Jenkins Server
See https://builds.apache.org/job/Tika-trunk/943/changes Changes: [mikemccand] TIKA-1025: leave placeholder where embedded docs appear in .ppt extraction [mikemccand] TIKA-1024: don't returned naked BOM for MP3 ID3 tag values -- [...truncated 1878

Re: Build failed in Jenkins: Tika-trunk #943

2012-11-18 Thread Michael McCandless
Looks like another Jenkins hiccup: Nov 19, 2012 12:30:20 AM hudson.remoting.SynchronousCommandTransport$ReaderThread run SEVERE: I/O error in channel channel java.io.StreamCorruptedException at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1332) at

Jenkins build is back to normal : Tika-trunk #944

2012-11-18 Thread Apache Jenkins Server
See https://builds.apache.org/job/Tika-trunk/944/