Re: [PROPOSAL] Any23 to join the incubator

2011-09-26 Thread Mattmann, Chris A (388J)
Hi All, OK, since the chatter about this proposal has died down and since I've agreed to champion it, I'll call a formal VOTE tomorrow afternoon and let it run through the rest of the week. The Tika PMC has not registered any objections to sponsoring the proposal, so I will go ahead and update

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Oleg Tikhonov
In favor of releasing the Tika 0.10, +1 On Mon, Sep 26, 2011 at 9:50 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hi Folks, A first release candidate for the Tika 0.10 release is available at: http://people.apache.org/~mattmann/apache-tika-0.10/rc1/ The release

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Christian Göller
+1 BR Christian On 26.09.2011 08:50, Mattmann, Chris A (388J) wrote: Hi Folks, A first release candidate for the Tika 0.10 release is available at: http://people.apache.org/~mattmann/apache-tika-0.10/rc1/ The release candidate is a zip archive of the sources in:

[jira] [Created] (TIKA-731) NPE in WordExtractor.handleParagraph()

2011-09-26 Thread Pablo Queixalos (JIRA)
NPE in WordExtractor.handleParagraph() -- Key: TIKA-731 URL: https://issues.apache.org/jira/browse/TIKA-731 Project: Tika Issue Type: Bug Components: parser Affects Versions: 0.10

[jira] [Updated] (TIKA-731) NPE in WordExtractor.handleParagraph()

2011-09-26 Thread Pablo Queixalos (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pablo Queixalos updated TIKA-731: - Attachment: document_proposition_referencement.doc Attachment #2 NPE in

[jira] [Updated] (TIKA-731) NPE in WordExtractor.handleParagraph()

2011-09-26 Thread Pablo Queixalos (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pablo Queixalos updated TIKA-731: - Attachment: energie_nucleaire_france_fiche1.doc Throws NPE NPE in

[jira] [Issue Comment Edited] (TIKA-731) NPE in WordExtractor.handleParagraph()

2011-09-26 Thread Pablo Queixalos (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114567#comment-13114567 ] Pablo Queixalos edited comment on TIKA-731 at 9/26/11 9:57 AM: ---

[jira] [Assigned] (TIKA-731) NPE in WordExtractor.handleParagraph()

2011-09-26 Thread Maxim Valyanskiy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Valyanskiy reassigned TIKA-731: - Assignee: Maxim Valyanskiy NPE in WordExtractor.handleParagraph()

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Maxim Valyanskiy
+1 On 26.09.2011 10:50, Mattmann, Chris A (388J) wrote: Hi Folks, A first release candidate for the Tika 0.10 release is available at: http://people.apache.org/~mattmann/apache-tika-0.10/rc1/ The release candidate is a zip archive of the sources in:

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Nick Burch
On Sun, 25 Sep 2011, Mattmann, Chris A (388J) wrote: A first release candidate for the Tika 0.10 release is available at: http://people.apache.org/~mattmann/apache-tika-0.10/rc1/ Couple of minor things: * The signature on the tika src file seems to have the wrong name (it's missing the

[jira] [Commented] (TIKA-727) Improve the outputed XHTML by HSLFExtractor

2011-09-26 Thread Pablo Queixalos (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114663#comment-13114663 ] Pablo Queixalos commented on TIKA-727: -- bq. Looking at the html, there are still some

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Mattmann, Chris A (388J)
Nice catch, Nick, thanks. I'd be happy to be at the keysigning in Vancouver :) One other quick thing I noticed is that the CHANGES.txt file doesn't include the typical set of contributors list that I've been putting in all the releases so far. Would anyone object to me updating the CHANGES.txt

[jira] [Commented] (TIKA-712) Master slide text isn't extracted

2011-09-26 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114700#comment-13114700 ] Nick Burch commented on TIKA-712: - It looks like we only want to exclude the placeholder

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Jukka Zitting
Hi, [x] +1 Release this package as Apache Tika 0.10 [ ] -1 Do not release this package because... On Mon, Sep 26, 2011 at 3:37 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Would anyone object to me updating the CHANGES.txt file and then respinning a new RC? I have no

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Mattmann, Chris A (388J)
Thanks Jukka. If that's the case, I'd prefer to simply update CHANGES.txt in the RC zip file, in the tag and the branch I'll create (and trunk), and then update the .asc file. I'll count the existing +1s (and my own +1! :) ) towards the VOTE, and let it go until 72 hours and then go from

[jira] [Resolved] (TIKA-732) Upgrade to Commons Codec 1.5

2011-09-26 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-732. Resolution: Fixed Done in revision 1175915. Upgrade to Commons Codec 1.5

Re: commons-codec dependency

2011-09-26 Thread Jukka Zitting
Hi, On Mon, Sep 26, 2011 at 4:48 PM, gross gros...@gmail.com wrote: As I see in the Maven tree, tika-parsers now depends on commons-codec:1.4 and on commons-codec:1.5 through poi:3.8-beta4. Check deps, please. Good catch, thanks! I filed TIKA-732 for this and updated the dependency in revision

Re: [VOTE] Apache Tika 0.10 release rc #1

2011-09-26 Thread Michael McCandless
+1 to release! I verified the signatures, smoke tested the JAR on a few docs (should we name it apache-tika-app-NN.jar in the future? I.e., add apache- in front), and ran mvn clean install test from the src zip. Mike McCandless http://blog.mikemccandless.com On Mon, Sep 26, 2011 at 11:44

Re: apache-tika-app? (Was: [VOTE] Apache Tika 0.10 release rc #1)

2011-09-26 Thread Michael McCandless
On Mon, Sep 26, 2011 at 12:20 PM, Jukka Zitting jukka.zitt...@gmail.com wrote: Hi, On Mon, Sep 26, 2011 at 6:03 PM, Michael McCandless luc...@mikemccandless.com wrote: (should we name it apache-tika-app-NN.jar in the future? I.e., add apache- in front) I don't think that's needed, just like