Re: Announcing go-tika, a Go package for Tika

2017-10-06 Thread Chris Mattmann
I saw this Tyler, and it’s awesome. I forked it already, though I’m not a Go programmer. Thank you for increasing the community here (CC’ing Jim Jag, who I know has done some Go programming; Jim, spread the word ;) Cheers, Chris On 10/6/17, 10:12 AM, "Tyler Bui-Palsulich"

Re: [DISCUSS] Enable specific ContentHandler for tika-server

2017-10-06 Thread Sergey Beryozkin
On 06/10/17 18:08, Konstantin Gribov wrote: My +1 to this idea. IMHO, second option is more flexible. I also like Nick's suggestion about using default package for handlers and interpret dot-separated string as fqcn. Solr does similar thing and it's very convenient to use (but they use prefix

[jira] [Commented] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194974#comment-16194974 ] Eugene Kirpichov commented on TIKA-2472: See my comment on the commit

Re: Announcing go-tika, a Go package for Tika

2017-10-06 Thread Tyler Bui-Palsulich
(Bumping this since it looks like the first message didn't go through.) Tyler On Mon, Oct 2, 2017 at 1:27 PM, Tyler Bui-Palsulich wrote: > Hi Everyone, > > I am happy to announce go-tika, a Go package which makes it easy to use > Tika from Go! See

Re: [DISCUSS] Enable specific ContentHandler for tika-server

2017-10-06 Thread Konstantin Gribov
My +1 to this idea. IMHO, second option is more flexible. I also like Nick's suggestion about using default package for handlers and interpret dot-separated string as fqcn. Solr does similar thing and it's very convenient to use (but they use prefix `solr.` for their classes in predefined package
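
A rough illustration of the naming rule discussed in this message: a name containing a dot is treated as a fully qualified class name, and a bare name is looked up in a default package. This is only a sketch under assumptions. The `HandlerNames` class and its methods are hypothetical and not part of tika-server, and the choice of `org.apache.tika.sax` as the default package is an assumption made for the example.

```java
import org.xml.sax.ContentHandler;

// Hypothetical helper for illustration only; not an existing tika-server class.
final class HandlerNames {
    // Assumed default package for bare handler names.
    static final String DEFAULT_PACKAGE = "org.apache.tika.sax";

    // A dot-separated name is taken as a fully qualified class name;
    // a bare name is prefixed with the default package.
    static String resolve(String name) {
        String trimmed = name.trim();
        if (trimmed.isEmpty()) {
            throw new IllegalArgumentException("handler name must not be empty");
        }
        return trimmed.indexOf('.') >= 0 ? trimmed : DEFAULT_PACKAGE + "." + trimmed;
    }

    // Loads the resolved class and rejects anything that is not a ContentHandler.
    static Class<? extends ContentHandler> load(String name) throws ClassNotFoundException {
        return Class.forName(resolve(name)).asSubclass(ContentHandler.class);
    }

    public static void main(String[] args) {
        System.out.println(resolve("ToTextContentHandler"));
        System.out.println(resolve("com.example.MyHandler"));
    }
}
```

Checking the loaded class with `asSubclass` keeps a request parameter from instantiating arbitrary classes, though a real server would likely also want an allow-list.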

[jira] [Commented] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195303#comment-16195303 ] Sergey Beryozkin commented on TIKA-2472: I've got a bit of shock with this code: {code:java} @Test

[jira] [Reopened] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin reopened TIKA-2472: With thanks to Eugene... > Implement Metadata.hashCode > --- > >

[jira] [Commented] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195275#comment-16195275 ] Sergey Beryozkin commented on TIKA-2472: I'd not qualify it as incorrect but as sub-optimal. And I

[jira] [Commented] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Eugene Kirpichov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195284#comment-16195284 ] Eugene Kirpichov commented on TIKA-2472: It is not consistent with equals(), so it is definitely
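
For context on the contract being debated here: in Java, two objects that are equal under `equals()` must return the same `hashCode()`. The sketch below shows one way to satisfy that for a container holding multiple string values per name. It is an illustration only. `MultiValued` is a hypothetical stand-in, not Tika's `Metadata` class, and the code does not reproduce the commit under discussion.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a multi-valued metadata container; not Tika's Metadata.
final class MultiValued {
    private final Map<String, String[]> values = new HashMap<>();

    // Appends a value to the array stored under the given name.
    void add(String name, String value) {
        String[] old = values.get(name);
        if (old == null) {
            values.put(name, new String[] {value});
        } else {
            String[] grown = Arrays.copyOf(old, old.length + 1);
            grown[old.length] = value;
            values.put(name, grown);
        }
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof MultiValued)) return false;
        MultiValued other = (MultiValued) o;
        if (!values.keySet().equals(other.values.keySet())) return false;
        for (Map.Entry<String, String[]> e : values.entrySet()) {
            // Compare array contents, not array identity.
            if (!Arrays.equals(e.getValue(), other.values.get(e.getKey()))) return false;
        }
        return true;
    }

    @Override
    public int hashCode() {
        int h = 0;
        for (Map.Entry<String, String[]> e : values.entrySet()) {
            // Arrays.hashCode hashes contents; calling hashCode() on the array
            // itself is identity-based and would disagree with equals().
            h += e.getKey().hashCode() ^ Arrays.hashCode(e.getValue());
        }
        return h;
    }
}
```

Summing the per-entry hashes makes the result independent of map iteration order, which matches the order-insensitive key comparison in `equals()`.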

[jira] [Commented] (TIKA-2472) Implement Metadata.hashCode

2017-10-06 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195386#comment-16195386 ] Ken Krugler commented on TIKA-2472: --- I had to deal with this before in another project - FWIR, I

Re: [DISCUSS] Enable specific ContentHandler for tika-server

2017-10-06 Thread Sergey Beryozkin
Konstantin, by the way, if you are interested in having a good discussion to do with using the serialized lambdas then you will be welcome to comment on the relevant text in the Tika Concerns Beam thread, though maybe Beam knows how to take care of the issues you raised... Thanks, Sergey On

Re: Announcing go-tika, a Go package for Tika

2017-10-06 Thread Tom Barber
That's pretty cool, I was thinking about rewriting some of the OODT stuff in Go as a bit of a side project but then wondered how I'd integrate Tika, I wonder no more! ;) On Fri, Oct 6, 2017 at 6:15 PM, Chris Mattmann wrote: > I saw this Tyler, and it’s awesome. I forked it

[jira] [Created] (TIKA-2473) PCX and DCX image support

2017-10-06 Thread Matthew Caruana Galizia (JIRA)
Matthew Caruana Galizia created TIKA-2473: - Summary: PCX and DCX image support Key: TIKA-2473 URL: https://issues.apache.org/jira/browse/TIKA-2473 Project: Tika Issue Type:

[jira] [Commented] (TIKA-2473) PCX and DCX image support

2017-10-06 Thread Matthew Caruana Galizia (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194435#comment-16194435 ] Matthew Caruana Galizia commented on TIKA-2473: --- Magic: byte 0: x0A byte 1: either x00,

[jira] [Comment Edited] (TIKA-2473) PCX and DCX image support

2017-10-06 Thread Matthew Caruana Galizia (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16194435#comment-16194435 ] Matthew Caruana Galizia edited comment on TIKA-2473 at 10/6/17 10:42 AM:

[jira] [Commented] (TIKA-2473) PCX and DCX image support

2017-10-06 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195579#comment-16195579 ] Nick Burch commented on TIKA-2473: -- I've added some test files, mime magic and detection. The magic for