RE: My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Allison, Timothy B.
Great slides. Thank you, Nick. Wish I could be there... Any feedback/guidance from the audience? -Original Message- From: Nick Burch [mailto:n...@apache.org] Sent: Wednesday, May 11, 2016 5:09 PM To: user@tika.apache.org Cc: d...@tika.apache.org Subject: My "What's new with Apache Tika

My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Nick Burch
Hi All, For those who couldn't make it to Vancouver this week, the slides from my "What's new with Apache Tika 2.0" talk are now available online: http://www.slideshare.net/NickBurch2/apache-tika-whats-new-with-20 The audio was recorded; hopefully that will be available to go with the slides i

Re: XML Parser with type recognition

2016-05-11 Thread plugman
OK, found it! The XML files I receive don't have the "" description. Without it, the detected type is application/xml, whereas it is application/xml-vast when it is present. I have one last question: is it possible to run validation of the XML to reinforce the detection? Thanks a lot for your help. Yo

Re: XML Parser with type recognition

2016-05-11 Thread plugman
I already tested that, but it doesn't work; I still get the application/xml content type. Yoyoma. - Original message - From: "Nick Burch" To: user@tika.apache.org Sent: Wednesday, 11 May 2016 21:14:55 Subject: Re: XML Parser with type recognition On Wed, 11 May 2016, plug...@free.fr wrote: > If you can take

Re: XML Parser with type recognition

2016-05-11 Thread Nick Burch
On Wed, 11 May 2016, plug...@free.fr wrote: If you can take a look at my little gist example https://gist.github.com/anonymous/3506db4367040ea8f381c5b7b435b3f9 it will be very helpful. The localName parameter is case sensitive. Your sample file starts with Nick
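
To illustrate the case-sensitivity point above, here is a minimal sketch using only the JDK's StAX parser. It does not call Tika and is not Tika's detection code; it only shows that an exact comparison against the root element's local name fails when the case differs. The root element name `VAST`, the class `RootNameSketch`, and both method names are assumptions made up for this example.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not part of Tika: reads the root element's local name
// and compares it case-sensitively to an expected name.
final class RootNameSketch {

    // Returns the local name of the first (root) element in the given XML text.
    static String rootLocalName(String xml) {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // DTD processing is not needed just to read the root element name.
        factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
        try {
            XMLStreamReader reader = factory.createXMLStreamReader(new StringReader(xml));
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT) {
                        return reader.getLocalName();
                    }
                }
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalArgumentException("Input is not well-formed XML", e);
        }
        throw new IllegalArgumentException("No root element found");
    }

    // Exact, case-sensitive match: "VAST" and "vast" are different names.
    static boolean rootMatches(String xml, String expectedLocalName) {
        return expectedLocalName.equals(rootLocalName(xml));
    }

    public static void main(String[] args) {
        // "VAST" is an assumed root element name, used only for illustration.
        String doc = "<?xml version=\"1.0\"?><VAST version=\"3.0\"/>";
        System.out.println(rootMatches(doc, "VAST")); // prints true
        System.out.println(rootMatches(doc, "vast")); // prints false
    }
}
```

If a custom type is still reported as application/xml, comparing the configured localName character by character with the document's actual root element is a cheap first check.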

Re: XML Parser with type recognition

2016-05-11 Thread plugman
I have tested a lot and finally found how to override custom types, but I do not understand why my code can't return the correct Content-Type. If I leave a glob pattern like this: and rename my Vast3.xml to Vast3.dda, it works because of the file extension, but I want to read the Vast3.xml file, th

Re: XML Parser with type recognition

2016-05-11 Thread Nick Burch
On Wed, 11 May 2016, plug...@free.fr wrote: OK, if I understand correctly, I can create a specific MIME type in the tika-mimetypes.xml resource file like this: http://www.w3.org/2001/XMLSchema-instance"/> Almost - you can't set that glob as it's already claimed. Otherwise, assuming that is the righ

Configuring GrobidJournalParser from Java code?

2016-05-11 Thread Betsey Benagh
(X-posted from StackOverflow) I'm trying to write a Java application that uses the JournalParser to extract authors, citations, etc. from journal articles. The documentati

RE: Need Help

2016-05-11 Thread Allison, Timothy B.
Haven’t gotten around to this yet. Sorry. Anyone else have any input? From: harsh kumar [mailto:kumarhars...@gmail.com] Sent: Friday, May 6, 2016 8:48 AM To: Allison, Timothy B. Subject: Re: Need Help Hey Timothy, Can you please help me with your findings of the TIKA? I would be thankful to

RE: [VOTE] Release Apache Tika 1.13 Candidate #1

2016-05-11 Thread Allison, Timothy B.
+1 Built on Windows and Linux. I'm relying on earlier pre-release tests for no surprises. :) Thank you, Dave! -Original Message- From: David Meikle [mailto:loo...@gmail.com] On Behalf Of David Meikle Sent: Monday, May 9, 2016 3:35 PM To: d...@tika.apache.org; user@tika.apache.org Subje

Re: XML Parser with type recognition

2016-05-11 Thread plugman
Hi Nick, OK, if I understand correctly, I can create a specific MIME type in the tika-mimetypes.xml resource file like this: http://www.w3.org/2001/XMLSchema-instance"/> But if I understand correctly, the Tika parser (or detector) will check that the MIME type is "application/xml" and return "appli