Hey Ken, This is fine. I wanted to get going with our Julia/MIT-LL Text.jl based detector and turning LanguageIdentifier into an interface. Me and Trevor (CC’ed) are working on it, but not sure where we’re at and shouldn’t be a blocker to moving forward.
Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ -----Original Message----- From: Ken Krugler <kkrugler_li...@transpac.com> Reply-To: "dev@tika.apache.org" <dev@tika.apache.org> Date: Thursday, February 4, 2016 at 12:23 PM To: "tika-...@lucene.apache.org" <tika-...@lucene.apache.org> Subject: Tika 2.0 and language detection >Hi all, > >Over at https://issues.apache.org/jira/browse/TIKA-1723, Tim & I have >been discussing whether to focus these pending changes on the 2.0 branch, >and leave 1.x as-is. > >As part of that, we could do a cut-and-run in 2.0, and not spend the time >to port the current (Tika 1.x) language detector code. > >I'm in favor of that approach, as I think leveraging the new detector >project(s) gives us faster & more accurate results over more languages. > >But we're posting to the more general audience here, to gather input on >things that we might not be considering. > >Thanks, > >-- Ken > > > >-------------------------- >Ken Krugler >+1 530-210-6378 >http://www.scaleunlimited.com >custom big data solutions & training >Hadoop, Cascading, Cassandra & Solr > > > > >