Hi Chris,

Thanks for the input.
RE: #3 Yeah, Sebastien and I are now discussing this and will address it within NUTCH-1537.

Thanks
Lewis

On Sun, Mar 3, 2013 at 9:41 PM, Mattmann, Chris A (388J) <[email protected]> wrote:

> Hey Lewis,
>
> RE: #3 - it would be great to get Nutch using Tika's metadata container -
> I don't think we have anything special in Nutch that prevents it.
> RE: #2 - I committed your Tika doc patch during ApacheCon NA 2013 so
> thanks!
>
> Thanks!
>
> Cheers,
> Chris
>
>
> From: Lewis John Mcgibbney <[email protected]>
> Reply-To: "[email protected]" <[email protected]>
> Date: Tuesday, February 26, 2013 3:25 PM
> To: "[email protected]" <[email protected]>
> Subject: Improvement in Metadata Class
>
> Hi,
> (This is maybe traffic for dev@ but I hope it is OK here on user@)
>
> 1.
> In Apache Nutch we are using the Metadata class [0] as follows
>     if (tikaMDName.equalsIgnoreCase(Metadata.TITLE)) continue;
> TITLE value is deprecated and I want to upgrade API usage.
> What should I be using?
>
> 2.
> I would like to contribute to the Tika Java documentation for this as I am
> not happy with the current Java documentation for this class.
>
> 3.
> We also currently maintain a legacy Metadata package [1] within Nutch.
> This is a multi-valued Metadata container including sets of constant fields
> for Nutch webpage and host metadata.
> How much of this stuff do we actually need (to be maintaining)? Should we
> not be leveraging more of the stuff available within Apache Tika for
> Metadata fields. Is this a case of the more the merrier here?
>
> Thank you very much in advance. I look forward to hearing back from anyone
> on this, I am at ApacheCon just now and will cook up a patch based on the
> feedback. Thank you.
>
> Lewis
>
> [0]
> http://tika.apache.org/1.3/api/index.html?org/apache/tika/metadata/Metadata.html
> [1]
> http://svn.apache.org/repos/asf/nutch/trunk/src/java/org/apache/nutch/metadata/
> --
> *Lewis*
>

--
*Lewis*
