Hey Jukka, Looks good, +1. How about adding the following:
> > <draft> > Apache Tika is a toolkit for detecting and extracting metadata and > structured text content from various documents using existing parser > libraries. > > Development towards Tika 0.3 is ongoing. Metadata handling and > metadata frameworks like XMP have been a source of much discussion, > but so far no clear consensus on has been reached on whether or how > the metadata features in Tika should be extended. > > A wiki was created for Tika. The 0.3 release candidate should be in place and the release should be pushed out in March. > </draft> > WDTY? Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.mattm...@jpl.nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Disclaimer: The opinions presented within are my own and do not reflect those of either NASA, JPL, or the California Institute of Technology.