Thanks Pankaj. You may want to start a new thread with specific topics that you'd like to discuss. This is a thread related to JIRA and TIKA-992 specific to OpenGraph.
I suggest you: * hang around on dev@ and see if there are topics that interest you that spring up and contribute to the discussion there * review Tika code and suggest improvements, etc., to it, in new threads, on in Tika JIRA, reff'ed below. * review Tika JIRA and existing open bugs/issues and contribute there HTH! Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ -----Original Message----- From: Pankaj Kumar <[email protected]> Reply-To: "[email protected]" <[email protected]> Date: Monday, May 13, 2013 10:04 AM To: "[email protected]" <[email protected]> Subject: Re: [jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values >Hello All, > >I am new learner of Apache Tika and am very much interested to do some >projects using it. >So, it would be very kind of you, if you could suggest me some project >ideas. > >With Regards, >Pankaj Kumar > > > >On Sun, May 12, 2013 at 12:49 PM, kiran (JIRA) <[email protected]> wrote: > >> >> [ >> >>https://issues.apache.org/jira/browse/TIKA-992?page=com.atlassian.jira.pl >>ugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13655622#com >>ment-13655622] >> >> kiran commented on TIKA-992: >> ---------------------------- >> >> Hi, >> >> MultiValues are not stored for any metatags in the HTML and any metatag >> can have multiValued fields too. >> >> When we use Tika for parsing with Nutch, we noticed that Tika does not >> store the multiValues for any html metatag. Tika only places one value >>in >> the DOM tree as reported in NUTCH-1467. >> >> Does this patch allow Tika to have multiValues for any metatag or just >> OpenGraph metatags ? >> >> >> > OpenGraph meta tags to allow multiple values >> > -------------------------------------------- >> > >> > Key: TIKA-992 >> > URL: https://issues.apache.org/jira/browse/TIKA-992 >> > Project: Tika >> > Issue Type: Bug >> > Affects Versions: 1.3 >> > Reporter: Markus Jelsma >> > Priority: Minor >> > Fix For: 1.4 >> > >> > Attachments: TIKA-992-1.3-1.patch >> > >> > >> > HtmlHandler should use Metadata.add() for Open Graph properties >>instead >> of the HtmlHandler.addHtmlMetadata() method which uses Metadata.set(). >>The >> og:* properties can be multivalued. The Metadata.set() method overwrites >> previous entries because it doesn't use Metadata.appendedValues(). >> >> -- >> This message is automatically generated by JIRA. >> If you think it was sent incorrectly, please contact your JIRA >> administrators >> For more information on JIRA, see: >>http://www.atlassian.com/software/jira >>
