Hello All,

I am new to Apache Tika and am very interested in doing some projects
with it.
I would be grateful if you could suggest some project ideas.

With Regards,
Pankaj Kumar



On Sun, May 12, 2013 at 12:49 PM, kiran (JIRA) <[email protected]> wrote:

>
>     [
> https://issues.apache.org/jira/browse/TIKA-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13655622#comment-13655622]
>
> kiran commented on TIKA-992:
> ----------------------------
>
> Hi,
>
> Multiple values are not stored for any meta tag in the HTML, even though
> any meta tag can be multivalued.
>
> When we use Tika for parsing with Nutch, we noticed that Tika does not
> store the multiValues for any html metatag. Tika only places one value in
> the DOM tree as reported in NUTCH-1467.
>
> Does this patch allow Tika to store multiple values for any meta tag, or
> just for OpenGraph meta tags?
>
>
> > OpenGraph meta tags to allow multiple values
> > --------------------------------------------
> >
> >                 Key: TIKA-992
> >                 URL: https://issues.apache.org/jira/browse/TIKA-992
> >             Project: Tika
> >          Issue Type: Bug
> >    Affects Versions: 1.3
> >            Reporter: Markus Jelsma
> >            Priority: Minor
> >             Fix For: 1.4
> >
> >         Attachments: TIKA-992-1.3-1.patch
> >
> >
> > HtmlHandler should use Metadata.add() for Open Graph properties instead
> > of the HtmlHandler.addHtmlMetadata() method which uses Metadata.set(). The
> > og:* properties can be multivalued. The Metadata.set() method overwrites
> > previous entries because it doesn't use Metadata.appendedValues().
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
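For anyone following the quoted issue, here is a minimal sketch of the set-versus-add difference it describes. SimpleMetadata is a hypothetical stand-in written only for illustration; it is not Tika's Metadata class, and the example URLs are made up. The only behaviour taken from the issue text is that set() overwrites earlier entries while add() keeps them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class MultiValueSketch {

    /** Hypothetical stand-in for a metadata store; NOT Tika's Metadata class. */
    public static class SimpleMetadata {
        private final Map<String, List<String>> values = new HashMap<>();

        /** Replaces any existing values for the name with a single value. */
        public void set(String name, String value) {
            List<String> single = new ArrayList<>();
            single.add(value);
            values.put(name, single);
        }

        /** Appends the value, keeping earlier values for the same name. */
        public void add(String name, String value) {
            values.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }

        /** Returns a copy of all values for the name (empty list if none). */
        public List<String> getValues(String name) {
            return new ArrayList<>(values.getOrDefault(name, new ArrayList<>()));
        }
    }

    public static void main(String[] args) {
        // Two og:image tags handled with set(): only the last one survives.
        SimpleMetadata viaSet = new SimpleMetadata();
        viaSet.set("og:image", "http://example.com/a.png");
        viaSet.set("og:image", "http://example.com/b.png");
        System.out.println(viaSet.getValues("og:image"));
        // prints [http://example.com/b.png]

        // The same two tags handled with add(): both values are kept.
        SimpleMetadata viaAdd = new SimpleMetadata();
        viaAdd.add("og:image", "http://example.com/a.png");
        viaAdd.add("og:image", "http://example.com/b.png");
        System.out.println(viaAdd.getValues("og:image"));
        // prints [http://example.com/a.png, http://example.com/b.png]
    }
}
```

With this model, a page carrying several og:image tags loses all but the last one on the set() path, which matches the symptom reported in the issue.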
