[jira] Created: (JCR-2365) HTML Text Extractor does not extract or index numerics

Jeremy Anderson (JIRA) Tue, 27 Oct 2009 05:47:27 -0700

HTML Text Extractor does not extract or index numerics
------------------------------------------------------


                 Key: JCR-2365
                 URL: https://issues.apache.org/jira/browse/JCR-2365
             Project: Jackrabbit Content Repository
          Issue Type: Bug
          Components: indexing, jackrabbit-text-extractors
    Affects Versions: 1.6.0
         Environment: Win XP-Pro; Win 2003 Enterprise 32bit
            Reporter: Jeremy Anderson


Numerics such as addresses/dates/financial figures are not extracted or indexed 
by the current HTML Extractor.  These values are handled properly and 
searchable when done via the PlainTextExtractor

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (JCR-2365) HTML Text Extractor does not extract or index numerics

Reply via email to