HTML Text Extractor does not extract or index numerics
------------------------------------------------------
Key: JCR-2365
URL: https://issues.apache.org/jira/browse/JCR-2365
Project: Jackrabbit Content Repository
Issue Type: Bug
Components: indexing, jackrabbit-text-extractors
Affects Versions: 1.6.0
Environment: Win XP-Pro; Win 2003 Enterprise 32bit
Reporter: Jeremy Anderson
Numerics such as addresses/dates/financial figures are not extracted or indexed
by the current HTML Extractor. These values are handled properly and
searchable when done via the PlainTextExtractor
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.