[ http://issues.apache.org/jira/browse/JCR-281?page=all ]
     
Marcel Reutegger reopened JCR-281:
----------------------------------


Hmm, that's too bad.

I'm still a bit confused about which libraries we may use in Apache projects. 
Roy, is there a list of licenses that are compatible with the Apache license? 
I'd like to make sure we don't spend too much time in the future on extensions 
that we cannot include. Thanks.

Would it be OK with you, Martin, to remove the HTML filter from the patch? The 
XML and RTF filters are still very good contributions that I'd like to include.

> textfilters module patch: Support for text extraction for HTML,XML and RTF 
> files
> --------------------------------------------------------------------------------
>
>          Key: JCR-281
>          URL: http://issues.apache.org/jira/browse/JCR-281
>      Project: Jackrabbit
>         Type: Improvement
>   Components: query
>     Reporter: Martin Perez
>  Attachments: patch.diff
>
> This patch adds text extraction support for XML, RTF and HTML files.
> The only dependency is the htmlparser library, used for HTML text extraction.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
