[
https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650368#action_12650368
]
Grant Ingersoll commented on SOLR-284:
--------------------------------------
I like how Erik has given names to contribs, etc.: Flare, Celeritas, etc. So,
I thought I would give one too:
I was typing the javadocs and wrote "Solr Content Extraction Library". Which
then lead me to "Solr Cell" as the project name?
http://en.wikipedia.org/wiki/Solar_cell It's also nice, b/c a Solar Cell's job
is to convert the raw energy of the Sun to electricity, and this contrib's
module is responsible for "raw" content of a document to something usable by
Solr.
I know, I know, get a life... ;-) Still, it beats "ExtractingRequestHandler"
as a name!
> Parsing Rich Document Types
> ---------------------------
>
> Key: SOLR-284
> URL: https://issues.apache.org/jira/browse/SOLR-284
> Project: Solr
> Issue Type: New Feature
> Components: update
> Reporter: Eric Pugh
> Assignee: Grant Ingersoll
> Fix For: 1.4
>
> Attachments: libs.zip, rich.patch, rich.patch, rich.patch,
> rich.patch, rich.patch, rich.patch, rich.patch, SOLR-284.patch,
> SOLR-284.patch, solr-word.pdf, source.zip, test-files.zip, test-files.zip,
> test.zip, un-hardcode-id.diff
>
>
> I have developed a RichDocumentRequestHandler based on the CSVRequestHandler
> that supports streaming a PDF, Word, Powerpoint, Excel, or PDF document into
> Solr.
> There is a wiki page with information here:
> http://wiki.apache.org/solr/UpdateRichDocuments
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.