[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650368#action_12650368 ]
Grant Ingersoll commented on SOLR-284: -------------------------------------- I like how Erik has given names to contribs, etc.: Flare, Celeritas, etc. So, I thought I would give one too: I was typing the javadocs and wrote "Solr Content Extraction Library". Which then lead me to "Solr Cell" as the project name? http://en.wikipedia.org/wiki/Solar_cell It's also nice, b/c a Solar Cell's job is to convert the raw energy of the Sun to electricity, and this contrib's module is responsible for "raw" content of a document to something usable by Solr. I know, I know, get a life... ;-) Still, it beats "ExtractingRequestHandler" as a name! > Parsing Rich Document Types > --------------------------- > > Key: SOLR-284 > URL: https://issues.apache.org/jira/browse/SOLR-284 > Project: Solr > Issue Type: New Feature > Components: update > Reporter: Eric Pugh > Assignee: Grant Ingersoll > Fix For: 1.4 > > Attachments: libs.zip, rich.patch, rich.patch, rich.patch, > rich.patch, rich.patch, rich.patch, rich.patch, SOLR-284.patch, > SOLR-284.patch, solr-word.pdf, source.zip, test-files.zip, test-files.zip, > test.zip, un-hardcode-id.diff > > > I have developed a RichDocumentRequestHandler based on the CSVRequestHandler > that supports streaming a PDF, Word, Powerpoint, Excel, or PDF document into > Solr. > There is a wiki page with information here: > http://wiki.apache.org/solr/UpdateRichDocuments > -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.