Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change 
notification.

The "ExtractingRequestHandler" page has been changed by JanHoydahl:
http://wiki.apache.org/solr/ExtractingRequestHandler?action=diff&rev1=77&rev2=78

Comment:
Link to Tika1.2

  
  = Additional Resources =
   * [[http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Content-Extraction-Tika#example.source|Lucid Imagination article]]
-  * [[http://tika.apache.org/0.10/formats.html|Supported document formats via Tika (0.10)]]
+  * [[http://tika.apache.org/1.2/formats.html|Supported document formats via Tika (1.2)]]
  
  = What's in a Name =
  Grant was writing the javadocs for the code and needed an entry for the 
<title> tag and wrote out "Solr Content Extraction Library", since the contrib 
directory is named "extraction".  This then led to an "acronym":  Solr CEL, 
which then gets mashed to: Solr Cell.  Hence, the project name is "Solr Cell".  
It's also appropriate because a solar cell's job is to convert the raw energy 
of the Sun to electricity, and this contrib module is responsible for 
converting the "raw" content of a document to something usable by Solr. 
http://en.wikipedia.org/wiki/Solar_cell
