Trying to index pdf docs - lazy loading error - ClassNotFoundException: solr.extraction.ExtractingRequestHandler

Rode González Tue, 09 Aug 2011 04:04:50 -0700

Hi all.


I've tried to index PDF documents using the libraries included in the
example distribution of Solr 3.3.0.

 

I've copied all the jars included in the /dist and /contrib directories into a
common /lib directory, and I've added this path to the solrconfig.xml
file.
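
For illustration, a lib entry of this kind in solrconfig.xml usually looks something like the sketch below. Both dir values are assumed placeholders, not the exact paths used in this setup, and the second entry is only an optional variant.

```xml
<!-- Sketch only: both dir values are assumed placeholders. -->
<!-- A relative dir is resolved against the Solr instance directory. -->
<lib dir="./lib" />

<!-- Optional variant: load only the jars whose file names match a regex. -->
<lib dir="./lib" regex="apache-solr-cell-\d.*\.jar" />
```

These elements sit directly under the top-level <config> element of solrconfig.xml.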

 

The request handler for binary docs has no changes from the example:

 

  <requestHandler name="/update/extract"
                  startup="lazy"
                  class="solr.extraction.ExtractingRequestHandler" >
    <lst name="defaults">
      <!-- All the main content goes into "text"... if you need to return
           the extracted text or do highlighting, use a stored field. -->
      <str name="fmap.content">text</str>
      <!-- <str name="lowernames">true</str> -->
      <!-- <str name="uprefix">ignored_</str> -->

      <!-- capture link hrefs but ignore div attributes -->
      <!-- <str name="captureAttr">true</str> -->
      <!-- <str name="fmap.a">links</str> -->
      <!-- <str name="fmap.div">ignored_</str> -->
    </lst>
  </requestHandler>

 

I've commented out all child nodes except fmap.content because I don't use the
rest of them.

 

--------------------------------------------------------

...BUT... :)

 

When I try:

 

curl "http://myserver:8080/solr/update/extract/?literal.id=1000&commit=true" -F "myfile=@myfile_.pdf"

 

I get:

 

Status HTTP 500 - lazy loading error org.apache.solr.common.SolrException:
lazy loading error

...

Caused by: org.apache.solr.common.SolrException: Error loading class
'solr.extraction.ExtractingRequestHandler'

...

 

--------------------------------------------------------

I've moved contrib/extraction/lib/* to my lib/ directory.

After restarting the server, I can see in the log that
apache-solr-cell-3.3.0.jar was added to the classloader. But I get the same
result :( ... lazy loading error, error loading class.

--------------------------------------------------------

 

 

What am I forgetting? What am I missing?

 

Thanks

 

 

---

Rode González

 
