Hi all.
I've tried to index pdf documents using the libraries includes in the example distribution of solr 3.3.0. I've copied all the jars includes in /dist and /contrib directories in a common /lib directory and I've included this path to the solrconfig.xml file. The request handler for binary docs has no changes from the example: <requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler" > <lst name="defaults"> <!-- All the main content goes into "text"... if you need to return the extracted text or do highlighting, use a stored field. --> <str name="fmap.content">text</str> <!-- <str name="lowernames">true</str> --> <!-- <str name="uprefix">ignored_</str> --> <!-- capture link hrefs but ignore div attributes --> <!-- <str name="captureAttr">true</str> --> <!-- <str name="fmap.a">links</str> --> <!-- <str name="fmap.div">ignored_</str> --> </lst> </requestHandler> I've commented all subnodes except fmap.content because I don't use the rest of them. -------------------------------------------------------- ...BUT... :) When I try : curl "http://myserver:8080/solr/update/extract/?literal.id=1000&commit=true" -F "myfile=@myfile_.pdf" I get: Status HTTP 500 - lazy loading error org.apache.solr.common.SolrException: lazy loading error ... Caused by: org.apache.solr.common.SolrException: Error loading class 'solr.extraction.ExtractingRequestHandler' ... -------------------------------------------------------- I've moved contrib/extraction/lib/* to my lib/* . Restart the server and I can see in the log that apache-solr-cell-3.3.0.jar was added to the classloader. But I get the same result :( ... lazy loading error, error loading class. -------------------------------------------------------- # What am I forgetting? what am I missing? Thanks --- Rode González _____ No se encontraron virus en este mensaje. Comprobado por AVG - www.avg.com Versión: 10.0.1392 / Base de datos de virus: 1520/3822 - Fecha de publicación: 08/08/11