here the log:

org.apache.solr.handler.dataimport.DataImporter doFullImport
Grave: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' is
a required attribute Processing Document # 1
 at
org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:117)
 at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:71)
 at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:319)
 at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:242)
 at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:180)
 at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:331)
 at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:389)
 at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:370)
feb 12, 2012 7:06:00 PM org.apache.solr.update.DirectUpdateHandler2 rollback
Informazioni: start rollback
feb 12, 2012 7:06:00 PM org.apache.solr.update.DirectUpdateHandler2 rollback
Informazioni: end_rollback
feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.DataImporter
doFullImport
Informazioni: Starting Full Import
feb 12, 2012 7:06:02 PM org.apache.solr.core.SolrCore execute
Informazioni: [] webapp=/solr path=/select
params={clean=false&commit=true&command=full-import&qt=/dataimport}
status=0 QTime=16
feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.SolrWriter
readIndexerProperties
Informazioni: Read dataimport.properties
feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.DataImporter
doFullImport
Grave: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' is
a required attribute Processing Document # 1
 at
org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:117)
 at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:71)
 at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:319)
 at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:242)
 at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:180)
 at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:331)
 at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:389)
 at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:370)
feb 12, 2012 7:06:02 PM org.apache.solr.update.DirectUpdateHandler2 rollback
Informazioni: start rollback
feb 12, 2012 7:06:02 PM org.apache.solr.update.DirectUpdateHandler2 rollback
Informazioni: end_rollback
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol pause
Informazioni: Pausing ProtocolHandler ["http-bio-8983"]
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol pause
Informazioni: Pausing ProtocolHandler ["ajp-bio-8009"]
feb 12, 2012 7:06:42 PM org.apache.catalina.core.StandardService
stopInternal
Informazioni: Stopping service Catalina
feb 12, 2012 7:06:42 PM org.apache.solr.core.SolrCore close
Informazioni: []  CLOSING SolrCore org.apache.solr.core.SolrCore@7d1217
feb 12, 2012 7:06:42 PM org.apache.solr.core.SolrCore closeSearcher
Informazioni: [] Closing main searcher on request.
feb 12, 2012 7:06:42 PM org.apache.solr.search.SolrIndexSearcher close
Informazioni: Closing Searcher@19fabda main
 
fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
 
filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
 
queryResultCache{lookups=0,hits=0,hitratio=0.00,inserts=2,evictions=0,size=2,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
 
documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
feb 12, 2012 7:06:42 PM org.apache.solr.update.DirectUpdateHandler2 close
Informazioni: closing
DirectUpdateHandler2{commits=0,autocommits=0,optimizes=0,rollbacks=4,expungeDeletes=0,docsPending=0,adds=0,deletesById=0,deletesByQuery=0,errors=0,cumulative_adds=0,cumulative_deletesById=0,cumulative_deletesByQuery=0,cumulative_errors=0}
feb 12, 2012 7:06:42 PM org.apache.solr.update.DirectUpdateHandler2 close
Informazioni: closed
DirectUpdateHandler2{commits=0,autocommits=0,optimizes=0,rollbacks=4,expungeDeletes=0,docsPending=0,adds=0,deletesById=0,deletesByQuery=0,errors=0,cumulative_adds=0,cumulative_deletesById=0,cumulative_deletesByQuery=0,cumulative_errors=0}
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol stop
Informazioni: Stopping ProtocolHandler ["http-bio-8983"]
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol stop
Informazioni: Stopping ProtocolHandler ["ajp-bio-8009"]
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol destroy
Informazioni: Destroying ProtocolHandler ["http-bio-8983"]
feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol destroy
Informazioni: Destroying ProtocolHandler ["ajp-bio-8009"]


2012/2/16 alessio crisantemi <alessio.crisant...@gmail.com>

> yes, but if I use TikaEntityProcessor the result of my full-import is
>
> <str name="Total Requests made to DataSource">0</str>
>  <str name="Total Rows Fetched">1</str>
>
> <str name="Total Documents Skipped">0</str>
>
> <str name="">Indexing failed. Rolled back all changes.</str>
>
>
>
>
> 2012/2/16 alessio crisantemi <alessio.crisant...@gmail.com>
>
>> Hi all,
>> I have a problem to configure a pdf indexing from a directory in my solr
>> wit DIH:
>>
>> with this data-config
>>
>>
>> <dataConfig>
>>  <dataSource type="BinFileDataSource" />
>>  <document>
>>   <entity
>>     name="tika-test"
>>     processor="FileListEntityProcessor"
>>     baseDir="D:\gioconews_archivio\marzo2011"
>>     fileName=".*pdf"
>>     recursive="true"
>>     rootEntity="false"
>>     dataSource="null"/>
>>   <entity processor="FileListEntityProcessor"
>> url="D:\gioconews_archivio\marzo2011" format="text" >
>>    <field column="author"  name="author" meta="true"/>
>>    <field column="title" name="title" meta="true"/>
>>      <field column="description" name="description" />
>>      <field column="comments" name="comments" />
>>
>>      <field column="content_type" name="content_type" />
>>      <field column="last_modified" name="last_modified" />
>>   </entity>
>>  </document>
>> </dataConfig>
>>
>> I obtain this result:
>>
>>
>>
>>   <str name="command">full-import</str>
>>
>>   <str name="status">idle</str>
>>
>>   <str name="importResponse" />
>>
>> - <lst name="statusMessages">
>>
>>   <str name="Time Elapsed">0:0:2.44</str>
>>
>>   <str name="Total Requests made to DataSource">0</str>
>>
>>   <str name="Total Rows Fetched">43</str>
>>
>>   <str name="Total Documents Skipped">0</str>
>>
>>   <str name="Full Dump Started">2012-02-12 19:06:00</str>
>>
>>   <str name="">Indexing failed. Rolled back all changes.</str>
>>
>>   <str name="Rolledback">2012-02-12 19:06:00</str>
>>   </lst>
>>
>>
>> suggestions?
>> thank you
>> alessio
>>
>
>

Reply via email to