here the log:
org.apache.solr.handler.dataimport.DataImporter doFullImport Grave: Full Import failed org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' is a required attribute Processing Document # 1 at org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:117) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:71) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:319) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:242) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:180) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:331) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:389) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:370) feb 12, 2012 7:06:00 PM org.apache.solr.update.DirectUpdateHandler2 rollback Informazioni: start rollback feb 12, 2012 7:06:00 PM org.apache.solr.update.DirectUpdateHandler2 rollback Informazioni: end_rollback feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.DataImporter doFullImport Informazioni: Starting Full Import feb 12, 2012 7:06:02 PM org.apache.solr.core.SolrCore execute Informazioni: [] webapp=/solr path=/select params={clean=false&commit=true&command=full-import&qt=/dataimport} status=0 QTime=16 feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.SolrWriter readIndexerProperties Informazioni: Read dataimport.properties feb 12, 2012 7:06:02 PM org.apache.solr.handler.dataimport.DataImporter doFullImport Grave: Full Import failed org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' is a required attribute Processing Document # 1 at org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:117) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:71) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:319) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:242) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:180) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:331) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:389) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:370) feb 12, 2012 7:06:02 PM org.apache.solr.update.DirectUpdateHandler2 rollback Informazioni: start rollback feb 12, 2012 7:06:02 PM org.apache.solr.update.DirectUpdateHandler2 rollback Informazioni: end_rollback feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol pause Informazioni: Pausing ProtocolHandler ["http-bio-8983"] feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol pause Informazioni: Pausing ProtocolHandler ["ajp-bio-8009"] feb 12, 2012 7:06:42 PM org.apache.catalina.core.StandardService stopInternal Informazioni: Stopping service Catalina feb 12, 2012 7:06:42 PM org.apache.solr.core.SolrCore close Informazioni: [] CLOSING SolrCore org.apache.solr.core.SolrCore@7d1217 feb 12, 2012 7:06:42 PM org.apache.solr.core.SolrCore closeSearcher Informazioni: [] Closing main searcher on request. feb 12, 2012 7:06:42 PM org.apache.solr.search.SolrIndexSearcher close Informazioni: Closing Searcher@19fabda main fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0} filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0} queryResultCache{lookups=0,hits=0,hitratio=0.00,inserts=2,evictions=0,size=2,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0} documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0} feb 12, 2012 7:06:42 PM org.apache.solr.update.DirectUpdateHandler2 close Informazioni: closing DirectUpdateHandler2{commits=0,autocommits=0,optimizes=0,rollbacks=4,expungeDeletes=0,docsPending=0,adds=0,deletesById=0,deletesByQuery=0,errors=0,cumulative_adds=0,cumulative_deletesById=0,cumulative_deletesByQuery=0,cumulative_errors=0} feb 12, 2012 7:06:42 PM org.apache.solr.update.DirectUpdateHandler2 close Informazioni: closed DirectUpdateHandler2{commits=0,autocommits=0,optimizes=0,rollbacks=4,expungeDeletes=0,docsPending=0,adds=0,deletesById=0,deletesByQuery=0,errors=0,cumulative_adds=0,cumulative_deletesById=0,cumulative_deletesByQuery=0,cumulative_errors=0} feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol stop Informazioni: Stopping ProtocolHandler ["http-bio-8983"] feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol stop Informazioni: Stopping ProtocolHandler ["ajp-bio-8009"] feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol destroy Informazioni: Destroying ProtocolHandler ["http-bio-8983"] feb 12, 2012 7:06:42 PM org.apache.coyote.AbstractProtocol destroy Informazioni: Destroying ProtocolHandler ["ajp-bio-8009"] 2012/2/16 alessio crisantemi <alessio.crisant...@gmail.com> > yes, but if I use TikaEntityProcessor the result of my full-import is > > <str name="Total Requests made to DataSource">0</str> > <str name="Total Rows Fetched">1</str> > > <str name="Total Documents Skipped">0</str> > > <str name="">Indexing failed. Rolled back all changes.</str> > > > > > 2012/2/16 alessio crisantemi <alessio.crisant...@gmail.com> > >> Hi all, >> I have a problem to configure a pdf indexing from a directory in my solr >> wit DIH: >> >> with this data-config >> >> >> <dataConfig> >> <dataSource type="BinFileDataSource" /> >> <document> >> <entity >> name="tika-test" >> processor="FileListEntityProcessor" >> baseDir="D:\gioconews_archivio\marzo2011" >> fileName=".*pdf" >> recursive="true" >> rootEntity="false" >> dataSource="null"/> >> <entity processor="FileListEntityProcessor" >> url="D:\gioconews_archivio\marzo2011" format="text" > >> <field column="author" name="author" meta="true"/> >> <field column="title" name="title" meta="true"/> >> <field column="description" name="description" /> >> <field column="comments" name="comments" /> >> >> <field column="content_type" name="content_type" /> >> <field column="last_modified" name="last_modified" /> >> </entity> >> </document> >> </dataConfig> >> >> I obtain this result: >> >> >> >> <str name="command">full-import</str> >> >> <str name="status">idle</str> >> >> <str name="importResponse" /> >> >> - <lst name="statusMessages"> >> >> <str name="Time Elapsed">0:0:2.44</str> >> >> <str name="Total Requests made to DataSource">0</str> >> >> <str name="Total Rows Fetched">43</str> >> >> <str name="Total Documents Skipped">0</str> >> >> <str name="Full Dump Started">2012-02-12 19:06:00</str> >> >> <str name="">Indexing failed. Rolled back all changes.</str> >> >> <str name="Rolledback">2012-02-12 19:06:00</str> >> </lst> >> >> >> suggestions? >> thank you >> alessio >> > >