sorry to bother you again, but the delta import still does not work for
me :-(
We tried:
* delta-import by full-import
<entity name="articles-delta rootEntity="false"
query="<your-delta-query-here>"> with entity=articles-delta&clean=false
* delta-import by full-import with simplified query
* delta-import with simplified query
<entity name="articles-delta" pk="article_ref"
deltaQuery="SELECT * FROM full_text_view WHERE article_id < 300">
* replaced files below with files from nightly-build 15.10.08 and rerun
the delta and full imports as described above
dist/apache-solr-dataimporthandler-1.3.0.jar
dist/solrj-lib/slf4j-api-1.5.3.jar
dist/solrj-lib/slf4j-jdk14-1.5.3.jar
No matter what we do, we always end up in a situation, when the
dataimport status looks fine:
<lst name="statusMessages">
<str name="Time Elapsed">0:0:8.442</str>
<str name="Total Requests made to DataSource">1</str>
<str name="Total Rows Fetched">218</str>
<str name="Total Documents Skipped">0</str>
<str name="Delta Dump started">2008-10-20 15:31:54</str>
<str name="Identifying Delta">2008-10-20 15:31:54</str>
<str name="Deltas Obtained">2008-10-20 15:31:57</str>
<str name="Building documents">2008-10-20 15:31:57</str>
<str name="Total Changed Documents">218</str>
but the log reads:
Oct 20, 2008 3:56:44 PM org.apache.solr.core.SolrCore execute
INFO: [test] webapp=/solr path=/dataimport params={command=delta-import}
status=0 QTime=0
Oct 20, 2008 3:56:44 PM org.apache.solr.handler.dataimport.DataImporter
doDeltaImport
INFO: Starting Delta Import
Oct 20, 2008 3:56:44 PM org.apache.solr.handler.dataimport.SolrWriter
readIndexerProperties
INFO: Read dataimport.properties
Oct 20, 2008 3:56:44 PM org.apache.solr.handler.dataimport.DocBuilder
doDelta
INFO: Starting delta collection.
Oct 20, 2008 3:56:44 PM org.apache.solr.handler.dataimport.DocBuilder
collectDelta
INFO: Running ModifiedRowKey() for Entity: articles-full
Oct 20, 2008 3:56:44 PM
org.apache.solr.handler.dataimport.JdbcDataSource$1 call
INFO: Creating a connection for entity articles-full with URL:
jdbc:postgresql://blogmonitor02:5432/blogmonitor
Oct 20, 2008 3:56:44 PM
org.apache.solr.handler.dataimport.JdbcDataSource$1 call
INFO: Time taken for getConnection(): 5
Oct 20, 2008 3:56:46 PM org.apache.solr.handler.dataimport.DocBuilder
collectDelta
INFO: Completed ModifiedRowKey for Entity: articles-full rows obtained : 218
Oct 20, 2008 3:56:46 PM org.apache.solr.handler.dataimport.DocBuilder
collectDelta
INFO: Running DeletedRowKey() for Entity: articles-full
Oct 20, 2008 3:56:46 PM org.apache.solr.handler.dataimport.DocBuilder
collectDelta
INFO: Completed DeletedRowKey for Entity: articles-full rows obtained : 0
Oct 20, 2008 3:56:46 PM org.apache.solr.handler.dataimport.DocBuilder
collectDelta
INFO: Completed parentDeltaQuery for Entity: articles-full
Oct 20, 2008 3:56:46 PM org.apache.solr.handler.dataimport.DataImporter
doDeltaImport
SEVERE: Delta Import Failed
java.lang.NullPointerException
at
org.apache.solr.handler.dataimport.SqlEntityProcessor.getDeltaImportQuery(SqlEntityProcessor.java:153)
at
org.apache.solr.handler.dataimport.SqlEntityProcessor.getQuery(SqlEntityProcessor.java:125)
at
org.apache.solr.handler.dataimport.SqlEntityProcessor.nextRow(SqlEntityProcessor.java:73)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:285)
at
org.apache.solr.handler.dataimport.DocBuilder.doDelta(DocBuilder.java:211)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:133)
at
org.apache.solr.handler.dataimport.DataImporter.doDeltaImport(DataImporter.java:359)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:388)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:377)
here is the full data-config:
<dataConfig>
<dataSource type="JdbcDataSource" driver="org.postgresql.Driver"
url="jdbc:postgresql://bm02:5432/bm" user="bm" />
<document name="articles">
<entity name="articles-full" pk="id" query="SELECT * FROM
full_text_view where article_id < 200" deltaQuery="SELECT * FROM
full_text_view WHERE article_id < 300">
<field column="article_id" name="a_id" />
<field column="normalized_text" name="norm_text" />
<field column="article_ref" name="id" />
<field column="article_stub" name="stub" />
<field column="id_blogs" name="blog_id" />
<field column="article_title" name="a_title" />
<field column="article_url" name="article_url" />
<field column="ts" name="ts" />
<field column="rank" name="rank" />
<field column="blog_ref" name="blog_ref" />
<field column="blog_title" name="b_title" />
<field column="blog_subtitle" name="subtitle" />
<field column="blog_url" name="blog_url" />
</entity>
</document>
</dataConfig>
what are we doing wrong?
Florian