Hi, On Tue, Jun 1, 2010 at 18:08, Matthias Paul <[email protected]>wrote:
> Hi, > > if I run bin/nutch solrdedup <SOLR_URL> it crashes with > java.lang.NullPointerException > at org.apache.hadoop.io.Text.encode(Text.java:388) > at org.apache.hadoop.io.Text.set(Text.java:178). > > I suppose this happens because in my solr-index there are not only > documents > from nutch but also from a database. So not all records have the > digest-field compiled. > What can I do? > Isn't there the possibility to override the query which Nutch sends to Solr > id:[* TO *]? > > Dedup will not work without digest field. Perhaps we can extend solrdedup so it skips all documents with a digest field. Will that work for you? > Thanks > Matthias > -- Doğacan Güney

