Am 02.07.2013 22:29, schrieb Sebastian Nagel:

no field "digest" showing up in the indexchecker
That's correct to some extend. The class of indexchecker is called
IndexingFiltersChecker and it shows the fields added by the
configured IndexingFilters. The field digest is added as a field by
the class IndexerMapReduce. The digest or signature is used to detect
notmodified pages. Hence, it's central to crawling and is stored in
CrawlDb and segments. But it may be a good idea to add it to
indexchecker, although it's not added by a indexing filter plugin.

Ah alright, thx for the explanation. And yes, it would be a good idea to
make that show up in indexchecker.

thus not in our Solr
That should not happen. Which Solr version is used? Is Nutch's
schema.xml properly deployed and loaded by Sorl?

Field is present in our schema.xml [1] and showing up in the Solr schema
browser, but remains empty. Shows up in hadoop.log as well[2].

[1] <field name="digest" type="string" indexed="false" stored="true"/>
[2] solr.SolrMappingReader - source: digest dest: digest

Greetings!

--
-c

Reply via email to