Hi Michael, does it work if metatags in "index.parse.md" are lowercased?
<property> <name>index.parse.md</name> <value>metatag.groupsallowed,metatag.gtitle</value> </property> See https://issues.apache.org/jira/browse/NUTCH-1561 Sorry, that's an open issue for one year now. If you find time to review the patch, would be great! Thanks, Sebastian On 05/23/2014 07:53 PM, [email protected] wrote: > > I am using Nutch 1.8 with Solr 4.3 and I want to index two custom meta tags > that we have on our site. I have followed the tutorial at > http://wiki.apache.org/nutch/IndexMetatags but I cannot get it to work. If I > run parsechecker, it shows that the fields are being parsed, but if I run > indexchecker, those fields do not appear. Nor do they appear in Solr. So it > appears that these tags are being parsed, but not indexed for some reason. > > Here is the pertinent section of my nutch-site.xml which shows the > configuration for the parse-metadata and index-metadata plugins: > > <property> > <name>plugin.includes</name> > > <value>nutch-extensionpoints|lib-nekohtml|lib-http|lib-regex-filter|protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|metadata)|scoring-opic|urlnormalizer-(pass|regex|basic)|indexer-solr</value> > </property> > > <property> > <name>metatags.names</name> > <value>groupsAllowed;gTitle</value> > </property> > > <property> > <name>index.parse.md</name> > <value>metatag.groupsAllowed,metatag.gTitle</value> > </property> > > I have added the following fields in schema-solr4.xml and in the solr schema > as well: > > <field name="metatag.groupsAllowed" type="text_general" stored="true" > indexed="true"/> > <field name="metatag.gTitle" type="text_general" stored="true" > indexed="true"/> > > Any help would be greatly appreciated. > > - Michael > > >

