Hi Michael,

Does it work if the metatag names in "index.parse.md" are lowercased?

<property>
 <name>index.parse.md</name>
 <value>metatag.groupsallowed,metatag.gtitle</value>
</property>

See https://issues.apache.org/jira/browse/NUTCH-1561
Sorry, that issue has been open for a year now.
If you find time to review the patch, that would be great!
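
If lowercasing does the trick, the Solr field names would presumably have to
follow suit, since, as far as I can tell, index-metadata uses the names from
"index.parse.md" as the field names. A sketch only, assuming the same field
type as in your mail (not tested against your setup):

<!-- assumption: field names lowercased to match index.parse.md -->
<field name="metatag.groupsallowed" type="text_general" stored="true"
 indexed="true"/>
<field name="metatag.gtitle" type="text_general" stored="true"
 indexed="true"/>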

Thanks,
Sebastian

On 05/23/2014 07:53 PM, [email protected] wrote:
> 
> I am using Nutch 1.8 with Solr 4.3 and I want to index two custom meta tags 
> that we have on our site. I have followed the tutorial at 
> http://wiki.apache.org/nutch/IndexMetatags but I cannot get it to work. If I 
> run parsechecker, it shows that the fields are being parsed, but if I run 
> indexchecker, those fields do not appear. Nor do they appear in Solr. So it 
> appears that these tags are being parsed, but not indexed for some reason.
>  
> Here is the pertinent section of my nutch-site.xml which shows the 
> configuration for the parse-metadata and index-metadata plugins:
>  
> <property>
>  <name>plugin.includes</name>
>  <value>nutch-extensionpoints|lib-nekohtml|lib-http|lib-regex-filter|protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|metadata)|scoring-opic|urlnormalizer-(pass|regex|basic)|indexer-solr</value>
> </property>
>  
> <property>
>  <name>metatags.names</name>
>  <value>groupsAllowed;gTitle</value>
> </property>
>  
> <property>
>  <name>index.parse.md</name>
>  <value>metatag.groupsAllowed,metatag.gTitle</value>
> </property>
>  
> I have added the following fields in schema-solr4.xml and in the solr schema 
> as well:
>  
> <field name="metatag.groupsAllowed" type="text_general" stored="true" 
> indexed="true"/>
> <field name="metatag.gTitle" type="text_general" stored="true" 
> indexed="true"/>
>  
> Any help would be greatly appreciated.
>  
> - Michael
>  
>  
> 
