I am getting duplicate metatag.description values in my indexed results.
When running a parse checker, I am picking up meta name=description and the
meta property=og:description values.

Has anyone else ran into this issue?  If so, how have you fixed it?

If not, any clues on how to resolve.

Thank you in advance,
jeff


configuration: Nutch 1.9
nutch-site.xml(partial):
<!-- Plugin Control statement -->
<property>
  <name>plugin.includes</name>

<value>protocol-httpclient|urlfilter-(prefix|suffix|regex)|feed|headings|parse-(tika|html|metatags)|urlmeta|index-(basic|anchor|metadata|img)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  <description></description>
</property>

<!-- Parse Meta Tag parameters -->
<property>
  <name>metatags.names</name>
  <value>description</value>
</property>

<!-- Parse - Tika Controls -->
<property>
  <name>tika.boilerpipe</name>
  <value>true</value>
</property>

<property>
  <name>tika.boilerpipe.extractor</name>
  <value>JeffExtractor</value>
</property>

<!-- Index-Metadata Plugin -->
<property>
  <name>index.parse.md</name>
  <value>metatag.description</value>
</property>
<property>
  <name>index.content.md</name>
  <value>description</value>
</property>

Reply via email to