Dear Lewis,

Thanks for the README about the parse-metatags plugin. I have now double 
checked and I have the metatags.names property in my nutch-site.xml config file 
as well as the other required properties. Still when running "nutch 
indexchecker URL" I don't see any description or keywords fields :( 

Below I have pasted the relevant parts of my nutch-site.xml config file:

<property>
        <name>index.parse.md</name>
        <value>metatag.description,metatag.keywords</value>
</property>


<property>
        <name>metatags.names</name>
        <value>description;keywords</value>
</property>


<property>
        <name>plugin.includes</name>
        
<value>protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|metadata)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
</property>

As far as I know this all looks correct but maybe you can see something wrong? 
or anything else I might check?

Regards



________________________________
 From: Lewis John Mcgibbney <[email protected]>
To: [email protected]; ML mail <[email protected]> 
Sent: Wednesday, May 2, 2012 12:49 PM
Subject: Re: Indexing meta tags in Nutch 1.4
 
Hi,

Please also see the README Julien kindly provided with the
parse-metatags plugin.

https://svn.apache.org/viewvc/nutch/trunk/src/plugin/parse-metatags/README.txt?view=markup

I'm hoping there should be enough info to get it working flawlessly.
Remember, any changes you make to your config files should really be
recompiled before moving on to a more serious deployment.

On Tue, May 1, 2012 at 12:38 PM, ML mail <[email protected]> wrote:
> Hi Lewis,
>
> Thanks to your explanations, I managed to get the parse-metatags plugin built 
> and installed into the runtime/local/plugins directory. So no I have the 
> index-metatags from the ZIP file as well as the parse-metatags plugin from 
> the patch installed and wanted to check if they are working. I followed 
> step-by-step the guide on http://wiki.apache.org/nutch/IndexMetatags and came 
> to the part where you check with the "nutch indexchecker URL" command for the 
> metatag fields. Unfortunately, in the output of that command I don't see any 
> keywords or description fields :( just the usual ones 
> (site,title,content,etc).
>
> Am I missing something here?
>
> Also let me know if you need more details or my nutch-site.xml config file...
>
> Regards

Reply via email to