For lastModified, just enable the index-more and query-more plugins;
they will do the job for you.
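
Roughly, that means adding "more" to the index and query groups of
plugin.includes in conf/nutch-site.xml. This is only a sketch: the rest
of the value below is an assumed Nutch 1.0-style list, so keep whatever
plugins your own configuration already names and change only those two
groups.

```xml
<!-- conf/nutch-site.xml: overrides the value from nutch-default.xml -->
<property>
  <name>plugin.includes</name>
  <!-- Assumption: the surrounding plugin list is illustrative.
       The relevant change is "more" in index-(...) and query-(...). -->
  <value>protocol-http|urlfilter-regex|parse-(text|html|js)|index-(basic|anchor|more)|query-(basic|site|url|more)|response-(json|xml)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
</property>
```

Pages fetched before the change will need to be re-indexed for the new
field to show up.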

For other metadata, search the mailing list; how to do it has been
explained many times.

2010/1/8, Erlend Garåsen <e.f.gara...@usit.uio.no>:
>
> Hello,
>
> I have tried to add additional metadata by changing the code in
> HtmlParser.java and MoreIndexingFilter.java without any luck. Do I
> really have to do something which is mentioned on the following wiki in
> order to fetch the content of the metadata, i.e. write my own parser,
> filter and a plugin.xml file:
> http://sujitpal.blogspot.com/2009/07/nutch-custom-plugin-to-parse-and-add.html
>
> I find the plugin examples complicated and difficult to understand. What
> the existing HtmlParser does is good for me as long as I am able to
> fetch two additional metadata (author and lastModified) which are
> included in many of my university's webpages.
>
> The last thing I tried to do was to make HtmlParser implement the
> HtmlParseFilter interface, but the required method I implemented is
> never called.
>
> My hope was that we could use Solr/Nutch instead of Ultraseek, but it
> requires that we are able to parse our metadata successfully.
>
> Erlend
> --
> Erlend Garåsen
> Center for Information Technology Services
> University of Oslo
> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
>


-- 
-MilleBii-
