[ 
http://issues.apache.org/jira/browse/NUTCH-59?page=comments#action_12364127 ] 

Doug Cutting commented on NUTCH-59:
-----------------------------------

This patch is to the 0.7 release and will not work in the current trunk.

Please see:

http://www.mail-archive.com/[email protected]/msg02140.html

and 

http://issues.apache.org/jira/browse/NUTCH-61

So extensible metadata should be added to CrawlDatum when a fix for NUTCH-61 is 
committed to trunk.


> meta data support in webdb
> --------------------------
>
>          Key: NUTCH-59
>          URL: http://issues.apache.org/jira/browse/NUTCH-59
>      Project: Nutch
>         Type: New Feature
>     Reporter: Stefan Groschupf
>     Priority: Minor
>  Attachments: webDBMetaDataPatch.txt
>
> Meta data support in web db would very usefully for a new set of nutch 
> feature that needs long life meta data. 
> Actually page meta data need to be regenerated or lookup every 30 days a page 
> is re-fetched, in a long context web db meta data would bring a dramatically 
> performance improvement for such tasks.
> Furthermore Storage of meta data in webdb would make a new generation of 
> linklist generation filters possible.  

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to