[ 
http://issues.apache.org/jira/browse/NUTCH-192?page=comments#action_12364674 ] 

Doug Cutting commented on NUTCH-192:
------------------------------------

I agree that Writable is probably overkill, that strings should be sufficient.

A mapping dictionary would save a lot of space, even with strings.  This could 
be a useful optimization, but should be left until after the initial (less 
optimized) addition of metadata to CrawlDatum.

> meta data support for CrawlDatum
> --------------------------------
>
>          Key: NUTCH-192
>          URL: http://issues.apache.org/jira/browse/NUTCH-192
>      Project: Nutch
>         Type: Improvement
>     Versions: 0.8-dev
>     Reporter: Stefan Groschupf
>      Fix For: 0.8-dev
>  Attachments: metadata300106.patch
>
> Supporting meta data in CrawlDatum would help to get a set of new nutch 
> features realized and makes a lot possible to smaller special focused search 
> engines.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to