[ http://issues.apache.org/jira/browse/NUTCH-54?page=comments#action_64270 
]
     
Doug Cutting commented on NUTCH-54:
-----------------------------------

Currently a Parse is decomposed into ParseText and ParseData so that these may 
be stored separately on disk so that, when constructing summaries, the text may 
be accessed independently.  The full Parse is reconstructed when indexing and 
passed to indexing filters.

Should ParseStatus be available to indexing filters?  If so, then it must be 
stored.  The simplest way to do this would be to make ParseStatus a field of 
ParseData rather than Parse itself.  Even if not, the indexing code must still 
be modified to be able to construct a Parse without a ParseStatus.

Also, a minor note, I would name the method Parser.getStatus(), not 
parser.getPArserStatus().

> Fetcher  improvements
> ---------------------
>
>          Key: NUTCH-54
>          URL: http://issues.apache.org/jira/browse/NUTCH-54
>      Project: Nutch
>         Type: Improvement
>   Components: fetcher
>     Reporter: Andrzej Bialecki 
>     Assignee: Andrzej Bialecki 
>  Attachments: parsestatus.patch
>
> Fetcher improvements.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to