[ 
https://issues.apache.org/jira/browse/OODT-150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13001516#comment-13001516
 ] 

Chris A. Mattmann commented on OODT-150:
----------------------------------------

Hi Brian:

bq. just register a checksummer to a mime-type or whatnot and the crawler would 
automatically determine the checksum for the file 

Can't you already do this with the MimeType action that calls a checksummer?

bq. i need the crawler stuff to be instant (milliseconds) – using the workflow 
manager to do checksumming is going to slow down the crawler

Maybe we could actually benchmark this before we claim that it will 
unequivocally slow the crawler down, without any hard numbers to say how much 
or why.
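
As a starting point for that benchmark, here is a minimal sketch of timing a 
checksum on a single file. It uses only the JDK (java.security.MessageDigest) 
and does not touch the crawler or the workflow manager; the class name 
ChecksumTiming and its methods are made up for illustration and are not part 
of OODT. It only measures the raw cost of the checksum itself, so any 
workflow-manager overhead would have to be measured separately.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not an OODT class: times an MD5 checksum of one file.
public class ChecksumTiming {

    // Streams the file through MessageDigest and returns the MD5 as lowercase hex.
    public static String md5Hex(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("MD5");
        byte[] buf = new byte[8192];
        try (InputStream in = Files.newInputStream(file)) {
            int n;
            while ((n = in.read(buf)) != -1) {
                md.update(buf, 0, n);
            }
        }
        StringBuilder sb = new StringBuilder();
        for (byte b : md.digest()) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    // Average wall-clock milliseconds per checksum over the given number of runs.
    public static double averageMillis(Path file, int runs) throws Exception {
        md5Hex(file); // one warm-up pass so JIT and file cache effects are less dominant
        long start = System.nanoTime();
        for (int i = 0; i < runs; i++) {
            md5Hex(file);
        }
        return (System.nanoTime() - start) / 1e6 / runs;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 1) {
            System.err.println("usage: java ChecksumTiming <file> [runs]");
            return;
        }
        Path file = Paths.get(args[0]);
        int runs = args.length > 1 ? Integer.parseInt(args[1]) : 10;
        System.out.println("md5: " + md5Hex(file));
        System.out.println("avg ms per checksum: " + averageMillis(file, runs));
    }
}
```

Running this against a few representative product files would give a baseline 
to compare against whatever the crawler currently spends per file.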

bq. and i had a bunch of checksum stuff in the crawler2 initially (MD5, PGP, 
. . .) – you made me remove them because you wanted to keep the dependency 
tree small

Keeping the dependency tree small was one part of it. The main part was that 
all of your actions had a bunch of PEATE-specific nomenclature in them (e.g., 
the policy to configure them, including environment variable refs, etc.). That 
was the main thing I objected to.

bq. i still have them on PEATE if you want them . . . i also have an example on 
PEATE which uses the new ExternAction to perform a checksum.

Yeah, some baked-in example config with the ExternAction would probably make a 
lot of sense. Let's think about doing that and what a patch would look like.


> ToggleAction addition to crawler
> --------------------------------
>
>                 Key: OODT-150
>                 URL: https://issues.apache.org/jira/browse/OODT-150
>             Project: OODT
>          Issue Type: New Feature
>          Components: crawler
>    Affects Versions: 0.2
>         Environment: none
>            Reporter: Brian Foster
>            Assignee: Brian Foster
>            Priority: Minor
>             Fix For: 0.3
>
>         Attachments: crawler.OODT-150.2011-03-01.patch.txt
>
>
> Allows for cases where one of two (or more) actions needs to run. For 
> example, sometimes files of the same type have different checksum files 
> associated with them; this allows for either case.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

