[ 
https://issues.apache.org/jira/browse/CONNECTORS-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14003314#comment-14003314
 ] 

Takumi Yoshida commented on CONNECTORS-916:
-------------------------------------------

Hi Karl,

bq. I have an idea about that. What you could do is to keep an entire local 
disk image of what should go to Amazon, not just pieces. Then, the 
notifyOfJobCompletion() can try to send it. If it fails to send, then the job 
aborts. Meanwhile, no data is lost, and if you fix the job/schema and run it 
again, everything should work.

it sounds good idea to me. But why do we need to keep entire document ? I 
thought if a job send some documents successfully, MCF does not need keep these 
documents any more (so MCF delete documents data from disk at the end of 
notifyOfJobCompletion().)

Would you describe more about this ?


> Amazon CloudSearch output connector
> -----------------------------------
>
>                 Key: CONNECTORS-916
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-916
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Amazon CloudSearch output connector
>    Affects Versions: ManifoldCF 1.7
>            Reporter: Takumi Yoshida
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.7
>
>         Attachments: 0507.diff, 0520.diff, 0520_2.diff, 1.patch, 2.diff, 
> 3.diff, AmazonCloudSearchParam.java, AmazonCloudSearchSpecs.java, 
> exception_handling.diff, exception_handling_2.diff, licenselist.txt
>
>
> I wrote some codes snipetts of output connector for Amazon CloudSearch.
> I would like you to review my code. You can crawl web site and feed HTML page 
> to Amazon CloudSearch.
> but it is not perfectly completed followoing reason.
> - does not write any codes for configuration page.
> - supporting file type is only HTML
> Thank you for your time,
>  Takumi Yoshida



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to