[ 
https://issues.apache.org/jira/browse/CONNECTORS-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14002099#comment-14002099
 ] 

Karl Wright commented on CONNECTORS-916:
----------------------------------------

bq. So now, I am going to add a option "exporting file" on sever configuration 
page.

I have an idea about that.  What you could do is to keep an *entire* local disk 
image of what should go to Amazon, not just pieces.  Then, the 
notifyOfJobCompletion() can try to send it.  If it fails to send, then the job 
aborts.  Meanwhile, no data is lost, and if you fix the job/schema and run it 
again, everything should work.

The cost should be much less this way, but it may take some time because you 
update the entire corpus on the Amazon Cloud Search node on every job run.

What do you think?



> Amazon CloudSearch output connector
> -----------------------------------
>
>                 Key: CONNECTORS-916
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-916
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Amazon CloudSearch output connector
>    Affects Versions: ManifoldCF 1.7
>            Reporter: Takumi Yoshida
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.7
>
>         Attachments: 0507.diff, 0520.diff, 1.patch, 2.diff, 3.diff, 
> AmazonCloudSearchParam.java, AmazonCloudSearchSpecs.java, 
> exception_handling.diff, exception_handling_2.diff
>
>
> I wrote some codes snipetts of output connector for Amazon CloudSearch.
> I would like you to review my code. You can crawl web site and feed HTML page 
> to Amazon CloudSearch.
> but it is not perfectly completed followoing reason.
> - does not write any codes for configuration page.
> - supporting file type is only HTML
> Thank you for your time,
>  Takumi Yoshida



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to