[ 
https://issues.apache.org/jira/browse/NUTCH-2920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695148#comment-17695148
 ] 

Tim Allison commented on NUTCH-2920:
------------------------------------

Well, that was a funny notion...

It turns out there is currently no BulkProcessor in the regular java-client 
(it only exists in the high-level Java client); see 
https://github.com/opensearch-project/opensearch-java/issues/181

So, we can make bulk requests with the basic java client, but we'd have to 
cache the bulk operations and have logic for when to run the operations.

The BulkProcessor takes care of all of this and has triggers for when to send 
the bulk data (size or time) and has retry logic and some other useful things.
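
For illustration only, here is a minimal sketch of the kind of caching and
trigger logic described above. It is not the BulkProcessor and it does not use
any opensearch-java API: the class name SimpleBulkBuffer, its parameters and
the sender callback are all hypothetical. It only covers the size and time
triggers; the time trigger is checked when an operation is added rather than
by a background timer, and there is no retry logic.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/**
 * Hypothetical helper (not part of opensearch-java or Nutch): caches bulk
 * operations and hands them to a sender when a size or age limit is reached.
 */
class SimpleBulkBuffer<T> {
    private final int maxActions;            // flush once this many operations are cached
    private final long maxAgeMillis;         // flush once the oldest cached operation is this old
    private final Consumer<List<T>> sender;  // assumed callback that performs the actual bulk request
    private final List<T> pending = new ArrayList<>();
    private long oldestMillis;

    SimpleBulkBuffer(int maxActions, long maxAgeMillis, Consumer<List<T>> sender) {
        this.maxActions = maxActions;
        this.maxAgeMillis = maxAgeMillis;
        this.sender = sender;
    }

    /** Cache one operation and flush if either trigger has fired. */
    synchronized void add(T operation) {
        if (pending.isEmpty()) {
            oldestMillis = System.currentTimeMillis();
        }
        pending.add(operation);
        boolean full = pending.size() >= maxActions;
        boolean stale = System.currentTimeMillis() - oldestMillis >= maxAgeMillis;
        if (full || stale) {
            flush();
        }
    }

    /** Send whatever is cached; also call this once when indexing finishes. */
    synchronized void flush() {
        if (pending.isEmpty()) {
            return;
        }
        List<T> batch = new ArrayList<>(pending);
        pending.clear();
        sender.accept(batch);
    }
}
```

In a real plugin the sender callback would be the place where the cached
operations are turned into one bulk request on the client, and the final
flush() would belong in the index writer's commit/close path.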

This means that we'd have to reimplement that functionality, which I did on 
Tika ... and I don't want to do again. LOL...

> Implement a indexer-opensearch plugin
> -------------------------------------
>
>                 Key: NUTCH-2920
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2920
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 1.20
>
>
> We will be moving to AWS-managed OpenSearch in the near term and I would like 
> to index our content there.
> As of writing, the OpenSearch project has published two plugin versions under 
> the Apache License v2 so far
> https://github.com/opensearch-project/opensearch-java/



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
