FYI, providing the logs attached, the code already sends docs in 1K batch.

On Fri, Dec 15, 2023 at 7:11 PM Dmitri Maziuk <dmitri.maz...@gmail.com>
wrote:

> On 12/15/23 05:41, Vince McMahon wrote:
> > Ishan, you are right.  Doing multithreaded Indexing is going much faster.
> > I found out after the remote machine became unresponsive very quickly ;
> it
> > crashed.  lol.
> FWIW I got better results posting docs in batches from a single thread.
> Work is in a "private org" on gitlab so I can't post the link to the
> code, but the basic layout is a DB reader that yields rows and a writer
> that does requests.post() of a list of JSON docs. With the DB row ->
> JSON doc transformer in-between.
>
> I played with the size of the batch as well as async/await queue before
> leaving it single-threaded w/ batch size of 5K docs: I had no speed
> advantage with larger batches in our setup. And it doesn't DDoS the
> index. ;)
>
> Dima
>
>

-- 
Sincerely yours
Mikhail Khludnev

Reply via email to