Bulk inserting is slow

Frederic Esnault Mon, 23 Jun 2014 03:30:29 -0700

Hi everyone,

I'm inserting around 265 000 documents into an elastic search cluster 
composed of 3 nodes (real servers).
On two servers i give elastic search 20g of heap, on third one which has 
64g ram, i set 30g of heap for elastic search.


I set elastic search configuration to :

- 3 shards (1 per server)
- 0 replicas
- discovery.zen.ping.multicast.enabled: false (and giving on each node the 
unicast hostnames of the two other nodes);
- and this :

indices.memory.index_buffer_size: 50%
index.refresh_interval: 30s
threadpool:
  index:
    type: fixed
    size: 30
    queue_size: 1000
  bulk:
    queue_size: 1000
  bulk:
    type: fixed
    size: 30
    queue_size: 1000
  search:
    type: fixed
    size: 100
    queue_size: 200
  get:
    type: fixed
    size: 100
    queue_size: 200

Indexing is done by groups of 100 000 docs, and here is my application log :
INFO: Adding records to bulk insert batch
INFO: Added 100000 records to bulk insert batch. Inserting batch...
-- Bulk insert took 38.724 secondes
INFO: Adding records to bulk insert batch
INFO: Added 100000 records to bulk insert batch. Inserting batch...
-- Bulk insert took 31.134 secondes
INFO: Adding records to bulk insert batch
INFO: Added 64201 records to bulk insert batch. Inserting batch...
-- Bulk insert took 17.366 secondes

--- Import CSV file took 108.905 secondes ---

I'm wondering if this time is correct or not, or if there is something i 
can do to improve performances ?

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/3a38e79e-9afb-4146-a7e1-7984ec082e22%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bulk inserting is slow

Reply via email to