You should be using the bulk API, that's what it exists for!

On 29 January 2015 at 19:13, webish <greg...@yoursports.com> wrote:

> Hi Mark,
>
> Right now.  28 GB across two indices 5 shards 1 replica per index on 3 AWS
> large servers.
>
> Frequently 1-10 million records or more get imported.  During this time
> all ES nodes hit a CPU usage of over 75%.  We want to break the index down
> and add routing at some point.
>
> Refresh is using default (1) and based on coupling to some old imports
> system the bulk API is NOT used...  Problem is the index get's accessed and
> written to constantly by users.  So disabling refresh would delay their
> content from being indexed.
>
> I was debating using a separate index per import and grouping all the
> indices by an alias.  Not certain how that will affect performance.
>
>
> On Tuesday, January 27, 2015 at 8:03:16 PM UTC-5, Mark Walkom wrote:
>>
>> How much data are you talking? Are you using bulk API? What is your bulk
>> sizing?
>>
>> You can also set an index to not refresh while you ingest it (refresh =
>> -1), then once it's been sent to ES turn indexing back on.
>>
>> On 28 January 2015 at 11:45, webish <gre...@yoursports.com> wrote:
>>
>>> I have some production indices that needs to a large amount of data
>>> imported into them fairly frequently.  Each time we import data the ES
>>> nodes become a huge bottleneck.  I honestly expected a lot better
>>> performance out of them.  Regardless, I would like to import data in a
>>> production ES setup with the least amount of interruption or performance
>>> issues.
>>>
>>> What are some options I can take to import large quantities of data
>>> without affecting data that is already being used by applications?
>>>
>>> I was thinking I could use a combination of aliases or temp indices to
>>> migrate the data over...
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "elasticsearch" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to elasticsearc...@googlegroups.com.
>>> To view this discussion on the web visit https://groups.google.com/d/
>>> msgid/elasticsearch/410c454e-7e8d-4f1b-b70a-68e18fa7c732%
>>> 40googlegroups.com
>>> <https://groups.google.com/d/msgid/elasticsearch/410c454e-7e8d-4f1b-b70a-68e18fa7c732%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>  --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/40010b4f-852b-4c2a-81a0-4187f9b5990b%40googlegroups.com
> <https://groups.google.com/d/msgid/elasticsearch/40010b4f-852b-4c2a-81a0-4187f9b5990b%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAEYi1X8LJUcKUfuT4eGY5Tgo7smknTPr9guJZUx55D1VOR6_Yw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to