My theory is that I was overloading my ES VMs on initial loads or when 
doing other large loads.

My CPU would jump to 99%, the pulls would fail silently, and the river 
thought it had gotten all the documents when it hadn't.

I have since rewritten my rivers to use stored procedures instead; they pull 
much faster and the CPU doesn't go too high.

The procedure keeps track of created_at and updated_at for me instead of 
using the column strategy.  I also chunked my initial calls into logical 
batches; mine happen to be date-based, by month.  I couldn't figure out a 
scheme that made sense using row counts or anything else, and a month seemed 
as good as any, and it works.
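
Roughly, the shape of it is something like this.  This is just a sketch with 
placeholder names (sync_state, dbo.TableA, and the procedure name are not my 
real schema), but it shows the idea: a state table remembers how far the pull 
has gotten, and the procedure returns one month-sized window of created or 
updated rows at a time:

    -- Sketch only: placeholder names, not the real schema.
    -- sync_state is where SQL Server (not ES) remembers how far each table has synced.
    CREATE TABLE sync_state (
        table_name   VARCHAR(128) PRIMARY KEY,
        last_synced  DATETIME NOT NULL
    );
    GO

    CREATE PROCEDURE dbo.GetNextTableABatch
    AS
    BEGIN
        DECLARE @from DATETIME, @to DATETIME;

        -- Pick up where the last run left off.
        SELECT @from = last_synced FROM sync_state WHERE table_name = 'TableA';
        SET @to = DATEADD(month, 1, @from);   -- one calendar month per batch

        -- Everything created or modified inside that window.
        SELECT *
        FROM dbo.TableA
        WHERE (created_at >= @from AND created_at < @to)
           OR (updated_at >= @from AND updated_at < @to);

        -- Move the high-water mark forward for the next run.
        UPDATE sync_state SET last_synced = @to WHERE table_name = 'TableA';
    END
    GO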

This also solved my problem of when/if ES is turned off.  It will now just 
start back up where it last left off, because SQL Server is storing that 
info instead of ES.  I also made it so I can rebuild at will.
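
Since the high-water mark lives in SQL Server, rebuilding a table is mostly 
just rewinding its row in the state table (plus clearing out the documents on 
the ES side) and letting the batches run again; with the same placeholder 
names as above:

    -- Rewind the high-water mark so the next runs re-pull everything.
    UPDATE sync_state
    SET last_synced = '2010-01-01'   -- any date older than the oldest row
    WHERE table_name = 'TableA';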

Thanks,
Garrett



On Thursday, April 23, 2015 at 2:48:03 AM UTC-4, Jörg Prante wrote:
>
> There are log messages at the ES cluster side; you should look there to see 
> why the bulk indexing failed. 
>
> Jörg
>
> On Thu, Apr 23, 2015 at 5:45 AM, GWired <garrett...@gmail.com> wrote:
>
>> Found this in the logs:
>>
>> [2015-04-22 22:01:25,063][ERROR][river.jdbc.BulkNodeClient] bulk [15] 
>> failed with 945 failed items, failure message = failure in bulk execution:
>>
>>
>>
>> On Wednesday, April 22, 2015 at 7:53:25 PM UTC-5, GWired wrote:
>>>
>>> Hi All,
>>>
>>> I've just been informed that I'm off by up to 100k records or so in my 
>>> JDBC-river-fed index.
>>>
>>> I am using the column strategy with a createddate and a lastmodified 
>>> date. 
>>>
>>> Kibana is reporting an entirely different # than what I see in the DB.
>>>
>>> Table A has 978634 in SQL, 934646 shown in Kibana.
>>> Table B has 957327 in SQL, 876725 shown in Kibana.
>>> Table C has 312826 in SQL, 238534 shown in Kibana.
>>>
>>> I see in the ES logs:
>>>
>>> Table A metrics: 979044 rows
>>> Table B metrics: 957591 rows
>>> Table C metrics: 312827 rows
>>>
>>> These are the right numbers... well, at least closer to right.
>>>
>>> But if I do this using Sense:
>>>
>>> GET jdbc/mytable/_count?q=*
>>>
>>> It returns the same # that Kibana is returning.  
>>>
>>> The erring setup is running ES 1.5.1 with Kibana version 3.0.
>>>
>>> On another server with ES 1.5.0 and Kibana 3.0 it is working just fine; 
>>> the #'s match up.
>>>
>>> Any ideas?
>>>
>>>
>>>
>>>
>>>
>
>

