[ 
https://issues.apache.org/jira/browse/IGNITE-6699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16342202#comment-16342202
 ] 

ASF GitHub Bot commented on IGNITE-6699:
----------------------------------------

GitHub user 1vanan opened a pull request:

    https://github.com/apache/ignite/pull/3442

    IGNITE-6699 Optimize client-side data streamer performance

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/1vanan/ignite ignite-6699

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/3442.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3442
    
----
commit f927404cbfe440f50d2b3970da930b6e11c93514
Author: Ivan <vanen31@...>
Date:   2017-11-30T12:08:14Z

    benchmark on this case was developed

commit 7dcdfcf414a24aafd9a5a79349064f51dd7079d8
Author: Ivan <vanen31@...>
Date:   2017-12-26T16:35:58Z

    streaming per batch was implemented.

commit 4fb853d944dda24692b3e67a905f6d0dc40f9cba
Author: Ivan <vanen31@...>
Date:   2018-01-15T15:38:32Z

    some tests in DataStreamerAddDataKeyValueTest class were added

commit 5f509ed47aae4f3c579344e9b990fed7253f7c11
Author: 1vanan <vanen31@...>
Date:   2018-01-19T09:27:05Z

    was added flushing in DataStreamerImpl.close() and per timeout

commit c17b74a436ffe7d80dc940b36a433f092f008d96
Author: Ivan <vanen31@...>
Date:   2018-01-24T14:12:36Z

    code style changes

commit ae912906b6188bb0d70ba89b628bdffa685484cc
Author: Ivan <vanen31@...>
Date:   2018-01-27T13:27:21Z

    synchronize buffer collections and also change timeout test

commit daab5027fb959c539b9084fa6ee06f32b4772d9f
Author: Ivan <vanen31@...>
Date:   2018-01-27T13:33:26Z

    refactor addDataInternal: loadData method was added

----


> Optimize client-side data streamer performance
> ----------------------------------------------
>
>                 Key: IGNITE-6699
>                 URL: https://issues.apache.org/jira/browse/IGNITE-6699
>             Project: Ignite
>          Issue Type: Task
>          Components: streaming
>    Affects Versions: 2.3
>            Reporter: Vladimir Ozerov
>            Assignee: Ivan Fedotov
>            Priority: Major
>              Labels: iep-1, performance
>             Fix For: 2.5
>
>
> Currently if a user has several server nodes and a single client node with 
> single thread pushing data to streamer, he will not be able to load data at 
> maximum speed. On the other hand, if he start several data loading threads, 
> throughput will increase. 
> One of root causes of this is bad data streamer design. Method 
> {{IgniteDataStreamer.addData(K, V)}} returns new feature for every operation, 
> this is too fine grained approach. Also it generates a lot of garbage and 
> causes contention on streamer internals. 
> Proposed implementation flow:
> 1) Compare performance of {{addData(K, V)}} vs {{addData(Collection)}} 
> methods from one thread in distributed environment. The latter should show 
> considerably higher throughput.
> 2) Users should receive per-batch features, rather than per-key. 
> 3) Try caching thread data in some collection until it is large enough to 
> avoid contention and unnecessary allocations.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to