[
https://issues.apache.org/jira/browse/IGNITE-9314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ivan Pavlukhin updated IGNITE-9314:
-----------------------------------
Description:
Need to change DataStreamer semantics.
{{allowOverwrite=false}} mode currently is inconsistent with interval
_partition counters_ update approach used by MVCC transactions.
{{allowOverwrite=true}} mode is terribly slow when using single {{cache.put}}
operations (snapshot request, tx commit on coordinator overhead). Batched mode
using {{cache.putAll}} should handle write conflicts and possible deadlocks.
Also there is a problem when {{DataStreamer}} with {{allowOverwrite == false}}
does not insert value when versions for entry exist but they all are aborted.
Proper transactional semantics should developed for such case. After that
attention should be put on Cache.size method behavior. Cache.size addressed in
https://issues.apache.org/jira/browse/IGNITE-8149 could be decremented
improperly in
{{org.apache.ignite.internal.processors.cache.IgniteCacheOffheapManager#mvccRemoveAll}}
method (called during streamer processing) when all existing mvcc row versions
are aborted or last committed one is _remove_ version.
was:
Need to change DataStreamer semantics (make it transactional)
Currently clients can see DataStreamer partial writes and two subsequent
selects, which are run in scope of one transaction at load time, may return
different results.
Related thread:
[http://apache-ignite-developers.2346864.n4.nabble.com/MVCC-and-IgniteDataStreamer-td32340.html]
Also there is a problem when {{DataStreamer}} with {{allowOverwrite == false}}
does not insert value when versions for entry exist but they all are aborted.
Proper transactional semantics should developed for such case. After that
attention should be put on Cache.size method behavior. Cache.size addressed in
https://issues.apache.org/jira/browse/IGNITE-8149 could be decremented
improperly in
{{org.apache.ignite.internal.processors.cache.IgniteCacheOffheapManager#mvccRemoveAll}}
method (called during streamer processing) when all existing mvcc row versions
are aborted or last committed one is _remove_ version.
> MVCC TX: Datastreamer operations
> --------------------------------
>
> Key: IGNITE-9314
> URL: https://issues.apache.org/jira/browse/IGNITE-9314
> Project: Ignite
> Issue Type: Task
> Components: mvcc
> Reporter: Igor Seliverstov
> Assignee: Ivan Pavlukhin
> Priority: Major
> Fix For: 2.8
>
>
> Need to change DataStreamer semantics.
> {{allowOverwrite=false}} mode currently is inconsistent with interval
> _partition counters_ update approach used by MVCC transactions.
> {{allowOverwrite=true}} mode is terribly slow when using single {{cache.put}}
> operations (snapshot request, tx commit on coordinator overhead). Batched
> mode using {{cache.putAll}} should handle write conflicts and possible
> deadlocks.
> Also there is a problem when {{DataStreamer}} with {{allowOverwrite ==
> false}} does not insert value when versions for entry exist but they all are
> aborted. Proper transactional semantics should developed for such case. After
> that attention should be put on Cache.size method behavior. Cache.size
> addressed in https://issues.apache.org/jira/browse/IGNITE-8149 could be
> decremented improperly in
> {{org.apache.ignite.internal.processors.cache.IgniteCacheOffheapManager#mvccRemoveAll}}
> method (called during streamer processing) when all existing mvcc row
> versions are aborted or last committed one is _remove_ version.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)