[jira] [Updated] (IGNITE-22977) Reduce latency for txn preparing phase

Alexey Scherbakov (Jira) Mon, 12 Aug 2024 07:24:06 -0700


     [ 
https://issues.apache.org/jira/browse/IGNITE-22977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Alexey Scherbakov updated IGNITE-22977:
---------------------------------------
    Description: 
Tx mapping phase has some deficiencies, discovered during tracing, which need 
to be addressed.
 * HybridClockImpl uses currentTimeMillis on each update. This call is heavy, 
better use FastTimestamps with 1 millisecond resolution.
 * CompletableFuture has unexpected latency issues:
 ** onTimeout add 4-5 us of latency
 ** chaining adds 0.5-1 us of latency. instead of using
CompletableFuture.completedFuture(val).handle(closure) better to use if 
(fut.isDone()) closure();
 * Metadata validation {{{}validateTableExistence{}}}, 
{{{}validateSchemaMatch{}}}, {{{}waitForSchemasBeforeReading{}}}should wait 
only once.
 * Number of shared memory access (like updating some counter) should be 
reduced. For example index rebuilding uses counters per each txn and operation.
 * BinaryRow generation is 2x slower than BinaryObject in AI2: 5us vs 10us
 * Thread switching brings a lot of latency.
 * RAFT command is completed always in partitions thread pool
 * Number of {{clockService.now()}} calls could be reduced.
 * Each tx lock cost 2us. Might be improved to 1us.
 * Lease checking adds waiter even if not necessary.
 * Row id generation uses randomUUID which is heavy.

  was:
Tx mapping phase has some deficiencies, discovered during tracing.
 # HybridClockImpl uses currentTimeMillis on each update. This call is heavy, 
better use FastTimestamps with 1 millisecond resolution.

 # CompletableFuture has unexpected latency issues:

 ## onTimeout add 4-5 us of latency

 ## chaining adds 0.5-1 us of latency. instead of using 
CompletableFuture.completedFuture(val).handle(closure) better to use if 
(fut.isDone()) closure();

 # Metadata validation {{{}validateTableExistence{}}}, 
{{{}validateSchemaMatch{}}}, {{{}waitForSchemasBeforeReading{}}}should wait 
only once.

 # Number of shared memory access (like updating some counter) should be 
reduced. For example tx counting should be only during index rebuild.

 # BinaryRow generation is 2x slower than BinaryObject in AI2: 5us vs 10us 
(known issue, just for clarity)

 # Thread switching brings a lot of latency (known issue, just for clarity). 
Can try green threads pool to mitigate this (requires java 21).

 # RAFT command is completed always in partitions thread pool

 # Number of {{clockService.now()}} calls should be reduced.

 # Each tx lock cost 2us. Might be improved to 1us.

 # Lease checking adds waiter even if not necessary.

 # Row id generation uses randomUUID which is heavy.


> Reduce latency for txn preparing phase
> --------------------------------------
>
>                 Key: IGNITE-22977
>                 URL: https://issues.apache.org/jira/browse/IGNITE-22977
>             Project: Ignite
>          Issue Type: Improvement
>    Affects Versions: 3.0
>            Reporter: Alexey Scherbakov
>            Priority: Major
>              Labels: ignite-3
>
> Tx mapping phase has some deficiencies, discovered during tracing, which need 
> to be addressed.
>  * HybridClockImpl uses currentTimeMillis on each update. This call is heavy, 
> better use FastTimestamps with 1 millisecond resolution.
>  * CompletableFuture has unexpected latency issues:
>  ** onTimeout add 4-5 us of latency
>  ** chaining adds 0.5-1 us of latency. instead of using
> CompletableFuture.completedFuture(val).handle(closure) better to use if 
> (fut.isDone()) closure();
>  * Metadata validation {{{}validateTableExistence{}}}, 
> {{{}validateSchemaMatch{}}}, {{{}waitForSchemasBeforeReading{}}}should wait 
> only once.
>  * Number of shared memory access (like updating some counter) should be 
> reduced. For example index rebuilding uses counters per each txn and 
> operation.
>  * BinaryRow generation is 2x slower than BinaryObject in AI2: 5us vs 10us
>  * Thread switching brings a lot of latency.
>  * RAFT command is completed always in partitions thread pool
>  * Number of {{clockService.now()}} calls could be reduced.
>  * Each tx lock cost 2us. Might be improved to 1us.
>  * Lease checking adds waiter even if not necessary.
>  * Row id generation uses randomUUID which is heavy.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (IGNITE-22977) Reduce latency for txn preparing phase

Reply via email to