[ 
https://issues.apache.org/jira/browse/IGNITE-14743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17383047#comment-17383047
 ] 

Taras Ledkov commented on IGNITE-14743:
---------------------------------------

[~amashenkov], the patch is OK with me.


> Support Row with large values.
> ------------------------------
>
>                 Key: IGNITE-14743
>                 URL: https://issues.apache.org/jira/browse/IGNITE-14743
>             Project: Ignite
>          Issue Type: Improvement
>            Reporter: Andrey Mashenkov
>            Assignee: Andrey Mashenkov
>            Priority: Major
>              Labels: iep-54, ignite-3
>             Fix For: 3.0.0-alpha3
>
>         Attachments: Byte array columns only benchmark.txt, Fixlen cols only 
> benchmark.txt, Latin1 string columns benchmark.txt, Non-latin string columns 
> benchmark.txt, String marshalling comparison
>
>   Original Estimate: 168h
>          Time Spent: 2h 10m
>  Remaining Estimate: 165h 50m
>
> h3. Motivation.
> For now, TupleAssembler writes offsets for varlen columns as 2-byte \{{short 
> }}type.
> This implicitly restricts key/value sizes down to 64 kB in total.
> On another side, for small rows that columns can be addressed with \{{byte}} 
> type, we will waste few bytes.
> h3. Description.
> Let's
> # allow 4 byte types (byte, short, int) for offsets.
> # implement and benchmark different approaches that allow us to write rows in 
> the most compact way.
> # then choose and merge the best one.
> We can introduce several formats for writing Vartable (using byte/short/int 
> offsets).
> Additional information about Vartable format can be coded into chunk flags.
> The first approach is to precalculate chunk total size, then choose the most 
> compact format and write a chunk.
> The second approach is to write a chunk with the widest format then convert 
> the chunk into the most compact format in place.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to