[ 
https://issues.apache.org/jira/browse/PARQUET-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16578303#comment-16578303
 ] 

ASF GitHub Bot commented on PARQUET-1372:
-----------------------------------------

majetideepak opened a new pull request #484: PARQUET-1372:  Add an API to allow 
writing RowGroups based on size
URL: https://github.com/apache/parquet-cpp/pull/484
 
 
   I split the changes into multiple commits to ease the review.
   Used the example program to test the new API.
   I will add unit tests once we converge on the API after review. 
   Thanks to @anatolishein for collaborating with this patch.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> [C++] Add an API to allow writing RowGroups based on their size rather than 
> num_rows
> ------------------------------------------------------------------------------------
>
>                 Key: PARQUET-1372
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1372
>             Project: Parquet
>          Issue Type: Task
>            Reporter: Anatoli Shein
>            Assignee: Deepak Majeti
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.5.0
>
>
> The current API allows writing RowGroups with specified numbers of rows, 
> however does not allow writing RowGroups with specified size. In order to 
> write RowGroups of specified size we need to write rows in chunks while 
> checking the total_bytes_written after each chunk is written. This is 
> currently impossible because the call to NextColumn() closes the current 
> column writer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to