[ 
https://issues.apache.org/jira/browse/ASTERIXDB-3314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17787508#comment-17787508
 ] 

ASF subversion and git services commented on ASTERIXDB-3314:
------------------------------------------------------------

Commit ea4ae3a1a8df45d6f575400ce7da2d2099eaa416 in asterixdb's branch 
refs/heads/master from Wail Alkowaileet
[ https://gitbox.apache.org/repos/asf?p=asterixdb.git;h=ea4ae3a1a8 ]

[ASTERIXDB-3314][STO] Reduce buffer cache pressure on columnar

- user model changes: no
- storage format changes: no
- interface changes: no

Details:
- Make max merging components count to 4 in columnar datasets
- Fix not unpinning columnar filter pages
- Allocate initial 32KB buffers for columnar writers

Change-Id: I809109b232bc5a5db0c47a52cb98c838ff55e27f
Reviewed-on: https://asterix-gerrit.ics.uci.edu/c/asterixdb/+/17965
Integration-Tests: Jenkins <[email protected]>
Tested-by: Jenkins <[email protected]>
Reviewed-by: Murtadha Hubail <[email protected]>


> Reduce buffer cache pressure when operating against columnar datasets
> ---------------------------------------------------------------------
>
>                 Key: ASTERIXDB-3314
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-3314
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: STO - Storage
>    Affects Versions: 0.9.9
>            Reporter: Wail Y. Alkowaileet
>            Assignee: Wail Y. Alkowaileet
>            Priority: Major
>              Labels: triaged
>             Fix For: 0.9.9
>
>
> When ingesting columnar datasets, the bulkloader and its columnar writers 
> rely on the buffer cache to provide the necessary buffers for writing. 
> Specifically, the buffer cache provides temporary buffers to the columnar 
> writers. However, not all columns require a full 128KB buffers (e.g., sparse 
> columns). Instead of using precious buffer cache pages for such columns, we 
> should allow column writers to allocate smaller buffers to be used initially. 
> In case they needed more space, they can ask the buffer cache for more. This 
> approach relieve the pressure against the buffer cache.
>  
> Another issue/bug (related to the buffer cache), columnar filters are not 
> unpinning their pages when they're done – holding those pages indefinitely 
> until the next restart.  
>  
> A third issue is the merge policy. Currently, we allow merging upto 10 
> components in one shot. Reducing this number could also alleviate the 
> pressure on the buffer cache.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to