Rui Guo created ASTERIXDB-2842:
----------------------------------

             Summary: OOM when bulk loading on a cheap AWS machine
                 Key: ASTERIXDB-2842
                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2842
             Project: Apache AsterixDB
          Issue Type: Bug
            Reporter: Rui Guo
         Attachments: nc0_OOM.log

I created an AsterixDB cluster on AWS with three cheap VMs, and then tried to 
load the TPC-H dataset of size 10 GB, then I got an OOM error when bulk loading 
a dataset.
 * AWS machine configuration: three *t2.medium* VMs, one as CC and two as NCs. 
Each with *4 GB* memory.  (more details at 
[https://aws.amazon.com/ec2/pricing/on-demand/)]
 * Dataset: TPC-H dataset with a scale factor of 10 (it will generate 10 GB of 
raw data in total, and as I had two NCs, each NC will be fed around 5 GB of raw 
data).
 * NC logs containing the stack trace of the OOM error: attached to this ticket.

As for the detailed steps to reproduce the bug, I deployed the AWS VMs and load 
the dataset both via the Ansible scripts in the code base, but certain 
configurations from the user are required (e.g. configure the AWS access key). 
Please talk to Rui Guo if more detailed information is needed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to