Rui Guo created ASTERIXDB-2842:
----------------------------------
Summary: OOM when bulk loading on a cheap AWS machine
Key: ASTERIXDB-2842
URL: https://issues.apache.org/jira/browse/ASTERIXDB-2842
Project: Apache AsterixDB
Issue Type: Bug
Reporter: Rui Guo
Attachments: nc0_OOM.log
I created an AsterixDB cluster on AWS with three cheap VMs, and then tried to
load the TPC-H dataset of size 10 GB, then I got an OOM error when bulk loading
a dataset.
* AWS machine configuration: three *t2.medium* VMs, one as CC and two as NCs.
Each with *4 GB* memory. (more details at
[https://aws.amazon.com/ec2/pricing/on-demand/)]
* Dataset: TPC-H dataset with a scale factor of 10 (it will generate 10 GB of
raw data in total, and as I had two NCs, each NC will be fed around 5 GB of raw
data).
* NC logs containing the stack trace of the OOM error: attached to this ticket.
As for the detailed steps to reproduce the bug, I deployed the AWS VMs and load
the dataset both via the Ansible scripts in the code base, but certain
configurations from the user are required (e.g. configure the AWS access key).
Please talk to Rui Guo if more detailed information is needed.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)