Re: How to resolve error- "One or more nodes ran out of memory while executing the query"?

Paul Rogers Fri, 26 Oct 2018 14:13:34 -0700

Hi Ashish,

> So you are saying it could not even design to join two physical files of 1
> GB each and show the result on 8 GB RAM?


Thanks for sharing your file sizes. You said your files are compressed, so the 
actual data size might be 5-10x the on-disk size, so 5-10 GB. You mentioned you 
untarred the files. How large was the result?

10 GB of memory might be sufficient for your ~10 GB of uncompressed data.
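
To make the arithmetic above concrete, here is a rough sketch in Python. The 
5-10x ratio is only my rule of thumb from above, not a measured value, and the 
function name is made up for illustration; it is not part of Drill.

```python
def estimate_uncompressed_gb(compressed_gb, ratio_low=5.0, ratio_high=10.0):
    """Return a (low, high) guess at the uncompressed size in GB.

    The default ratios are the 5-10x rule of thumb from this thread,
    an assumption rather than a measurement of the actual files.
    """
    return compressed_gb * ratio_low, compressed_gb * ratio_high


# 1 GB on disk, as reported in this thread.
low, high = estimate_uncompressed_gb(1.0)
print(f"Estimated uncompressed size: {low:.0f}-{high:.0f} GB")
```

The real number is whatever the untarred files add up to on disk, which is 
why I asked how large the result was.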

Again, configuring Drill with 10 GB on an 8 GB system will lead to swapping. 
Drill asks for memory only as needed, so Drill will start fine and run fine for 
a while. But, eventually swapping will kick in, which is fine as long as you 
understand that the result will be slower-than-expected performance.
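
As a quick sanity check on the swapping point, the sketch below compares the 
memory you configure for Drill against the physical RAM on the machine. Both 
numbers are passed in by hand, and the helper name is hypothetical; it does not 
read anything from Drill's configuration.

```python
def memory_overcommit_gb(configured_gb, physical_gb):
    """Return how many GB of configured memory exceed physical RAM.

    A result above zero means the OS would have to swap once the
    process actually touches that much memory.
    """
    return max(0.0, configured_gb - physical_gb)


# The case from this thread: 10 GB configured on an 8 GB machine.
excess = memory_overcommit_gb(10.0, 8.0)
if excess > 0:
    print(f"Configured memory exceeds RAM by {excess:.0f} GB; expect swapping.")
```

This ignores what the OS and other processes need, so in practice swapping 
can start before the configured limit is reached.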

> FYI: If I ran the same query on plain CSV file (i.e running after untarring
> and uncompressed files ), it was successful.


This is very helpful! It shows that Drill can handle the data itself. Makes me 
wonder if there is some bug with how Drill handles decompression.

Perhaps someone can try to reproduce this problem. In the meantime, looks like 
a workaround is to go ahead and use the uncompressed files.
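
If it helps, the workaround can be scripted. This is a minimal sketch that 
assumes the inputs are tar archives (optionally gzip-compressed) and uses only 
Python's standard tarfile module. The function name and the paths in the usage 
comment are placeholders, not anything from your setup.

```python
import tarfile
from pathlib import Path


def untar_for_drill(archive_path, dest_dir):
    """Extract a (possibly compressed) tar archive so the plain files
    can be queried directly. Returns the extracted file paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # "r:*" lets tarfile detect the compression (gzip, bz2, xz, or none).
    with tarfile.open(archive_path, "r:*") as tar:
        names = [m.name for m in tar.getmembers() if m.isfile()]
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions: refuse unsafe members such as
            # absolute paths or links pointing outside dest.
            tar.extractall(dest, filter="data")
        else:
            tar.extractall(dest)
    return sorted(dest / name for name in names)


# Hypothetical usage; substitute your own archive and target directory:
# files = untar_for_drill("/data/input.tar.gz", "/data/uncompressed")
```

Point the Drill query at the extracted directory instead of the archive.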

Thanks,
- Paul
