Re: Re: join fail

2016-05-10 Thread lizhenm...@163.com
Thanks to Andries and Aman for helping me. 1. When I increase planner.memory.max_query_memory_per_node to 4G or 8G, it still does not work. 2. As Aman said, after I run analyze on the table, the rowcount of the scan equals the true table row count, but the join still fails. 3. The join is successful when I use distrib

Re: Filtering data files in directories

2016-05-10 Thread François Méthot
As Ted mentioned, here is an example: SELECT * FROM dfs.data.`/*/processing1/*-mx.csv` On Tue, May 10, 2016 at 5:28 PM, Ted Dunning wrote: > Can you just use wild cards? > > > > On Tue, May 10, 2016 at 1:43 PM, Ludovic Claude < > ludovic.claud...@gmail.com> > wrote: > > > Hello, > > > > I h

Re: Partition reading problem (like operator) while using hive partition table in drill

2016-05-10 Thread Aman Sinha
The Drill test team was able to repro this, and it is now filed as: https://issues.apache.org/jira/browse/DRILL-4665 On Tue, May 10, 2016 at 8:16 AM, Aman Sinha wrote: > This is supposed to work, especially since LIKE predicate is not even on > the partitioning column (it should work either way). I

Re: Filtering data files in directories

2016-05-10 Thread Ted Dunning
Can you just use wild cards? On Tue, May 10, 2016 at 1:43 PM, Ludovic Claude wrote: > Hello, > > I have a repository of files relatively well organised and containing a > mix of medical images and csv files produced from those images in a > neuroscience lab. > > The csv files contain some inte

geospatial queries?

2016-05-10 Thread Krzysztof Jezak
Hi, I am trying to join data in MongoDB with data in PostGIS. Everything works great until I use geospatial queries. I was hoping you could help me make these queries work: Test 1 Show geo data: returns null: SELECT ST_AsText(current_location) FROM postgis.public.device_locat

Filtering data files in directories

2016-05-10 Thread Ludovic Claude
Hello, I have a repository of files relatively well organised and containing a mix of medical images and csv files produced from those images in a neuroscience lab. The csv files contain some interesting data that I would like to aggregate with Drill, but the naming convention is quite speci

Hangout starting now

2016-05-10 Thread Parth Chandra
Please join us for the Drill hangout: https://plus.google.com/hangouts/_/event/ci4rdiju8bv04a64efj5fedd0lc

Slides for Drill + FPGA talk

2016-05-10 Thread Eric Fukuda
Hi, Here are the slides I used for my talk at Apache: Big Data yesterday. https://www.dropbox.com/s/mdb79qp3ucg732w/AcceleratingApacheDrillWithFPGA.pdf?dl=0 Questions and feedback are welcome! Thanks, Eric

Re: join fail

2016-05-10 Thread Aman Sinha
It's difficult to debug this type of issue over an email thread. However, 2 observations: 1. The following Scan, which is the table that is broadcast, shows a rowcount of 1.3M rows, whereas your original email says the rowcount is 32M rows. Can you confirm what the correct row count is?

Re: join fail

2016-05-10 Thread Andries Engelbrecht
See if increasing planner.memory.max_query_memory_per_node helps. It is set to 2G by default. Also see https://community.mapr.com/thread/10329 https://community.mapr.com/docs/DOC-1497 --Andries > On May 10
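
For reference, a minimal sketch of how the option named above might be inspected and raised for the current session. The 4 GB figure is only an illustration, not a recommendation from the thread; the option takes its value in bytes, so the 2G default corresponds to 2147483648.

```sql
-- Inspect the current value of the option (sys.options is Drill's system table of options).
SELECT * FROM sys.options
WHERE name = 'planner.memory.max_query_memory_per_node';

-- Raise the per-node query memory limit for this session only.
-- 4294967296 bytes = 4 GB; this value is an assumption chosen for illustration.
ALTER SESSION SET `planner.memory.max_query_memory_per_node` = 4294967296;
```

Using ALTER SESSION keeps the change local to one connection, which makes it easy to test whether the setting affects the failing join before changing it system-wide.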

Re: Partition reading problem (like operator) while using hive partition table in drill

2016-05-10 Thread Aman Sinha
This is supposed to work, especially since LIKE predicate is not even on the partitioning column (it should work either way). I did a quick test with file system tables and it works for LIKE conditions. Not sure yet about Hive tables. Could you pls file a JIRA and we'll follow up. Thanks. -Aman

Re: Drill (CTAS) Default hadoop Replication factor on HDFS ?

2016-05-10 Thread Jason Altekruse
I think this is a bug with the config block feature. We currently apply this at the storage plugin level, but it does not appear that we are sourcing this configuration from the writers and instead creating a new configuration in each of our three current record writers. I have filed this bug to in

Hbase storage plugin multiple version

2016-05-10 Thread sunil yadav
Hi All, Is it possible to get multiple versions of a record from HBase using Drill? I am only getting the most recent one, and there is no way to specify the max version. Thanks, Sunil

Partition reading problem (like operator) while using hive partition table in drill

2016-05-10 Thread Shankar Mane
Problem: 1. In Drill, we are using a Hive partitioned table, but the explain plans (for the same query) differ between the LIKE and = operators, and the LIKE operator uses all partitions. 2. If you see the Drill explain plans below: the LIKE operator uses *all* partitions whereas the = operator uses *only* the partition filtered by l

Re: Drill (CTAS) Default hadoop Replication factor on HDFS ?

2016-05-10 Thread Shankar Mane
Thanks Abhishek Girish. Copying hdfs-site.xml into the Drill conf directory (on all nodes) works for me. I also tried the config options setting. It does get applied at the storage plugin level, but it has no effect. On Sat, May 7, 2016 at 11:29 PM, Jacques Nadeau wrote: > My suggestion would be to use Dri

Re:Re:Re:Re: join fail

2016-05-10 Thread leezy
I am sorry the picture cannot be read. Here is the text: | Minor Fragment ID | Host Name | Start | End | Runtime | Max Records | Max Batches | Last Update | Last Progress | Peak Memory | State | | 05-00-xx | bigdata2 | 1.529s | 24m46s | 24m45s | 71,555,857 | 21,985 | 12:16:02 | 12:16:02 | 7

Re:Re:Re: join fail

2016-05-10 Thread leezy
Here are the failure profiles. I see that the memory does not exceed 55G on each node. -- thanks for your regards. At 2016-05-10 14:47:10, "leezy" wrote: >Leon,thank you for your reply, yes, i have set the driver memory to 55G and >the java heap memory to 8G. And this user case is run s