Re: Redefining existing data into something like a View

2016-10-24 Thread Ted Dunning
Parth is, of course, exactly correct. Drill doesn't do CRUD. On the other hand, that isn't the end of the story. Because of the treatment of directories as tables and the ability to create files (as tables), you can get the effect of atomicity for large-scale inserts. Likewise for file-at-a-time

Re: Redefining existing data into something like a View

2016-10-24 Thread Parth Chandra
Yes, you can create views [1]. However, Drill does not support CRUD, being primarily for read-only analytical queries. (The only write operation is 'create table as' [2].) [1] https://drill.apache.org/docs/create-view/ [2] http://drill.apache.org/docs/create-table-as-ctas/ On Sun, Oct 23, 2016

Re: "Transactional" conversion of CSV to Parquet?

2016-10-24 Thread MattK
All of this is trivial on a conventional file system or on MapR. Don't think it works out of the box on HDFS (but am willing to be corrected). I did not mention that I am using MapR-FS, so links are an option. On 24 Oct 2016, at 17:34, Ted Dunning wrote: Yeah... it is quite doable. It helps a

Re: "Transactional" conversion of CSV to Parquet?

2016-10-24 Thread Ted Dunning
Yeah... it is quite doable. It helps a bit to have hard links. The basic idea is to have one symbolic link that points to either of two ping-pong staging directories. Whichever staging directory the symbolic link points to is called the active staging directory; the other is called inactive. To
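
The message above is cut off before it describes the full procedure, so the following is only a minimal sketch of the one step it does spell out: a single symbolic link that points at one of two staging directories. It assumes a POSIX-style file system where renaming over an existing link is atomic; the function name `swap_active` and the directory names are hypothetical and do not come from the thread.

```python
import os


def swap_active(link_path, staging_a, staging_b):
    """Repoint link_path at whichever staging directory is currently inactive.

    Assumption: link_path is already a symbolic link to staging_a or staging_b.
    Returns the path of the newly active staging directory.
    """
    current = os.readlink(link_path)
    new_target = staging_b if current == staging_a else staging_a

    # Build the replacement link under a temporary name first.
    tmp_link = link_path + ".tmp"
    if os.path.lexists(tmp_link):
        os.remove(tmp_link)
    os.symlink(new_target, tmp_link)

    # Renaming over the old link replaces it in one step on POSIX systems,
    # so readers see either the old target or the new one, never a gap.
    os.replace(tmp_link, link_path)
    return new_target
```

A caller would fill the inactive directory, call `swap_active`, and then treat the previously active directory as the new scratch area. Whether this matches the rest of the procedure in the truncated message is not known.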

Re: "Transactional" conversion of CSV to Parquet?

2016-10-24 Thread Jim Scott
I would think that the best way to accommodate this would be: when landing the CSV, place it into folder A, then convert it to Parquet format and put it in folder B... This will give you isolation between the file formats, and you can then choose to only query the Parquet files. This is the
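
As a rough illustration of the folder A to folder B conversion, the sketch below builds the two Drill statements one might submit: setting the session's `store.format` option to Parquet, then a 'create table as' query over the CSV folder. The helper name `ctas_statements`, the `dfs.tmp` workspace, and the paths are assumptions for illustration only; the thread does not say how the conversion was actually run.

```python
def ctas_statements(csv_dir, parquet_table, workspace="dfs.tmp"):
    """Return Drill SQL statements that copy a CSV folder into a Parquet table.

    csv_dir       -- hypothetical path of folder A (landing area for CSV files)
    parquet_table -- hypothetical table name written under `workspace` (folder B)
    """
    return [
        # Make CTAS write Parquet files for this session.
        "ALTER SESSION SET `store.format` = 'parquet'",
        # Read every file in the CSV folder and write it out as one table.
        f"CREATE TABLE {workspace}.`{parquet_table}` AS "
        f"SELECT * FROM dfs.`{csv_dir}`",
    ]


for statement in ctas_statements("/data/landing_csv", "events_parquet"):
    print(statement)
```

In practice the SELECT would probably name and cast individual columns rather than use `*`, since Drill exposes headerless CSV rows as a single `columns` array.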

Re: drill with hive snappy

2016-10-24 Thread Kunal Khatua
If you are specifically referring to snappy-compressed Parquet data, yes. Drill can also leverage Hive's own libraries to consume Hive-generated data (e.g. ORC format) that might not be directly consumed by Drill. On Mon 24-Oct-2016 9:54:49 AM, Rainday Chu(初雨)

drill with hive snappy

2016-10-24 Thread 初雨
Hi guys, We are very interested in Drill. Could we query Hive Snappy data with Drill directly?

Redefining existing data into something like a View

2016-10-24 Thread steffen schuler
Hi Drill-Users, I have a fundamental question about refining existing data structures with Drill. Is it possible with Drill to (re-)define something like a "View" (combination of existing data tables) and have all of the CRUD functionality available against this new "View"? Any hint is