With CSV data like:
~~~
id date array
1 2016-01-01 "1,2,3"
2 2016-01-02 "4,5,6"
~~~
I would like to "flatten" the data on the "array" column like so:
~~~
id date element
1 2016-01-01 1
1 2016-01-01 2
1 2016-01-01
~~~
Problem was trailing whitespace in column names:
https://issues.apache.org/jira/browse/DRILL-4843
On 11 Aug 2016, at 20:06, MattK wrote:
On MapR Community cluster with Drill v1.6, using simple comma delimited
data with a header line, gzip compressed, and storage as:
~~~
"csv": {
"type": "text",
"extensions": [
"csv",
"gz"
],
"extractHeader": true,
"delimiter": ","
},
~~~
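For anyone hitting the same symptom, here is a minimal sketch of a pre-check for the cause named in the reply (trailing whitespace in column names). It runs outside Drill and is not the fix tracked in DRILL-4843. The function names and the sample data are hypothetical, and the header rewrite assumes header names contain no quoted commas.

```python
import csv
import io


def find_padded_headers(csv_text):
    """Return header names that carry leading or trailing whitespace."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    return [name for name in header if name != name.strip()]


def strip_header(csv_text):
    """Return the CSV text with whitespace trimmed from header names only.

    Assumption: header names contain no quoted commas, so a plain split
    on "," is enough for the first line. Data rows are left untouched.
    """
    first, sep, rest = csv_text.partition("\n")
    cleaned = ",".join(name.strip() for name in first.split(","))
    return cleaned + sep + rest


# Hypothetical sample: the "date" column name has a trailing space.
sample = 'id,date ,array\n1,2016-01-01,"1,2,3"\n'

print(find_padded_headers(sample))
print(strip_header(sample).splitlines()[0])
```

Running the check on the header line before gzip-compressing the file would catch the padded name before it reaches a query.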
I configured MySQL using the JDBC plugin, and the metadata is available in
INFORMATION_SCHEMA.
0: jdbc:drill:> select * from INFORMATION_SCHEMA.`TABLES` where TABLE_SCHEMA = 'mysql';
++---++-+
| TABLE_CATALOG | TABLE_SCHEMA |
Hello Rahul,
show tables in ., and describe should possibly provide
what you would like to see.
It works on my DB2 at least so please try and see if it works for you as well.
Regards,
Magnus
> On 11 Aug 2016, at 15:09, Rahul Raj wrote:
Hi,
Does Drill allow querying JDBC metadata from a storage plugin? I am trying
to get the list of tables/views and their respective columns.
If not, is it possible to develop REST APIs within Drill to achieve this?
Regards,
Rahul
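On the REST question: as far as I know, Drill's web server already accepts SQL over HTTP at `/query.json`, so listing columns may not need a custom API. Below is a minimal sketch that only builds the request against `INFORMATION_SCHEMA`. The helper names are hypothetical, `localhost:8047` assumes a default local drillbit, and the schema name `mysql` is taken from this thread (a JDBC plugin may expose schemas under a different name).

```python
import json
import urllib.request


def build_columns_query(schema):
    """Build a query listing the columns of every table in one schema."""
    # Double any single quotes so the schema name is a valid SQL literal.
    safe = schema.replace("'", "''")
    return (
        "SELECT TABLE_NAME, COLUMN_NAME, DATA_TYPE "
        "FROM INFORMATION_SCHEMA.`COLUMNS` "
        f"WHERE TABLE_SCHEMA = '{safe}'"
    )


def build_request(schema, base_url="http://localhost:8047"):
    """Build (but do not send) the POST request for Drill's REST endpoint.

    Assumption: base_url points at a drillbit's web port; adjust as needed.
    """
    body = json.dumps({"queryType": "SQL", "query": build_columns_query(schema)})
    return urllib.request.Request(
        base_url + "/query.json",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("mysql")
print(req.full_url)
print(req.data.decode("utf-8"))

# To run it against a live drillbit, uncomment:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))
```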
Embedding the thread dump content in the message :
Thread dump 1:
2016-08-11 11:02:19
Full thread dump OpenJDK 64-Bit Server VM (25.91-b14 mixed mode):
"Attach Listener" #1308 daemon prio=9 os_prio=0 tid=0x7f4bfc156800
nid=0x1cce6 waiting on condition [0x]
Both the log and out files look clean.
Attached you can find the thread dump output (3 snapshots).
I would appreciate it if someone could point me to the reason the drillbits
constantly consume CPU.
BTW, CPU usage is now constant at 40% of a single core (double what it was a
week ago).