Re: Drill Performance

2016-07-07 Thread Abdel Hakim Deneche
I'm not sure you'll get any performance improvement from running more than a single drillbit per cluster node. On Thu, Jul 7, 2016 at 9:47 AM, scott wrote: > Follow up question: Is there a sweet spot for DRILL_MAX_DIRECT_MEMORY and > DRILL_HEAP settings? > > On Wed, Jul 6,

Re: Performance issues when query sorts and retrieves all columns

2016-07-07 Thread Abdel Hakim Deneche
Assuming all your queries contain a LIMIT, when there is no ORDER BY then Drill doesn't need to read all rows to produce the results, but with ORDER BY there is no alternative than to read 11M rows from disk. Parquet is a columnar format, so Drill, is able to only read the columns you selected

Re: Number of records per batch

2016-07-05 Thread Abdel Hakim Deneche
ieve this capacity restriction? It doesn't have to be a configuration > option. I don't mind changing and compiling the code. > > On Tue, Jul 5, 2016 at 8:55 PM, Abdel Hakim Deneche <adene...@maprtech.com > > > wrote: > > > Unfortunately I don't think there is way to do

Re: Number of records per batch

2016-07-05 Thread Abdel Hakim Deneche
lowing the code with a debugger, there seem to be 4096 records in a > batch. Can this be 8192 or larger? > > On Tue, Jul 5, 2016 at 6:47 PM, Abdel Hakim Deneche <adene...@maprtech.com > > > wrote: > > > hey Eric, > > > > Can you give more information about what

Re: Number of records per batch

2016-07-05 Thread Abdel Hakim Deneche
hey Eric, Can you give more information about what you are trying to achieve ? Thanks On Tue, Jul 5, 2016 at 3:41 PM, Eric Fukuda wrote: > Hi, > > Does anyone know if there is a way to increase or specify the number of > records per batch manually? > > Thanks, > Eric >

Re: Initial Feed Back on 1.7.0 Release

2016-07-05 Thread Abdel Hakim Deneche
answers inline. On Tue, Jul 5, 2016 at 8:39 AM, John Omernik wrote: > Working with the 1.7.0, the feature that I was very interested in was the > fixing of the Metadata Caching while using user impersonation. > > I have a large table, with a day directory that can contain up

Re: Parquet Block Size Detection

2016-07-01 Thread Abdel Hakim Deneche
Just make sure you enable parquet metadata caching, otherwise the more files you have the more time Drill will spend reading the metadata from every single file. On Fri, Jul 1, 2016 at 11:17 AM, John Omernik wrote: > In addition > 7. Generally speaking, keeping number of files

Re: Parquet Block Size Detection

2016-07-01 Thread Abdel Hakim Deneche
some answers inline: On Fri, Jul 1, 2016 at 10:56 AM, John Omernik wrote: > I looked at that, and both the meta and schema options didn't provide me > block size. > > I may be looking at parquet block size wrong, so let me toss out some > observations, and inferences I am

Re: Information about ENQUEUED state in Drill

2016-07-01 Thread Abdel Hakim Deneche
Most likely planing is taking longer to finish. Once it's done, it should move to either ENQUEUED if the queuing was enabled or RUNNING if it was disabled. One easy way to confirm if planing is indeed taking too long is to just run a "EXPLAIN PLAN FOR " and see how long it takes to finish. On

Re: Drill Plugin Update

2016-06-21 Thread Abdel Hakim Deneche
SkipFirstLine is an optional parameter with a default value of "false", so when the parameter "disappears" it's actually equivalent to setting it to false. On Tue, Jun 21, 2016 at 4:30 AM, Kumar Anil7/DEL/TCS wrote: > > Hi, > > > I am using drill 1.4.0 in MapR5.1 cluster.

Re: Memory Settings for a Non-Sorted Failed Query

2016-06-13 Thread Abdel Hakim Deneche
Running out of heap could also make a Drillbit become irresponsive, eventually it will die after printing the following message in it's drillbit.out: Unable to handle out of memory condition in FragmentExecutor You may want to check your drillbits' drillbit.out for such message On Mon, Jun 13,

Re: How to specify Drill JDBC connection timeout or JDBC Query timeout

2016-06-03 Thread Abdel Hakim Deneche
ce we implement JDBC api level timeout? > > Thanks, > Hao > > On Thu, Jun 2, 2016 at 6:42 PM, Abdel Hakim Deneche <adene...@maprtech.com > > > wrote: > > > For connection timeout, there are configuration options that you can set > in > > drill-overr

Re: CTAS on MySQL*

2016-06-03 Thread Abdel Hakim Deneche
MySQL storage plugin is indeed read only, you cannot create a table on mysql through Drill. Thanks On Fri, Jun 3, 2016 at 10:05 AM, Shankar Mane wrote: > As we know, we can able to read MySQL data using drill mysql storage > plugin. But it seems it is Read Only and

Re: How to specify Drill JDBC connection timeout or JDBC Query timeout

2016-06-02 Thread Abdel Hakim Deneche
For connection timeout, there are configuration options that you can set in drill-override.conf that affect how much time the Drill client will try to connect to the server, but even then the client could actually block forever (I've seen it happen on an internal tool). Drill test framework has a

Re: Guidelines for planner.memory.max_query_memory_per_node

2016-06-02 Thread Abdel Hakim Deneche
> interested in guidelines here, keeping it at 2GB with such beefy nodes > seems to be a waste. > > John > > On Wed, Jun 1, 2016 at 3:38 PM, Abdel Hakim Deneche <adene...@maprtech.com > > > wrote: > > > I don't know about any specific guidelines for this options, b

Re: queries take over 2 min

2016-06-01 Thread Abdel Hakim Deneche
sometimes, if you have an issue in one of your storage plugin it affects all queries even those not querying that specific plugin. Do you have any enable storage plugin that's causing issues ? On Wed, Jun 1, 2016 at 2:21 PM, Scott Kinney wrote: > i'm running queries on

Re: Guidelines for planner.memory.max_query_memory_per_node

2016-06-01 Thread Abdel Hakim Deneche
I don't know about any specific guidelines for this options, but what I know is that it only affects the sort operator, and it's related to direct memory not heap memory. On Wed, Jun 1, 2016 at 1:20 PM, John Omernik wrote: > I am reposting this question here as well. (I

Re: Profiles Gone in Web UI: The great profile heist

2016-05-31 Thread Abdel Hakim Deneche
are you storing the profiles in a local folder or in nfs ? On Tue, May 31, 2016 at 12:49 PM, John Omernik wrote: > I am scratching my head at this one... I made some minor changes to my > drill-env.sh to enable gclogging, and was using the profiles in the webui > just fine.

Re: Hangout link?

2016-05-31 Thread Abdel Hakim Deneche
Sorry about the delay, there you go: https://plus.google.com/hangouts/_/event/ci4rdiju8bv04a64efj5fedd0lc On Tue, May 31, 2016 at 9:57 AM, John Omernik wrote: > > -- Abdelhakim Deneche Software Engineer Now Available - Free Hadoop On-Demand

Re: Reading GC Logs

2016-05-31 Thread Abdel Hakim Deneche
My understanding (which is incomplete) is that both the "new reader" and "dictionary encoding" are not stable yet and can cause failures or worse, incorrect data. That's why they are disabled by default. The "Allocation Failure" means that the JVM had to run a Full GC because it couldn't allocate

Re: Reading and converting Parquet files intended for Impala

2016-05-28 Thread Abdel Hakim Deneche
the new parquet reader, the complex reader, is disabled by default. You can enable it by setting the following option to true: store.parquet.use_new_reader On Sat, May 28, 2016 at 4:56 AM, John Omernik wrote: > I remember reading that drill uses two readers. One for certain

Re: Issue with Queries Hanging

2016-05-23 Thread Abdel Hakim Deneche
One question about the missing query profile: do you store the query profiles in the local file system or the distributed file system ? On Mon, May 23, 2016 at 9:31 AM, John Omernik wrote: > Hey all, this is separate, yet related issue to my other posts RE Parquet, > however,

Re: test

2016-05-17 Thread Abdel Hakim Deneche
your test succeeded ;) On Tue, May 17, 2016 at 10:17 AM, Khurram Faraaz wrote: > test email > -- Abdelhakim Deneche Software Engineer Now Available - Free Hadoop On-Demand Training

Re: CTAS Out of Memory

2016-05-13 Thread Abdel Hakim Deneche
Stefan, Can you share the query profile for the query that seems to be running forever ? you won't find it on disk but you can append .json to the profile web url and save the file. Thanks On Fri, May 13, 2016 at 9:55 AM, Stefan Sedich wrote: > Zelaine, > > It does, I

Re: Queries and Timeout

2016-05-13 Thread Abdel Hakim Deneche
Long running queries shouldn't timeout. This is most likely a bug. Is it reproducible ? Can you give more details about the query ? Thanks On Mon, May 9, 2016 at 12:30 PM, Subbu Srinivasan wrote: > What is the best way to implement queries that are long running? If

Re: workspaces

2016-05-13 Thread Abdel Hakim Deneche
I believe Drill stores storage plugins in different places when running in embedded mode vs distributed mode. Embedded mode uses local disk and distributed mode uses Zookeeper. On Fri, May 13, 2016 at 9:08 AM, Odin Guillermo Caudillo Gallegos < odin.guille...@gmail.com> wrote: > The plugins are

Re: problem running drill in 10minutes tutorial on macpro

2016-05-04 Thread Abdel Hakim Deneche
Hey, Unfortunately, the Apache mailing list blocks attachments so we are not able to see the error message. If you want you can just copy past the error messages here, or share a link to the screenshots. Thanks On Wed, May 4, 2016 at 4:23 PM, Rita Kuo wrote: > Hi, > > I

Re: Where is this C++ API mentioned in Drill docs

2016-04-09 Thread Abdel Hakim Deneche
I believe the source code is part of Drill distribution, in the following folder: contrib/native/client It also contains an example folder with a fairly good demonstration on how to use the API to submit queries to Drill. Thanks On Sat, Apr 9, 2016 at 2:23 PM, Devender Yadav

Re: How to modify connection timeout delay ?

2016-04-05 Thread Abdel Hakim Deneche
On Tue, Apr 5, 2016 at 7:59 AM, COUERON Damien (i-BP - MICROPOLE) < damien.coueron_s...@i-bp.fr> wrote: > Despite the log below, what kind of details are you interested in ? > > > > -Message d'origine----- > De : Abdel Hakim Deneche [mailto:adene...@maprtech.com] >

Re: Reading Avro Arrays

2016-04-03 Thread Abdel Hakim Deneche
pull requests are fine. You still need a JIRA though On Sun, Apr 3, 2016 at 8:03 AM, Johannes Schulte wrote: > I now extended the AvroFormatTest-Suite by two unit tests that show that > > * Flattening of primitive array works as expected > * Flattening of arrays of

Re: How to modify connection timeout delay ?

2016-04-02 Thread Abdel Hakim Deneche
Hi Damien, Like Jason said, we have a heartbeat mechanism that should've prevented this issue all together, so I'm interested to learn how this is happening. We've seen this happen many times but so far we were never able to reproduce it. Could you give us more details so we can reproduce the

Re: simple join failing for SQL server.

2016-03-31 Thread Abdel Hakim Deneche
gt; Yes Abdel. I will try with 1.6 & let you know. > > > > Regards, > > Devender > > > > On Thu, Mar 31, 2016 at 12:43 PM, Abdel Hakim Deneche < > > adene...@maprtech.com> wrote: > > > >> "we" did fix so many of them. =P > >> >

Re: simple join failing for SQL server.

2016-03-31 Thread Abdel Hakim Deneche
<dev@gmail.com> > > wrote: > > > >> Yes Abdel. I will try with 1.6 & let you know. > >> > >> Regards, > >> Devender > >> > >> On Thu, Mar 31, 2016 at 12:43 PM, Abdel Hakim Deneche < > >> adene...@maprtech.com&

Re: simple join failing for SQL server.

2016-03-31 Thread Abdel Hakim Deneche
"we" did fix so many of them. =P On Thu, Mar 31, 2016 at 8:12 AM, Abdel Hakim Deneche <adene...@maprtech.com> wrote: > Hi Devender, > > Whenever you see such errors, it's Drill's internal memory accounting > reporting a memory leak. This is "always" a

Re: simple join failing for SQL server.

2016-03-31 Thread Abdel Hakim Deneche
Hi Devender, Whenever you see such errors, it's Drill's internal memory accounting reporting a memory leak. This is "always" a bug, but you did fix so many of them(*) since 1.4 and we even improved the memory allocator in 1.5. Do you want to try again on the latest version and see if you still

Re: IllegalStateException: Memory was leaked by query - Drill error on wide table, but OK on a narrow but longer table.

2016-03-26 Thread Abdel Hakim Deneche
Hey Edmon, Can you search the logs for the errorId displayed with this error ? I've seen a similar issue when external sort fails to spill to disk (no space left on disk), it would leak memory and display this error message instead of the original issue (problem spilling to disk). I will open a

Re: Code too large

2016-03-24 Thread Abdel Hakim Deneche
This exception states that the code generated for the project is too big for that Java compiler. Can you share the query that caused this failure ? On Thu, Mar 24, 2016 at 1:27 PM, Edmon Begoli wrote: > Does anyone know what might be causing this exception: > > *Error: SYSTEM

Re: unable to start Drill 1.6.0

2016-03-19 Thread Abdel Hakim Deneche
; classpath.scanning: { > packages: [ > "com.companyname.drill.*" > ] > } > } > */ > > *But DRILL GET SHUTDOWN on all nodes.* > > > > > *Please help me to resolved this issue. Or suggest any other way to invoke > my custome UDFs. * > > > > >

Re: unable to start Drill 1.6.0

2016-03-19 Thread Abdel Hakim Deneche
Easiest fix when Drill fails to load a storage plugin is to delete the existing configurations. Deleting /tmp/drill/ should resolve this. I know this may not be practical in some cases, and other developers may give you a better solution. On Thu, Mar 17, 2016 at 2:13 PM, Shankar Mane

Re: Drill join performance

2016-03-18 Thread Abdel Hakim Deneche
One quick note here, I don't think partitioning LINEORDER table on LO_ORDERDATE would help this query. If you look at the query profile you will see that Drill is reading everything from LINEORDER. On Fri, Mar 18, 2016 at 7:57 AM, Dmitry Krivov wrote: > Just for info : > >

Re: unable to start Drill 1.6.0

2016-03-18 Thread Abdel Hakim Deneche
Hi Shankar, mailing list doesn't allow attachments, can you post the file in some public place and share link ? Thanks On Thu, Mar 17, 2016 at 1:51 PM, Shankar Mane wrote: > I am not able to start drill 1.6.0. Please find the attached file for more > details. > >

Question about Text Files documentation

2016-03-16 Thread Abdel Hakim Deneche
In this documentation page: http://drill.apache.org/docs/text-files-csv-tsv-psv/ We can read the following: Using a distributed file system, such as HDFS, instead of a local file > system to query the files also improves performance because currently Drill > *does > not split* files on block

Re: NumberFormatException with cast to double?

2016-03-10 Thread Abdel Hakim Deneche
Looks like the COALESCE function is the source of the problem. Passing a double (0.0) instead of an int (0) as a second expression solved the problem for me: CAST(COALESCE(t_total, 0.0) AS double) On Fri, Mar 11, 2016 at 12:45 AM, Matt wrote: > ~~~ > 00-01

Re: Drill with String Aggregation

2016-03-08 Thread Abdel Hakim Deneche
You can always develop a User Defined Aggregate Function: http://drill.apache.org/docs/develop-custom-functions/ Thanks On Wed, Mar 9, 2016 at 12:29 AM, Bosung Seo wrote: > Hello, > > I found that Drill doesn't support string_agg function yet. > Is there another way to

Re: The praises for Drill

2016-02-26 Thread Abdel Hakim Deneche
Looking forward to reading the paper! On Fri, Feb 26, 2016 at 10:19 AM, Parth Chandra wrote: > Welcome back Edmon, and thanks for the praise :). Hope to see you on the > next hangout. > > On Thu, Feb 25, 2016 at 7:27 PM, Edmon Begoli wrote: > > > Hello

Re: Drill error with large sort

2016-02-25 Thread Abdel Hakim Deneche
, Feb 25, 2016 at 5:26 PM, Abdel Hakim Deneche <adene...@maprtech.com> wrote: > Short answer: > > increase the value of planner.memory.max_query_memory_per_node, by default > it's set to 2GB, try setting to 4 or even 8GB. This should get the query to > pass. > > On Thu, Fe

Re: Date Format conversion

2016-02-23 Thread Abdel Hakim Deneche
more precisely you can use TO_DATE . The following worked for me: TO_DATE('01/25/2016', 'MM/dd/') On Tue, Feb 23, 2016 at 10:48 AM, Neeraja Rentachintala < nrentachint...@maprtech.com> wrote: > Please refer to

Re: Drill join performance

2016-02-22 Thread Abdel Hakim Deneche
Hello Dmitry, Welcome to Drill's community :) What version of Drill are you using ? Also, can you share the query profile of your query, it helps to show what taking most of the time. Thanks On Mon, Feb 22, 2016 at 10:54 AM, Dmitry Krivov wrote: > Hello > > I have load

Re: what am I missing?

2016-02-22 Thread Abdel Hakim Deneche
To run Drill in distributed mode you need to have Zookeeper up and running. This shouldn't be too complicated, you can find more details here: https://zookeeper.apache.org/doc/r3.1.2/zookeeperStarted.html#sc_InstallingSingleMode On my Mac I used brew and it took care of everything. On Mon, Feb

Re: One single query for more files JSON

2016-02-21 Thread Abdel Hakim Deneche
nsions": [ > "tsv" > ], > "delimiter": "\t" > }, > "parquet": { > "type": "parquet" > }, > "json": { > "type": "json" > }, >

Re: One single query for more files JSON

2016-02-19 Thread Abdel Hakim Deneche
> > > > yes with windows > > > > > > > > > 2016-02-12 17:11 GMT+01:00 Zelaine Fong <zf...@maprtech.com>: > > > > > > Are you running on Windows? If so, perhaps this is DRILL-4305? > > > > > > -- Zelaine > > > > &

Re: Drill Doc Question: Multi Tenant Clusters

2016-02-15 Thread Abdel Hakim Deneche
core nodes vs 24 core nodes for example) > > > > On Mon, Feb 15, 2016 at 1:37 PM, Abdel Hakim Deneche < > adene...@maprtech.com> > wrote: > > > so yes, you are correct, you should set it to 1 x 32 x 0.7 > > > > Btw, Drill should already have set thi

Re: Drill Doc Question: Multi Tenant Clusters

2016-02-15 Thread Abdel Hakim Deneche
so yes, you are correct, you should set it to 1 x 32 x 0.7 Btw, Drill should already have set this option to 32 x 0.7 On Mon, Feb 15, 2016 at 11:36 AM, Abdel Hakim Deneche <adene...@maprtech.com > wrote: > Don't be, it took me quite some time to figure out this one either =P > &

Re: Drill Doc Question: Multi Tenant Clusters

2016-02-15 Thread Abdel Hakim Deneche
ut I think you are saying that we should be taking 1 > drill bit per node * 32 * 0.7 ... correct? > > Quote from the docs: > number of active drillbits (typically one per node) * number of cores per > node * 0.7 > > On Mon, Feb 15, 2016 at 11:15 AM, Abdel Hakim Deneche < >

Re: Drill Doc Question: Multi Tenant Clusters

2016-02-15 Thread Abdel Hakim Deneche
No, it's the maximum number of threads each drillbit will be able to spawn for every major fragment of a query. If you run a query on a cluster of 32 core machines, and the query plan contains multiple major fragments, each major fragment will have "at most" 32 x 0.7= 23 minor fragments (or

Re: One single query for more files JSON

2016-02-12 Thread Abdel Hakim Deneche
Of course, if the schema changes between files, this will most likely cause the query to fail On Fri, Feb 12, 2016 at 7:42 AM, Abdel Hakim Deneche <adene...@maprtech.com> wrote: > Yes, it should work. > > On Fri, Feb 12, 2016 at 7:31 AM, Paolo Spanevello <paolosp...@gmail.com&g

Re: One single query for more files JSON

2016-02-12 Thread Abdel Hakim Deneche
Yes, it should work. On Fri, Feb 12, 2016 at 7:31 AM, Paolo Spanevello wrote: > Dear All, > > Could i've a single query for more json files ? > > Example: > >- /user/folder1/file1.json >- /user/folder2/file2.json >- /user/folder3/file3.json > > Query: > SELECT

expected behavior when using wild cards in table name

2016-02-11 Thread Abdel Hakim Deneche
I have the following table tpch100/lineitem that contains 97 parquet files: tpch100/lineitem/part-m-0.parquet tpch100/lineitem/part-m-1.parquet tpch100/lineitem/part-m-2.parquet ... tpch100/lineitem/part-m-00096.parquet I can run the following queries: SELECT COUNT(*) FROM

Re: Query Planning and Directory Pruning

2016-02-09 Thread Abdel Hakim Deneche
79 seconds to > > plan the query. Indicating to me the issue isn't in the plan that was > > created, but the actual planning process. (Let me know if you disagree or > > still need to see the plan, like I said, the actual plans were exactly > the > > same) > > > > >

Re: Source for drill's calcite?

2016-02-09 Thread Abdel Hakim Deneche
You can find the r10 branch here: https://github.com/mapr/incubator-calcite/tree/DrillCalcite1.4.0 On Tue, Feb 9, 2016 at 8:00 AM, Jason Altekruse wrote: > I can't find the latest version either, but this is the r9 branch. I don't > think any very major changes

Re: Dealing with files created in Windows

2016-02-08 Thread Abdel Hakim Deneche
is dos2unix an option ? On Mon, Feb 8, 2016 at 9:56 AM, John Omernik wrote: > Are there any decent tricks for dealing with Windows based text files (that > use /r/n as the line ending rather than just /n) > > Right now my last field has /r showing up, and I'd like to not have

Re: Query Planning and Directory Pruning

2016-02-04 Thread Abdel Hakim Deneche
Hey John, can you try an explain plan for both queries and see how much times it takes ? for example, for the first query you would run: *explain plan for* select count(1) from `data/2016-02-03`; It can also be helpful if you could share the query profiles for both queries. Thanks On Thu, Feb

Re: UDF - BooleanHolder

2016-02-03 Thread Abdel Hakim Deneche
It's called BitHolder On Wed, Feb 3, 2016 at 3:12 PM, Nicolas Paris wrote: > Hello, > > Hello, > > I would like to create a user defined function that would return a boolean > value. > Use case would be : > > SELECT * FROM x WHERE MyFunction(); > > I haven't found any

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Abdel Hakim Deneche
Hi Stefán, Can you open a JIRA for this, please ? Thanks On Tue, Feb 2, 2016 at 6:21 AM, Stefán Baxter wrote: > Hi, > > I can confirm that this same query+avro-files work in 1.4 so this is > probably a regression > > Regards, > -Stefan > > On Tue, Feb 2, 2016 at

Re: DRILL 1.4 - newline in strings not supported

2016-02-01 Thread Abdel Hakim Deneche
<nipari...@gmail.com> wrote: > Abdel, > > select * on my csv file fails as well > > Thanks > > 2016-02-01 17:16 GMT+01:00 Abdel Hakim Deneche <adene...@maprtech.com>: > > > When you run a select * on your csv file, does it succeed or fail ? > > > >

Re: DRILL 1.4 - newline in strings not supported

2016-02-01 Thread Abdel Hakim Deneche
tributed calculus > ? > > Do you have a fair number of files?​ > ​I have one 30GB csv file. I don't know how many parquet file it could > create as process crashes because of newlines. > I can imagine approx 5 parquet files 500 MB. > > Thanks,​ > > > 2016-02-01 16:

Re: DRILL 1.4 - newline in strings not supported

2016-02-01 Thread Abdel Hakim Deneche
Hey Nicolas, what kind of queries are you running on your csv file ? On Sun, Jan 31, 2016 at 12:14 PM, Nicolas Paris wrote: > Hello, > > I am trying to import a csv containing large texts. They contains newline > character "\n". > Apache Drill conplains about that. There

Re: DRILL 1.4 - newline in strings not supported

2016-02-01 Thread Abdel Hakim Deneche
or equivalent)) > Usually I used postgresql or monetdb in order to mine the texts, but I am > benchmarking/studying apache drill too. > > Thanks, > > > 2016-02-01 15:54 GMT+01:00 Abdel Hakim Deneche <adene...@maprtech.com>: > > > Hey Nicolas, > > > > wh

Re: CTAS error with CSV data

2016-01-26 Thread Abdel Hakim Deneche
(SqlLine.java:746) > at sqlline.SqlLine.begin(SqlLine.java:621) > at sqlline.SqlLine.start(SqlLine.java:375) > at sqlline.SqlLine.main(SqlLine.java:268) > ~~~ > > It also looks like if I run the SELECT from a bash shell as "sqlline -u > ... -f te

Re: CTAS error with CSV data

2016-01-26 Thread Abdel Hakim Deneche
ill.jdbc.impl.DrillConnectionImpl > ~~~ > > Is that index a byte or line offset? > > > On 26 Jan 2016, at 12:55, Abdel Hakim Deneche wrote: > > Does a select * on the same data also fail ? >> >> On Tue, Jan 26, 2016 at 9:44 AM, Matt <bsg...@gmail.com> w

Re: Drill ODBC: format number on excel looks like Text

2016-01-23 Thread Abdel Hakim Deneche
Could this help ? http://superuser.com/questions/385511/easy-way-to-one-off-import-data-with-different-decimal-separator-in-excel The solution is a bit old though and newer version of Excel may not offer the option anymore. On Sat, Jan 23, 2016 at 3:11 PM, Paolo Spanevello

Re: JDBC Driver - Possible regression

2016-01-20 Thread Abdel Hakim Deneche
Stefán, Please reopen the JIRA and add a comment describing what you are seeing. Thanks On Wed, Jan 20, 2016 at 4:34 AM, Stefán Baxter wrote: > Hi again, > > We have verified that the error exists on master:head (1.5-SNAPSHOT). > > Regards, > -Stefan > > On Wed,

Re: Too many open files

2016-01-11 Thread Abdel Hakim Deneche
Hi Ian, Can you open up a JIRA for this ? is it easy to reproduce ? Thanks On Mon, Jan 11, 2016 at 8:59 AM, Ian Maloney wrote: > Hi, > > I've been running a lot of queries via jdbc/drill. I have four drillbits, > but I could not get the zk jdbc URL to work so I

Re: Issue in developing UDF

2016-01-11 Thread Abdel Hakim Deneche
"null6"; > > >> > byte[] valueDecoded = e.getMessage().getBytes(); > > >> > outValue.buffer = > > >> buffer.reallocIfNeeded(valueDecoded.length); > > >> > outValue.start = 0; > > >> >

Re: Issue in developing UDF

2016-01-06 Thread Abdel Hakim Deneche
According to Drill documentation: http://drill.apache.org/docs/adding-custom-functions-to-drill/ You need to copy both class jar and source jar of your UDF to $DRILL_HOME/jars/3rdparty/ did you do it ? On Tue, Jan 5, 2016 at 11:58 PM, Nirav Shah wrote: > Hi , > > >

Re: Drill Query Problem

2015-12-04 Thread Abdel Hakim Deneche
Hi Nirav, can you give us more information to help reproduce this issue ? thanks On Fri, Dec 4, 2015 at 3:42 AM, Nirav Shah wrote: > Hello, > > I am getting below error while running big query. > > === > > Error: SYSTEM ERROR:

Re: Infinite pending Bug ?

2015-11-13 Thread Abdel Hakim Deneche
gs). > > Any slight modification to the create View2 query in the "trim" section > makes the select * query succeed (for example changing '' to ' '). I'm not > sure if this will help narrow down the issue though. > > -Abhishek > > On Fri, Nov 13, 2015 at 8:55 AM,

Re: Infinite pending Bug ?

2015-11-13 Thread Abdel Hakim Deneche
Hello Boris, What version of Drill are you using ? Thanks On Fri, Nov 13, 2015 at 8:33 AM, Hsuan Yi Chu wrote: > Do you know if it is stuck at planning? > > On Fri, Nov 13, 2015 at 8:03 AM, Boris Chmiel < > boris.chm...@yahoo.com.invalid> wrote: > > > Hello every one, > >

Re: Help with Troubleshooting dense error message

2015-11-04 Thread Abdel Hakim Deneche
> variable I was changed was the json as text option at the session level and > I was setting it on both the pre drillbit reboot and the post drillbit > reboot sessions (I need that to query the data). > > On Wed, Nov 4, 2015 at 12:46 PM, Abdel Hakim Deneche < > adene...@maprtech.co

Re: Help with Troubleshooting dense error message

2015-11-04 Thread Abdel Hakim Deneche
in, I'll ensure I set that to get a > full stack trace. > > John > > On Wed, Nov 4, 2015 at 12:13 PM, Abdel Hakim Deneche < > adene...@maprtech.com> > wrote: > > > The error message "index: 9604, length: 4 (expected: range(0, 8192))" > > suggests

Re: Help with Troubleshooting dense error message

2015-11-04 Thread Abdel Hakim Deneche
The error message "index: 9604, length: 4 (expected: range(0, 8192))" suggests an error happened when Drill tried to access a memory buffer (most likely while writing an int or float value) This may be a bug actually exposed by that particular data record. You can try enabling verbose error

Re: Drill Query Error

2015-10-29 Thread Abdel Hakim Deneche
Hi Sanjeev, are you running a single query or multiple queries concurrently ? Thanks On Thu, Oct 29, 2015 at 9:24 AM, Andries Engelbrecht < aengelbre...@maprtech.com> wrote: > Information that will be more helpful are answering questions like the > following. > > What DFS are you trying to

Re: Exception with CSV storage format : Repeated types are not supported

2015-10-27 Thread Abdel Hakim Deneche
Hey Chandan, I assume 'parquetlogs' contain parquet files, right ? what is the schema of 'parquetlogs' ? does it contain repeated fields ? thanks On Tue, Oct 27, 2015 at 2:40 AM, chandan prakash wrote: > Hi everyone, > Can anyone help how to write CTAS query with

Re: CTAS over empty file throws NPE

2015-10-22 Thread Abdel Hakim Deneche
Chandan actually found the JIRA: https://issues.apache.org/jira/browse/DRILL-3539 On Thu, Oct 22, 2015 at 10:25 AM, Neeraja Rentachintala < nrentachint...@maprtech.com> wrote: > Hsuan > Is there is a JIRA for this? > > On Thu, Oct 22, 2015 at 10:11 AM, Hsuan Yi Chu >

Re: Drill CTAS to single file

2015-10-21 Thread Abdel Hakim Deneche
Another way to do it is to let sqlline save the csv file for you, this way you won't have to worry about Drill's parallelization, but you might need to make slight changes to your storage plugin to properly read sqlline's csv files. For example, I have the following CTAS: create table e as

[ANNOUNCE] Release of Apache Drill 1.2.0

2015-10-17 Thread Abdel Hakim Deneche
It is my pleasure to announce the release of Apache Drill 1.2.0. This release of Drill fixes many issues and introduces a number of enhancements, including the following ones: - Support for JDBC data sources, such as MySQL, through a new JDBC Storage plugin - Partition pruning improvements -

[VOTE] Release Apache Drill 1.2.0 RC1

2015-10-08 Thread Abdel Hakim Deneche
Hi, I propose the second release candidate of Apache Drill, version 1.2.0. Here is a list of all JIRAs that have been resolved in this release [1]. The tarball artifacts are hosted at [2] and the maven artifacts are hosted at [3]. The vote will be open for the next 72 hours ending at 7AM

Re: Parquet #Files and closing accountor error

2015-10-08 Thread Abdel Hakim Deneche
orking, so I reverted back to Drill 1.1 > as packaged by MapR. > > > > On Thu, Oct 8, 2015 at 10:42 AM, Abdel Hakim Deneche < > adene...@maprtech.com> > wrote: > > > We fixed a similar issue as part of Drill 1.2. Can you give it a try to > see > >

Re: Parquet #Files and closing accountor error

2015-10-08 Thread Abdel Hakim Deneche
Hey John, The error you are seeing is a memory leak. Drill's allocator found that about 1MB of allocated memory wasn't released at the end of the fragment's execution. What version of Drill are you using ? can you share the result of: select * from sys.version; Thanks On Thu, Oct 8, 2015 at

[VOTE] Release Apache Drill 1.2.0 RC2

2015-10-08 Thread Abdel Hakim Deneche
Hi all, I'm enjoying the release management so much that I decided to propose a third RC of Apache Drill 1.2.0 The tarball artifacts are hosted at [1] and the maven artifacts are hosted at [2]. The vote will be open for the next 72 hours ending at 2PM Pacific, October 11, 2015. [ ] +1 [ ] +0 [

Re: Using Drill JDBC V1.2 with V1.1 release

2015-09-30 Thread Abdel Hakim Deneche
Hi Nikunj, What kind of dependcy issues you are seeing ? DRILL-3589 purpose was to improve Drill JDBC dependencies but I think the JDBC driver didn't have any "known" issues in 1.1 Thanks On Wed, Sep 30, 2015 at 12:02 PM, Daniel Barclay wrote: > Nikunj Thakkar wrote: >

Re: JDBC connection pool

2015-09-29 Thread Abdel Hakim Deneche
This is most likely a bug. I think it's similar to the following bug: https://issues.apache.org/jira/browse/DRILL-3763 Please take a look at it and feel free to add more information to the ticket. Thanks On Tue, Sep 29, 2015 at 2:20 AM, xia wrote: > Hi, everyone, > > >

Re: CTAS exception

2015-09-18 Thread Abdel Hakim Deneche
This kind of errors usually happens when there is an unsupported schema change in the json files, but you should be able to reproduce the error with just a select statement. Can you share both queries you tried (the failing CTAS and the successful SELECT *) ? Thanks On Fri, Sep 18, 2015 at 5:38

Re: CTAS exception

2015-09-18 Thread Abdel Hakim Deneche
p 18, 2015 at 4:02 PM, Stefán Baxter < > > > ste...@activitystream.com> > > > > wrote: > > > > > > > >> Hi, > > > >> > > > >> Both statements select everything but the CTAS statement included a > > date > &g

Re: Resetting an option

2015-09-17 Thread Abdel Hakim Deneche
I am looking at the corresponding pull request: https://github.com/apache/drill/pull/159 and I have a question I can't seem to find an answer in this discussion: Let's say a user changes an option A both at the SESSION and SYSTEM level. What happens when the users issues "ALTER SYSTEM RESET A",

Re: Reg Connecting Apache Drill to Oracle DB

2015-09-15 Thread Abdel Hakim Deneche
Hi Siva, Now that the JDBC storage plugin has been merged into master, you can just get the latest version of Drill from github and build it yourself. Let me know if you need more information about how to do it. Thanks On Tue, Sep 15, 2015 at 7:12 AM, Tugdual Grall wrote: >

Re: Quering parquet is giving Reading past RLE/BitPacking stream

2015-09-15 Thread Abdel Hakim Deneche
Hi Anas, Can you please open a JIRA for this ? It would be really helpful if you can attach the parquet file to the JIRA Thanks On Mon, Sep 14, 2015 at 9:21 PM, Anas Mesrah wrote: > Hi, > > I am trying to query large parquet files -local file system- that has > string

Community Hangout happening now!

2015-09-15 Thread Abdel Hakim Deneche
Please join us in the weekly community hangout: https://plus.google.com/hangouts/_/event/ci4rdiju8bv04a64efj5fedd0lc -- Abdelhakim Deneche Software Engineer Now Available - Free Hadoop On-Demand Training

Re: BlockMissingException

2015-09-09 Thread Abdel Hakim Deneche
Hi Grant, Do you see any other errors in the logs ? I don't think the WorkEventBus warning has anything to do with the issue. It's a warning you can expect to see for failed/cancelled queries. Thanks On Wed, Sep 9, 2015 at 10:32 AM, Grant Overby (groverby) wrote: > I'm

Re: what is the meaning of the 0 in '0:jdbc:drill'?

2015-09-08 Thread Abdel Hakim Deneche
Because Sqlline can open multiple jdbc connections, each connection receives a unique id (starting from 0). "0" is just the id of the jdbc connection you are using. Thanks On Sun, Sep 6, 2015 at 9:02 PM, 寻 <1813710...@qq.com> wrote: > -- 原始邮件 -- > 发件人:

  1   2   >