ogether the
hybrid design described by Ted and Paul - and makes parquet be more than
static archive.
https://docs.delta.io/latest/delta-intro.html
--
nicolas paris
Hi
This looks an interesting design.
Am I correct such view
would hit the RDBMS for every query ?
However such view would hit the parquet file only when
the timestamp predicate would match a partition ?
Any news on a recent test to confirm the design ?
Thanks
2018-03-20 6:49 GMT+01:00 Ted
Hi,
Join csv, json, databases.
Your needs looks like ETL processes. I am not sure drill suits well for
such goal. AFAIK, it is not able to work on disk when out of memory
occures.
Moreover those tasks usally needs some procedural code parts. I am not
sure UDFs are very flexible.
For such use
Hi Weiwan,
1)Does both drill and hive support predicat pushdown with indexR ? I mean
using the indexes and not scanning table.
2)Does it support join push down, sort etc ?
3)Can you elaborate why your team choose Drill versus equivalent (impala,
presto...)
Thanks !
2017-01-04 2:59 GMT+01:00
2016-12-26 16:22 GMT+01:00 Bob Rudis :
> I guess I read more into "ANSI SQL" than I should have. While it does
> handle some complex SQL generated by dplyr (which you can see at
> https://github.com/hrbrmstr/sergeant if you scroll down) there are
> instances when if, say you try to
gt;>
> >> I think it is fixed in 1.8 .try it on
> >>
> >> Get Outlook for iOS<https://aka.ms/o0ukef>
> >>
> >>
> >>
> >>
> >> On Fri, Oct 7, 2016 at 12:18 PM +0300, "Nicolas Paris" <
> >&g
Hey,
I have run tests on drill on a standalone installation (1 computer 8
core/32GO ram).
I will get soon a 5 computer cluster (8 core/96GO ram each).
Is it possible to get an estimation of the performance gain ?
Is it linear ? Will the performance get better ? Worst ?
I just want an
@Ted
My company (MapR), for instance, makes a database that would likely work,
but many
will not work.
You must be thinking about maprDB right ? Can't drill query directly maprDB
? What is the botleneck in such situation ? Does drill wait for maprDB to
give the resultset ? Where can we get
mentExecutor$1.run():256
> org.apache.drill.exec.work.fragment.FragmentExecutor$1.run():250
> java.security.AccessController.doPrivileged():-2
> javax.security.auth.Subject.doAs():415
> org.apache.hadoop.security.UserGroupInformation.doAs():1595
> org.apache
where srcday = '2016-02-05' and
contains(domain_name, '\.com$'); working too.
I keep you aware new version
2016-02-09 19:22 GMT+01:00 Nicolas Paris <nipari...@gmail.com>:
> John,
>
> About the escape, I will explore that question.
> About your query, you may try this pattern
; > select count(1) from view_mydata where srcday = '2016-02-05' and
> contains(domain_name, '\\.com$');
> +-+
> | EXPR$0 |
> +-+
> | 0 |
> +-+
> 1 row selected (201.391 seconds)
>
>
>
> On Tue, Feb 9, 2016 at 9:34 AM, Nicolas
ble. I think what happened in Hive is the regex
> commands started with the users having the escape and now there are just to
> many things that using the escaped regex and the project doesn't want to
> adjust.
>
>
>
>
> On Thu, Feb 4, 2016 at 1:38 PM, Nicolas Paris &
Hello,
I have problem to load csv containg double double quote eg:
col1;col2;col3
1;"\"\"foo\"\"";"bar"
1;"\"\"foo\"\"";"bar"
Thanks !
ser has to do that. If there was a way to do it in Drill, that
> would save countless people hours and save many mistakes.
>
> On Thu, Feb 4, 2016 at 12:03 PM, Nicolas Paris <nipari...@gmail.com>
> wrote:
>
> > John, Jason,
> >
> > 2016-02-04 18:47 GMT+01:00 J
Altekruse <altekruseja...@gmail.com>:
> Awesome, thanks!
>
> On Thu, Feb 4, 2016 at 7:44 AM, Nicolas Paris <nipari...@gmail.com> wrote:
>
> > Well I am creating a udf
> > good exercise
> > I hope a PR soon
> >
> > 2016-02-04 16:37 GMT+01:00 Jason Al
or your commit message, prefixing with the JIRA number in the
> > format
> > Example:
> > DRILL-: Feature description
> >
> > This will automatically link the PR to your JIRA.
>
Ok I will try thanks
a lot
> > - Jason
> >
> > On Thu, Feb 4, 201
it's an included
> > UDF like REGEX(field, pattern) using Java's library for regex like Hive
> > does. That would be EXTREMELY helpful.
> >
> >
> >
> >
> >
> > On Tue, Feb 2, 2016 at 6:55 AM, Nicolas Paris <nipari...@gmail.com>
> wrote:
>
Hello,
Hello,
I would like to create a user defined function that would return a boolean
value.
Use case would be :
SELECT * FROM x WHERE MyFunction();
I haven't found any BooleanHolder in order to.
@Output
BooleanHolder out;
The only way I have is:
SELECT * FROM x WHERE MyFunction()
push it into
github then ?
> Doesn't it enough 'LIKE' operator?
>
Sadly not, I'am looking for complex pattern matching.
--
> Miura, Masahide
>
> -Original Message-
> From: Nicolas Paris [mailto:nipari...@gmail.com]
> Sent: Tuesday, February 02, 2016 9:04 PM
>
Abdel,
select * on my csv file fails as well
Thanks
2016-02-01 17:16 GMT+01:00 Abdel Hakim Deneche <adene...@maprtech.com>:
> When you run a select * on your csv file, does it succeed or fail ?
>
> On Mon, Feb 1, 2016 at 7:53 AM, Nicolas Paris <nipari...@gmail.com>
line to allow easy
> splitting of csv files.
>
> On Mon, Feb 1, 2016 at 8:24 AM, Nicolas Paris <nipari...@gmail.com> wrote:
>
> > Abdel,
> >
> > select * on my csv file fails as well
> >
> > Thanks
> >
> > 2016-02-01 17:16 GMT+01:00 Abdel Hakim
lem was related to a bug in the LIKE function.
> Unfortunately he never got around to fill a JIRA for his issue.
>
> Is your problem similar ? if yes, then can you please fill a JIRA ?
>
> On Mon, Feb 1, 2016 at 7:26 AM, Nicolas Paris <nipari...@gmail.com> wrote:
>
> >
Hello,
I am trying to import a csv containing large texts. They contains newline
character "\n".
Apache Drill conplains about that. There is a jira issue opened on
23 matches
Mail list logo