Re: License headers inside Javadoc comments

2024-01-29 Thread Paul Rogers
James, If the extra check is costly, you might also observe that all (most?) existing files have the proper header format. It is only new or changed files that must be checked. So, you can use Git to determine the change set on each PR and do the extra format check only on those files. - Paul

Re: [Important] GSoC 2024 Project Ideas

2024-01-27 Thread Paul Rogers
Some ideas: * Time marches on. Drill has a design from ten years back. What modern environment things do current users need? Integration with Amazon Glue? Delta lake/lakehouse/whatever the cool new thing is? Integration with the latest & greatest BI tools? * Seems many folks use Drill as a

Re: License headers inside Javadoc comments

2024-01-26 Thread Paul Rogers
Hi James, For some reason, Drill started with the license headers in Javadoc comments. The (weak) explanation I got was that we never generate Javadoc, so it didn't really matter. Later, we started converting the headers to regular comments when convenient. If we were to generate Javadoc, having

Re: Possible Regression: Can't build current master

2024-01-25 Thread Paul Rogers
checkout master git reset --hard origin/master Use these with caution: I used a slightly different set to update my own branch. Caveat emptor. This assumes that your Drill clone is "origin". - Paul On Thu, Jan 25, 2024 at 12:48 PM Paul Rogers wrote: > The symbols in questions are so

Re: Possible Regression: Can't build current master

2024-01-25 Thread Paul Rogers
The symbols in questions are some I modified in my recent PR. I wonder if there was a merge issue somewhere? The PR did get a clean build on the master branch. I'll try a build myself to see if I can locate the issue. - Paul On Thu, Jan 25, 2024 at 10:56 AM Charles Givre wrote: > All, > I

Re: Parquet files with microsecond columns

2024-01-22 Thread Paul Rogers
Hi Peter, It sounds like you are on the right track: the new option is the quick short-term solution. The best long-term solution is to generalize Drill's date/time type, but that would take much more work. (Drill also has a bug where the treatment of timezones is incorrect, which forces Drill to

[jira] [Resolved] (DRILL-8375) Incomplete support for non-projected complex vectors

2024-01-07 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-8375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-8375. Resolution: Fixed > Incomplete support for non-projected complex vect

UNION and LIST non-projection support in EVF

2024-01-06 Thread Paul Rogers
Hi All, Happy New Year! I dusted off the work to add non-projection support in EVF for the UNION and LIST types. I believe that only REPEATED LIST is missing. "Non-projection support" just means that you can read a JSON file that requires a UNION or LIST vector, and tell EVF to NOT actually

Re: Next Version

2024-01-01 Thread Paul Rogers
> occasion to use the Union once. However, when I used it, it consumed so > much memory, we ended up finding a workaround anyway and stopped using it. > Honestly, since we improved the implicit casting rules, I think Drill is a > lot smarter about how it reads data anyway. Bottom

Re: Next Version

2024-01-01 Thread Paul Rogers
a user of Drill, I've only had the > occasion to use the Union once. However, when I used it, it consumed so > much memory, we ended up finding a workaround anyway and stopped using it. > Honestly, since we improved the implicit casting rules, I think Drill is a > lot smarter about ho

Re: Next Version

2023-12-30 Thread Paul Rogers
Hi Luoc, Thanks for reminding me about the EVF V2 work. I got mostly done adding projection for complex types, then got busy on other projects. I've yet to tackle the hard cases: unions, repeated unions and repeated lists (which are, in fact, repeated repeated unions). The code to handle

Re: assistance needed debugging drill + daffodil

2023-12-07 Thread Paul Rogers
Hi Mike, I wonder if you've got an array in there somewhere? Either in the data, or you're creating an array in your code in response to the data? If you have just scalars, then all you need to do is start a row, write the scalars, and end the row. The starting and ending are done automagically

Re: Drill TupleMetadata created from DFDL Schema - how do I inform Drill about it

2023-10-18 Thread Paul Rogers
gt; > > This is just to get enough working (against local files only) that I can > be > > unblocked on creating and testing the rest of the Daffodil-to-Drill > > metadata bridge and data bridge. > > > > My plan is to get all kinds of data and queries working fir

Re: Drill TupleMetadata created from DFDL Schema - how do I inform Drill about it

2023-10-18 Thread Paul Rogers
records, etc. > > > > Thanks for the help. > > > >> > >>> On Oct 12, 2023, at 2:58 PM, Mike Beckerle <mailto:mbecke...@apache.org>> wrote: > >>> > >>> So when a data format is described by a DFDL schema, I can generate > >>> equivalent Drill schema (TupleMetadata). This sc

Re: [apache/drill] WIP: Preliminary Review on adding Daffodil to Drill (PR #2836)

2023-10-15 Thread Paul Rogers
Hi Mike, Congrats on the PR. I'll take a look soon. You asked about initialization. Initialization is a bit tricky in a distributed system such as Drill. There are a number of things "initialization" could mean: * Global, one-time initialization (per Drillbit): Unlike Druid, Drill has no

Re: Drill TupleMetadata created from DFDL Schema - how do I inform Drill about it

2023-10-12 Thread Paul Rogers
etrieve it from a cache) > > What objects do I call, or what classes do I have to create to make this > Drill TupleMetadata available to Drill so it uses it in all the ways a > static Drill schema can be useful? > > I just need pointers to the code that illustrate how to do thi

Re: Drill expects pull parsing? Daffodil is event callbacks style

2023-10-11 Thread Paul Rogers
Mike, This is a complex question and has two answers. First, the standard enhanced vector framework (EVF) used by most readers assumes a "pull" model: read each record. This is where the next() comes in: readers just implement this to read the next record. But, the code under EVF works with a

Re: Question about Drill internal data representation for Daffodil tree infosets

2023-10-11 Thread Paul Rogers
Mike, Just to echo Charles, thanks for the work; sounds like you are making good progress. The question you asked is tricky. Charles is right, the type of the data structure is a map. The output you showed appears to be from the sqlline tool. If so, then it helps to understand that sqlline

Re: Question on Representing DFDL/XSD choice data for Drill (Unions required?)

2023-09-13 Thread Paul Rogers
Hi Mike, Looks like you are wrestling with two separate issues. The first is how to read the encoded data that you showed. In Drill, each data format generally needs its own reader. Drill's reader operator provides all the plumbing needed to handle multiple format readers, pack data into vectors,

Re: Discuss: JSON and XML and Daffodil - same Infoset, same Query should create same rowset?

2023-09-13 Thread Paul Rogers
Hi Mike, I believe I sent a detailed response to this. Did it get through? If not, I'll try sending it again... - Paul On Wed, Sep 13, 2023 at 6:44 AM Mike Beckerle wrote: > ... sound of crickets on a summer night . > > It would really help me if I could get a response to this inquiry, to

Re: Discuss: JSON and XML and Daffodil - same Infoset, same Query should create same rowset?

2023-08-25 Thread Paul Rogers
Hi Mike, You asked about how to work with nested data items. As noted in a previous email, this can be a bit tricky. Drill uses SQL, and SQL does not have good native support for structured data: it was designed in the 1970's for record oriented data (tuples). Several attempts were made to extend

Re: Discuss: JSON and XML and Daffodil - same Infoset, same Query should create same rowset?

2023-08-25 Thread Paul Rogers
Great progress, Mike! First, let's address the schema issue. As you've probably noticed, Drill's original notion was that data needed no schema: the data itself provides sufficient syntactic structure to let Drill infer schema. Also as you've noticed, this assumption turned out to be more

Re: Drill SQL questions - JSON context

2023-08-18 Thread Paul Rogers
Hi Mike, Good progress! There are a number of factors to consider. Let's work through them one by one. First, try the simplest possible query: SELECT * FROM If you are using the row set mechanism, grab the schema and print it. (My memory is hazy, but I do believe that there are methods and

Re: is there a way to provide inline array metadata to inform the xml_reader?

2023-08-14 Thread Paul Rogers
IIRC, the syntax for the "provided schema" for arrays is "ARRAY" such as "ARRAY". This works, however, only if the XML reader uses the (very complex) EVF framework and has a way to control parsing based on the data type (and to set the data type based on parsing). The JSON reader has such an

Re: UserBitShared.proto question

2023-08-08 Thread Paul Rogers
Unless something changed, Drill's build does not compile the .proto files. Instead, the files are generated manually, and checked into git, on those rare occasions that the API changes. I seem to recall that there are some instructions somewhere, but a quick search didn't reveal anything. - Paul

Re: drill tests not passing

2023-07-11 Thread Paul Rogers
Hi Mike, A quick glance at the log suggests a failure in the tests for the JSON reader, in the Mongo extended types. Drill's date/time support has historically been fragile. Some tests only work if your machine is set to use the UTC time zone (or Java is told to pretend that the time is UTC.) The

Re: Drill and Highly Hierarchical Data from Daffodil

2023-07-11 Thread Paul Rogers
Drill can internally handle scalars, arrays (AKA vectors) and maps (AKA tuples, structs). SQL, however, prefers to work with scalars: there is no good syntax to reach inside a complex object for, say, a WHERE condition without also projecting that item as a top-level scalar. The cool thing, for

[jira] [Created] (DRILL-8375) Incomplete support for non-projected complex vectors

2022-12-24 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8375: -- Summary: Incomplete support for non-projected complex vectors Key: DRILL-8375 URL: https://issues.apache.org/jira/browse/DRILL-8375 Project: Apache Drill Issue

Re: Small dataset query issue and the workaround we found

2022-08-30 Thread Paul Rogers
Hi All, As others have said, the only difference between plans for “small” and “large” queries is the queue size and memory. As I recall, those are spelled out in the docs. Ensure that there is sufficient memory for the slicing up done by the queue, and the query. Memory is allocated to

Re: [DISCUSS] Add schema support for the XML format

2022-04-06 Thread Paul Rogers
Hi Luoc, First, what poor soul is asked to deal with large amounts of XML in this day and age? I thought we were past the XML madness, except in Maven and Hadoop config files. XML is much like JSON, only worse. JSON at least has well-defined types that can be gleaned from JSON syntax. With

[jira] [Created] (DRILL-8185) EVF 2 doen't handle map arrays or nested maps

2022-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8185: -- Summary: EVF 2 doen't handle map arrays or nested maps Key: DRILL-8185 URL: https://issues.apache.org/jira/browse/DRILL-8185 Project: Apache Drill Issue Type

Re: [VOTE] Adopt the Drill Test Framework from MapR

2022-03-17 Thread Paul Rogers
Abhishek used to have that thing running like a charm. Great to see it getting attention again. +1 - Paul On Thu, Mar 17, 2022 at 2:03 AM James Turton wrote: > Hi dev community! > > Many of you need no introduction to the test framework developed by MapR > >

[jira] [Created] (DRILL-8159) Upgrade HTTPD, Text readers to use EVF3

2022-03-06 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8159: -- Summary: Upgrade HTTPD, Text readers to use EVF3 Key: DRILL-8159 URL: https://issues.apache.org/jira/browse/DRILL-8159 Project: Apache Drill Issue Type: New

Re: [DISCUSS] Some ideas for Drill 1.21

2022-02-09 Thread Paul Rogers
tion is to store metadata as data in a distributed data > store. > > > For static resources, that can scale very well. For highly dynamic > > > resources like conventional databases behind JDBC connections, you can > > > generally delegate metadata to that layer. Pe

Re: [DISCUSS] Some ideas for Drill 1.21

2022-02-06 Thread Paul Rogers
Hi All, Drill, like all open source projects, exists to serve those that use it. To that end, the best contributions come when some company needs a feature badly enough that it is worth the effort to develop and contribute a solution. That's pretty standard, as along as the contribution is

[jira] [Created] (DRILL-8124) Fix implicit file issue with EVF 2

2022-02-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8124: -- Summary: Fix implicit file issue with EVF 2 Key: DRILL-8124 URL: https://issues.apache.org/jira/browse/DRILL-8124 Project: Apache Drill Issue Type: New Feature

[jira] [Created] (DRILL-8123) Revise scan limit pushdown

2022-02-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8123: -- Summary: Revise scan limit pushdown Key: DRILL-8123 URL: https://issues.apache.org/jira/browse/DRILL-8123 Project: Apache Drill Issue Type: New Feature

[jira] [Created] (DRILL-8115) LIMIT pushdown into EVF

2022-01-28 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8115: -- Summary: LIMIT pushdown into EVF Key: DRILL-8115 URL: https://issues.apache.org/jira/browse/DRILL-8115 Project: Apache Drill Issue Type: New Feature

Re: [ANNOUNCE] James Turton as PMC Member

2022-01-24 Thread Paul Rogers
Congratulations James! - Paul On Mon, Jan 24, 2022 at 9:34 AM Charles Givre wrote: > The Project Management Committee (PMC) for Apache Drill is pleased to > announce that we have invited James Turton to join us as a PMC member of > the Drill project and he has accepted. Please join me in

Re: [ANNOUNCE] New Committer: PJ Fanning

2022-01-24 Thread Paul Rogers
Congratulations! - Paul On Mon, Jan 24, 2022 at 9:15 AM Charles Givre wrote: > The Project Management Committee (PMC) for Apache Drill is pleased to > announce that we have invited PJ Fanning to join us as a committer to the > Drill project. PJ is a committer and PMC member for the Apache POI

Re: [DISCUSS] Lombok - friend or foe?

2022-01-24 Thread Paul Rogers
;> > >>> I use both Eclipse and IDEA, but I use Eclipse more often. I have no > >>> objection to the use of Lombok, but suggest the following three points > : > >>> > >>> 1. Could we use Lombok only in `drill-contrib` module? > >>> &g

Re: [DISCUSS] Lombok - friend or foe?

2022-01-22 Thread Paul Rogers
Hi All, I look at any tool as a cost/benefit tradeoff. If Drill were a typical business app, with lots of "data objects", then the hassle of Lomboc might be a net win. However, the nature of Drill is that we have very few data objects. We have lots of Protobuf objects, or Jackson-serialized

Re: [DISCUSS] Drill 2 and plug-in organisation

2022-01-18 Thread Paul Rogers
dded Drill > with the storage plugin deployed to it, or am I wrong here? > > On 2022/01/18 00:32, Paul Rogers wrote: > > Hi Ted, > > > > Thanks for the explanation, makes sense. > > > > Ideally, the client side would be somewhat agnostic about the repo it >

Re: [DISCUSS] Drill 2 and plug-in organisation

2022-01-17 Thread Paul Rogers
the same "already > exists" benefit as does Maven. > > > > On Mon, Jan 17, 2022 at 1:30 PM Paul Rogers wrote: > >> Hi Ted, >> >> Well said. Just to be clear, I wasn't suggesting that we use >> Maven-the-build-tool to distribute plugins. Rather, I was

Re: [DISCUSS] Drill 2 and plug-in organisation

2022-01-17 Thread Paul Rogers
ty to fetch and install plugins itself without too much >> trouble, at least for Drill clusters with Internet access. >> "Sideloading" by downloading from Maven and copying manually would >> always remain possible. >> >> @Paul I'll try to get a lit

Re: [DISCUSS] Drill 2 and plug-in organisation

2022-01-14 Thread Paul Rogers
Hi All, James raises an important issue, I've noticed that it used to be easy to build and test Drill, now it is a struggle, because of the many odd external dependencies we have introduced. That acts as a big damper on contributions: none of us get paid enough to spend more time fighting builds

Re: [DISCUSS] Per User Access Controls

2022-01-13 Thread Paul Rogers
Hey All, Other members of the Hadoop Ecosystem rely on external systems to handle permissions: Ranger or Sentry. There is probably something different in the AWS world. As you look into security, you'll see that you need to maintain permissions on many entities: files, connections, etc. You need

Re: [DISCUSS] Restarting the Arrow Conversation

2022-01-04 Thread Paul Rogers
Hi Ted, I like where you're going with how to manage the discussion. Here's a trick that I saw someone do recently. The design/discussion as a PR. Comments are just code review comments, tagged to a specific line. The "er, never mind" aspect that Ted talks about is handled by pushing a new

Re: [DISCUSS] Restarting the Arrow Conversation

2022-01-03 Thread Paul Rogers
, and I'd hate to see it get lost. > -- C > > > On Jan 3, 2022, at 7:41 PM, Paul Rogers wrote: > > > > Hi All, > > > > Thanks Charles for dredging up that old discussion, your memory is better > > than mine! And, thanks Ted for that summary of MapR hist

Re: [DISCUSS] Restarting the Arrow Conversation

2022-01-03 Thread Paul Rogers
data > > between platforms without having to serialize/deserialize the data. From > my > > understanding, MapR did some research and didn't find a significant > > performance advantage and hence didn't really pursue the integration. The > > other side of it was that it wou

[jira] [Created] (DRILL-8102) Tests use significant space outside the drill directory

2022-01-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8102: -- Summary: Tests use significant space outside the drill directory Key: DRILL-8102 URL: https://issues.apache.org/jira/browse/DRILL-8102 Project: Apache Drill

[jira] [Created] (DRILL-8101) Resolve the TIMESTAMP madness

2022-01-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8101: -- Summary: Resolve the TIMESTAMP madness Key: DRILL-8101 URL: https://issues.apache.org/jira/browse/DRILL-8101 Project: Apache Drill Issue Type: Bug Affects

[jira] [Created] (DRILL-8100) JSON record writer does not convert Dril local timestamp to UTC

2022-01-01 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8100: -- Summary: JSON record writer does not convert Dril local timestamp to UTC Key: DRILL-8100 URL: https://issues.apache.org/jira/browse/DRILL-8100 Project: Apache Drill

[jira] [Created] (DRILL-8099) Parquet record writer does not convert Dril local timestamp to UTC

2021-12-31 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8099: -- Summary: Parquet record writer does not convert Dril local timestamp to UTC Key: DRILL-8099 URL: https://issues.apache.org/jira/browse/DRILL-8099 Project: Apache Drill

[jira] [Created] (DRILL-8087) {{TestNestedDateTimeTimestamp.testNestedDateTimeCTASExtendedJson}} assumes time zone

2021-12-19 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8087: -- Summary: {{TestNestedDateTimeTimestamp.testNestedDateTimeCTASExtendedJson}} assumes time zone Key: DRILL-8087 URL: https://issues.apache.org/jira/browse/DRILL-8087

[jira] [Created] (DRILL-8086) Convert the CSV (AKA "compliant text") reader to EVF V2

2021-12-19 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8086: -- Summary: Convert the CSV (AKA "compliant text") reader to EVF V2 Key: DRILL-8086 URL: https://issues.apache.org/jira/browse/DRILL-8086 Project: Ap

[jira] [Created] (DRILL-8085) EVF V2 support in the "Easy" format plugin

2021-12-19 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8085: -- Summary: EVF V2 support in the "Easy" format plugin Key: DRILL-8085 URL: https://issues.apache.org/jira/browse/DRILL-8085 Project: Apache Drill Issue

[jira] [Created] (DRILL-8084) Scan LIMIT pushdown fails across files

2021-12-19 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8084: -- Summary: Scan LIMIT pushdown fails across files Key: DRILL-8084 URL: https://issues.apache.org/jira/browse/DRILL-8084 Project: Apache Drill Issue Type: Bug

[jira] [Created] (DRILL-8083) HttpdLogBatchReader creates unnecessary empty maps

2021-12-19 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-8083: -- Summary: HttpdLogBatchReader creates unnecessary empty maps Key: DRILL-8083 URL: https://issues.apache.org/jira/browse/DRILL-8083 Project: Apache Drill Issue

Re: A new developer wiki begins!

2021-11-02 Thread Paul Rogers
: > Hi Charles > > When I first took this idea to Paul I proposed that we attribute > authorship but he declined that bit. We do have the Git history for the > wiki, and the lines shown for the last Git commit to affect a page are > quite visible in the wiki, e.g. > > > P

[jira] [Resolved] (DRILL-7325) Many operators do not set container record count

2021-04-25 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-7325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-7325. Resolution: Fixed A number of individual commits fixed problems found in each operator

[jira] [Resolved] (DRILL-6953) Merge row set-based JSON reader

2021-04-25 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-6953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-6953. Resolution: Fixed Resolved via  series of individual tickets. > Merge row set-based JSON rea

[jira] [Created] (DRILL-7789) Exchanges are slow on large systems & queries

2020-09-23 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7789: -- Summary: Exchanges are slow on large systems & queries Key: DRILL-7789 URL: https://issues.apache.org/jira/browse/DRILL-7789 Project: Apache Drill Issue

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-09-02 Thread Paul Rogers
Hi Abhishek, Downloaded the tar file, installed Drill, cleaned my ZK and poked around in the UI. As you noted, you've already run the thousands of unit tests and the test framework, so no point in trying to repeat that. Our tests, however, don't cover the UI much at all, so I clicked around on

[jira] [Created] (DRILL-7734) Revise the result set reader

2020-05-05 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7734: -- Summary: Revise the result set reader Key: DRILL-7734 URL: https://issues.apache.org/jira/browse/DRILL-7734 Project: Apache Drill Issue Type: Improvement

[jira] [Created] (DRILL-7733) Use streaming for REST JSON queries

2020-05-05 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7733: -- Summary: Use streaming for REST JSON queries Key: DRILL-7733 URL: https://issues.apache.org/jira/browse/DRILL-7733 Project: Apache Drill Issue Type: Improvement

[jira] [Created] (DRILL-7729) Use java.time in column accessors

2020-05-04 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7729: -- Summary: Use java.time in column accessors Key: DRILL-7729 URL: https://issues.apache.org/jira/browse/DRILL-7729 Project: Apache Drill Issue Type: Improvement

Re: Drill with No-SQL [was: Cannot Build Drill "exec/Java Execution Engine"]

2020-05-03 Thread Paul Rogers
2020 at 2:42 PM Paul Rogers wrote: > Hi Tug, > > Glad to hear from you again. Ted's summary is pretty good; here's a bit > more detail. > > > Presto is another alternative which seems to have gained the most traction > outside of the Cloud ecosystem on the one hand, and th

Drill with No-SQL [was: Cannot Build Drill "exec/Java Execution Engine"]

2020-05-03 Thread Paul Rogers
Hi Tug, Glad to hear from you again. Ted's summary is pretty good; here's a bit more detail. Presto is another alternative which seems to have gained the most traction outside of the Cloud ecosystem on the one hand, and the Cloudera/HortonWorks ecosystem on the other. Presto does, however,

[jira] [Created] (DRILL-7728) Drill SPI framework

2020-05-01 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7728: -- Summary: Drill SPI framework Key: DRILL-7728 URL: https://issues.apache.org/jira/browse/DRILL-7728 Project: Apache Drill Issue Type: Improvement Affects

[jira] [Created] (DRILL-7725) Updates to EVF2

2020-04-30 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7725: -- Summary: Updates to EVF2 Key: DRILL-7725 URL: https://issues.apache.org/jira/browse/DRILL-7725 Project: Apache Drill Issue Type: Improvement Affects

[jira] [Created] (DRILL-7724) Refactor metadata controller batch

2020-04-30 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7724: -- Summary: Refactor metadata controller batch Key: DRILL-7724 URL: https://issues.apache.org/jira/browse/DRILL-7724 Project: Apache Drill Issue Type: Improvement

[jira] [Created] (DRILL-7717) Support Mongo extended types in V2 JSON loader

2020-04-26 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7717: -- Summary: Support Mongo extended types in V2 JSON loader Key: DRILL-7717 URL: https://issues.apache.org/jira/browse/DRILL-7717 Project: Apache Drill Issue Type

Re: [QUESTION]: httpclient dependency

2020-04-23 Thread Paul Rogers
Hi All, I think there may be a bit of confusion. It may be true that some of Drill's dependencies now use the newer version of the library httpcomponents:httpclient. However, it looks like ES directly depends on the older flavor. We have pom file entries which exclude that old version. As a

Format plugin configs should be immutable

2020-04-19 Thread Paul Rogers
Hi All, This is a quick note for any of you who create or work on format plugins in Drill. You will see that all existing plugins have been modified so that config properties are immutable. This note will explain why. Drill uses storage and format plugins as keys into an internal map. (That's

[jira] [Created] (DRILL-7711) Add data path, parameter filter pushdown to HTTP plugin

2020-04-18 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7711: -- Summary: Add data path, parameter filter pushdown to HTTP plugin Key: DRILL-7711 URL: https://issues.apache.org/jira/browse/DRILL-7711 Project: Apache Drill

[jira] [Created] (DRILL-7709) CTAS as CSV creates files which the "csv" plugin can't read

2020-04-17 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7709: -- Summary: CTAS as CSV creates files which the "csv" plugin can't read Key: DRILL-7709 URL: https://issues.apache.org/jira/browse/DRILL-7709 Project: Ap

Re: [DISCUSS]: Masking Creds in Query Plans

2020-04-17 Thread Paul Rogers
Hi Charles, Excellent point. The problem is deeper. Drill serializes plugin configs in the query plan which it sends to each worker (Drillbit.) Why? To avoid race conditions if you start a query then change the plugin config and thus different nodes see different versions of the config.

[jira] [Created] (DRILL-7708) Downgrade maven from 3.6.3 to 3.6.0

2020-04-17 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7708: -- Summary: Downgrade maven from 3.6.3 to 3.6.0 Key: DRILL-7708 URL: https://issues.apache.org/jira/browse/DRILL-7708 Project: Apache Drill Issue Type: Bug

Re: [NOTICE] Maven 3.6.3

2020-04-17 Thread Paul Rogers
Hi Arina, Thanks for keeping us up to date! As it turns out, I use Ubuntu (Linux Mint) for development. Maven is installed as a package using apt-get. Packages can lag behind a bit. The latest maven available via apt-get is 3.6.0. It is a nuisance to install a new version outside the package

[jira] [Resolved] (DRILL-7655) Add Default Schema text box to Edit Query page in query profile

2020-04-15 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-7655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-7655. Resolution: Fixed Fixed as part of PR #2052. > Add Default Schema text box to Edit Query p

[jira] [Created] (DRILL-7703) Support for 3+D arrays in EVF JSON loader

2020-04-15 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7703: -- Summary: Support for 3+D arrays in EVF JSON loader Key: DRILL-7703 URL: https://issues.apache.org/jira/browse/DRILL-7703 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-7701) EVF V2 Scan Framework

2020-04-14 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7701: -- Summary: EVF V2 Scan Framework Key: DRILL-7701 URL: https://issues.apache.org/jira/browse/DRILL-7701 Project: Apache Drill Issue Type: Improvement Affects

[jira] [Resolved] (DRILL-7685) Case statement marking column as required in parquet metadata

2020-04-12 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-7685. Resolution: Cannot Reproduce Tested in Drill 1.18 (snapshot) and found that the provided query

[jira] [Created] (DRILL-7697) Revise query editor in profile page of web UI

2020-04-12 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7697: -- Summary: Revise query editor in profile page of web UI Key: DRILL-7697 URL: https://issues.apache.org/jira/browse/DRILL-7697 Project: Apache Drill Issue Type

[jira] [Resolved] (DRILL-6672) Drill table functions cannot handle "setFoo" accessors

2020-04-11 Thread Paul Rogers (Jira)
[ https://issues.apache.org/jira/browse/DRILL-6672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers resolved DRILL-6672. Resolution: Not A Problem Storage and format plugins must be immutable since their entire values

[jira] [Created] (DRILL-7696) EVF v2 Scan Schema Resolution

2020-04-09 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7696: -- Summary: EVF v2 Scan Schema Resolution Key: DRILL-7696 URL: https://issues.apache.org/jira/browse/DRILL-7696 Project: Apache Drill Issue Type: Improvement

[jira] [Created] (DRILL-7690) Display (major) operators in fragment title bar in Web UI

2020-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7690: -- Summary: Display (major) operators in fragment title bar in Web UI Key: DRILL-7690 URL: https://issues.apache.org/jira/browse/DRILL-7690 Project: Apache Drill

[jira] [Created] (DRILL-7689) Do not save profiles for trivial queries

2020-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7689: -- Summary: Do not save profiles for trivial queries Key: DRILL-7689 URL: https://issues.apache.org/jira/browse/DRILL-7689 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-7688) Provide web console option to see non-default options

2020-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7688: -- Summary: Provide web console option to see non-default options Key: DRILL-7688 URL: https://issues.apache.org/jira/browse/DRILL-7688 Project: Apache Drill Issue

[jira] [Created] (DRILL-7687) Inaccurate memory estimates in hash join

2020-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7687: -- Summary: Inaccurate memory estimates in hash join Key: DRILL-7687 URL: https://issues.apache.org/jira/browse/DRILL-7687 Project: Apache Drill Issue Type: Bug

[jira] [Created] (DRILL-7686) Excessive memory use in partition sender

2020-04-02 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7686: -- Summary: Excessive memory use in partition sender Key: DRILL-7686 URL: https://issues.apache.org/jira/browse/DRILL-7686 Project: Apache Drill Issue Type: Bug

[jira] [Created] (DRILL-7683) Add "message parsing" to new JSON loader

2020-03-31 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7683: -- Summary: Add "message parsing" to new JSON loader Key: DRILL-7683 URL: https://issues.apache.org/jira/browse/DRILL-7683 Project: Apache Drill

[jira] [Created] (DRILL-7680) Move UDF projects before plugins in contrib

2020-03-31 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7680: -- Summary: Move UDF projects before plugins in contrib Key: DRILL-7680 URL: https://issues.apache.org/jira/browse/DRILL-7680 Project: Apache Drill Issue Type

Re: Excessive Memory Use in Parquet Files (From Drill Slack Channel)

2020-03-24 Thread Paul Rogers
rg  8:43 AM Sure 8:43 I'll get to it cgivre  8:44 AM I'd like for Paul Rogers to see this as I think he was the author of some of this. Idan Sheinberg  8:44 AM Hmm. I'll keep that in mind cgivre  8:47 AM We've been refactoring some of the complex readers as well, so its possible that is caused this, but

[jira] [Created] (DRILL-7658) Vector allocateNew() has poor error reporting

2020-03-24 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7658: -- Summary: Vector allocateNew() has poor error reporting Key: DRILL-7658 URL: https://issues.apache.org/jira/browse/DRILL-7658 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-7640) EVF-based JSON Loader

2020-03-12 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7640: -- Summary: EVF-based JSON Loader Key: DRILL-7640 URL: https://issues.apache.org/jira/browse/DRILL-7640 Project: Apache Drill Issue Type: Improvement Affects

[jira] [Created] (DRILL-7634) Rollup of code cleanup changes

2020-03-10 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7634: -- Summary: Rollup of code cleanup changes Key: DRILL-7634 URL: https://issues.apache.org/jira/browse/DRILL-7634 Project: Apache Drill Issue Type: Improvement

[jira] [Created] (DRILL-7633) Fixes for union and repeated list accessors

2020-03-09 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7633: -- Summary: Fixes for union and repeated list accessors Key: DRILL-7633 URL: https://issues.apache.org/jira/browse/DRILL-7633 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-7632) Improve user exception formatting

2020-03-09 Thread Paul Rogers (Jira)
Paul Rogers created DRILL-7632: -- Summary: Improve user exception formatting Key: DRILL-7632 URL: https://issues.apache.org/jira/browse/DRILL-7632 Project: Apache Drill Issue Type: Improvement

  1   2   3   4   5   6   7   8   9   10   >