Hi Mike, 
I hope all is well.  I need to apologize as I grossly overestimated my 
available free time to assist with the DFDL / Drill integration.  I had a 
thought which I wanted to propose.

My thinking is that we should complete the integration in two phases:

Phase 1:  
For phase 1, I propose that we merge the work that you’ve already done.  We’d 
have to make sure that the DFDL files are accessible from the class path.  This 
isn’t really a great solution, but it is just to get the pieces in so we can 
work on phase 2.  I don’t like seeing good work languishing in the PR queue and 
getting stale.  To complete phase 1, all we’d really have to do is get the unit 
tests working.  

Phase 2:
The remaining issue revolves around making the DFDL files accessible to Drill 
and also so that a user can easily add or remove files.  For this we have a 
solution: DRILL-4726[1] which provides dynamic UDF support.  Basically what I’m 
proposing is that we duplicate the components of this PR for Drill.  The end 
result would be that a user could copy the UDF files to a staging directory.  
Then the user would run a command like:

CREATE DAFFODIL SCHEMA xxxx USING JAR yyyyy 

When the user does that, the file would be propagated to all the Drill nodes.  
Implementing this feature would really involve a lot of duplicating with slight 
mods from that pull request.  What do you think?
Best,
— C



[1]: https://github.com/apache/drill/pull/574



Reply via email to