Hello Sebastian, By default Drill sets the field 'foo' to null for the files that don't contain it. I am of the opinion that the condition where foo = 'bar' should result in false for all those files which don't contain the field. Please can you send across the queries which you have run and the observed result.
Just off the top of my head, some query like the below one might work select file_name from dfs.`/bla/*/*` where foo != null. --- You might want to remove duplicate entries. (of course this also results in the rows which contain the field and are null). Hope this helps. Thanks On Fri, Sep 13, 2019 at 10:53 PM Sebastian Fischmeister < [email protected]> wrote: > Hi, > > When searching multiple directories, drill only searches fields that are > common to all files (see the json data model). Is there a way to query a > directory and list all files that contain a certain field? > > In other words, I would like to use the workaround in this way: > > select * from (select fqn from dfs.`/bla/*/*` where foo exists) where foo > = 'bar' > > Or is there another way to do this? I dynamically get more files, so > finding the files should be included in the query. > > An alternative would be to execute the query such that it sets the field > 'foo' to null for all files that don't contain it. However, I don't know > how to execute this. > > Thanks, > Sebastian >
