Filtering data files in directories

Ludovic Claude Tue, 10 May 2016 13:44:18 -0700

Hello,

I have a relatively well-organised repository of files containing a mix of medical images and CSV files produced from those images in a neuroscience lab.

The CSV files contain some interesting data that I would like to aggregate with Drill, but the naming convention is quite special: each filename contains an ID, then a prefix or suffix identifying the category of the file, and everything is nested in a folder structure organised by subject, for example ID1/processing1/ID1-mx.csv.

How can I use Drill to filter out the files that I do not need and keep only the files containing my data?


For example, I would like to write something like

SELECT * FROM dfs.data.`/` WHERE dir1 = 'processing1' AND file LIKE '%-mx.csv';
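
To make the intended selection concrete, here is a minimal Python sketch of the same filter, written outside Drill. It only illustrates which paths I want to keep and is not a Drill solution. Only ID1/processing1/ID1-mx.csv comes from my actual layout; the other paths and the helper name `wanted` are made up for illustration.

```python
from pathlib import PurePosixPath


def wanted(path: str, step: str = "processing1", suffix: str = "-mx.csv") -> bool:
    """Return True if path looks like <subject>/<step>/<name><suffix>."""
    parts = PurePosixPath(path).parts
    # Expect exactly three components: subject folder, processing folder, file name.
    return len(parts) == 3 and parts[1] == step and parts[2].endswith(suffix)


# Hypothetical sample paths; only the first one is taken from the example above.
paths = [
    "ID1/processing1/ID1-mx.csv",
    "ID1/processing1/ID1-scan.nii",
    "ID1/processing2/ID1-mx.csv",
    "ID2/processing1/ID2-mx.csv",
]

selected = [p for p in paths if wanted(p)]
print(selected)
```

In this sketch the second path component plays the role of dir1 in the query above, and the suffix test plays the role of the LIKE '%-mx.csv' condition.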



Thanks
