For the pattern part, there is a JIRA issue open, but from the comment thread, 
I'm not sure where we are with it:

https://issues.apache.org/jira/browse/HIVE-951

JVS
________________________________________
From: Ashish Thusoo [[email protected]]
Sent: Wednesday, May 26, 2010 11:03 AM
To: [email protected]
Subject: RE: Query HDFS files without using LOAD (move)

You could probably use external tables. CREATE EXTERNAL TABLE lets you 
create tables over HDFS files, but I do not think it accepts file patterns or 
regexes. If all the files are created within one directory, you can point the 
external table at that directory, and querying the table will then 
automatically read all the files in it. Are your files in a single 
directory or are they spread out?
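
As a rough sketch of the external-table approach (the table names, columns, 
delimiter, and paths below are all hypothetical and would need to match the 
real data), something along these lines should leave the files where they are. 
The second table assumes each day's new files land in their own directory, 
which may not be the case here; if it is, registering each directory as a 
partition is one way to restrict a query to the new data without a file 
pattern.

```sql
-- Hypothetical schema and path: adjust columns, delimiter, and LOCATION.
CREATE EXTERNAL TABLE app_logs (
  event_time STRING,
  message    STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE
LOCATION '/data/shared/app_logs';

-- Reads every file currently under the directory; nothing is moved or copied.
SELECT COUNT(1) FROM app_logs;

-- Variant for daily increments, assuming one directory per day (an assumption).
CREATE EXTERNAL TABLE app_logs_daily (
  event_time STRING,
  message    STRING
)
PARTITIONED BY (dt STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE
LOCATION '/data/shared/app_logs_daily';

-- Register one day's directory as a partition; the files stay in place.
ALTER TABLE app_logs_daily ADD PARTITION (dt = '2010-05-26')
LOCATION '/data/shared/app_logs_by_day/2010-05-26';

-- Only the files in that day's directory are read.
SELECT COUNT(1) FROM app_logs_daily WHERE dt = '2010-05-26';
```

Dropping an external table removes only the metadata, so the other 
applications using those files should be unaffected.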

http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL#Create_Table

Ashish

-----Original Message-----
From: Karthik [mailto:[email protected]]
Sent: Wednesday, May 26, 2010 10:45 AM
To: [email protected]
Subject: Query HDFS files without using LOAD (move)

Is there a way to specify a list of files (or a file pattern / regex) 
from an HDFS location other than the Hive warehouse as a parameter to a Hive 
query?  I have a bunch of files that are used by other applications as well, 
and I need to query them with Hive too, so I do not want to use LOAD and move 
those files from their original location into the Hive warehouse.

My queries are on incremental data (new files that are added on a daily basis) 
and need not read the full list of files in a folder, so I need to pass a list 
of files or a pattern, something like a file filter, to the query.

Please suggest.

- KK.
