You can use CREATE EXTERNAL TABLE... LOCATION. See http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL and examples thereof. When you LOAD DATA INPATH, the directory gets moved to the Hive warehouse dir; it does not get modified.
On Wed, Dec 15, 2010 at 3:33 PM, Mark <[email protected]> wrote: > Can someone explain what actually happens when you create a table and import > data into using "LOAD DATA INPATH..." > > I noticed that when I load the data from files already existing in HDFS that > it actually removes the original file from its location and moves it under > the /user/hive directory. Is there anyway I can prevent this from happening > or is this just the way things work? At this point is the file modified in > anyway? I have some other Hadoop jobs that rely on this data. Should I just > update those jobs to operate on the data within these directories? Thanks >
