[ https://issues.apache.org/jira/browse/HIVE-467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Carl Steinbach updated HIVE-467: -------------------------------- Fix Version/s: 0.4.0 > Scratch data location should be on different filesystems for different types > of intermediate data > ------------------------------------------------------------------------------------------------- > > Key: HIVE-467 > URL: https://issues.apache.org/jira/browse/HIVE-467 > Project: Hadoop Hive > Issue Type: Bug > Components: Query Processor > Environment: S3/EC2 > Reporter: Joydeep Sen Sarma > Assignee: Joydeep Sen Sarma > Fix For: 0.4.0 > > Attachments: hive-467.3.patch, hive-467.4.patch, hive-467.5.patch, > hive-467.6.patch, hive-467.patch.1, hive-467.patch.2 > > > Currently Hive uses the same scratch directory/path for all sorts of > temporary and intermediate data. This is problematic: > 1. Temporary location for writing out DDL output should just be temp file on > local file system. This divorces the dependence of metadata and browsing > operations on a functioning hadoop cluster. > 2. Temporary location of intermediate map-reduce data should be the default > file system (which is typically the hdfs instance on the compute cluster) > 3. Temporary location for data that needs to be 'moved' into tables should be > on the same file system as the table's location (table's location may not be > same as hdfs instance of processing cluster). > ie. - local storage, map-reduce intermediate storage and table storage should > be distinguished. Without this distinction - using hive on environments like > S3/EC2 causes problems. In such an environment - i would like to be able to: > - do metadata operations without a provisioned hadoop cluster (using data > stored in S3 and metastore on local disk) > - attach to a provisioned hadoop cluster and run queries > - store data back in tables that are created over s3 file system -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.