[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-22 Thread Siying Dong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siying Dong updated HIVE-2296:
--

Resolution: Fixed
Status: Resolved  (was: Patch Available)

committed. Thanks Franklin!

 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Fix For: 0.8.0

 Attachments: hive-2296.1.patch, hive-2296.2.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 This causes corrupted output when doing a SELECT * on the table.
 Correct behavior should be to pick a valid filename such as:
 00_0_copy_1.gz

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-21 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Fix Version/s: 0.8.0
   Status: Patch Available  (was: Open)

 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Fix For: 0.8.0

 Attachments: hive-2296.1.patch, hive-2296.2.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 This causes corrupted output when doing a SELECT * on the table.
 Correct behavior should be to pick a valid filename such as:
 00_0_copy_1.gz

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Affects Version/s: 0.8.0

 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu

 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0
 00_0.gz_copy_1
 Correct behavior should be to pick a valid filename

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Description: 
When INSERT INTO is run on a table with compressed output 
(hive.exec.compress.output=true) and existing files in the table, it may copy 
the new files in bad file names:

Before INSERT INTO:
00_0.gz

After INSERT INTO:
00_0.gz
00_0.gz_copy_1

Correct behavior should be to pick a valid filename

  was:
When INSERT INTO is run on a table with compressed output 
(hive.exec.compress.output=true) and existing files in the table, it may copy 
the new files in bad file names:

Before INSERT INTO:
00_0.gz

After INSERT INTO:
00_0
00_0.gz_copy_1

Correct behavior should be to pick a valid filename


 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu

 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 Correct behavior should be to pick a valid filename

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Attachment: hive-2296.1.patch

 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Attachments: hive-2296.1.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 Correct behavior should be to pick a valid filename

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Description: 
When INSERT INTO is run on a table with compressed output 
(hive.exec.compress.output=true) and existing files in the table, it may copy 
the new files in bad file names:

Before INSERT INTO:
00_0.gz

After INSERT INTO:
00_0.gz
00_0.gz_copy_1

This causes corrupted output when doing a SELECT * on the table.
Correct behavior should be to pick a valid filename such as:
00_0_copy_1.gz

  was:
When INSERT INTO is run on a table with compressed output 
(hive.exec.compress.output=true) and existing files in the table, it may copy 
the new files in bad file names:

Before INSERT INTO:
00_0.gz

After INSERT INTO:
00_0.gz
00_0.gz_copy_1

Correct behavior should be to pick a valid filename


 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Attachments: hive-2296.1.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 This causes corrupted output when doing a SELECT * on the table.
 Correct behavior should be to pick a valid filename such as:
 00_0_copy_1.gz

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread Franklin Hu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Franklin Hu updated HIVE-2296:
--

Attachment: hive-2296.2.patch

add unit test

 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Attachments: hive-2296.1.patch, hive-2296.2.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 This causes corrupted output when doing a SELECT * on the table.
 Correct behavior should be to pick a valid filename such as:
 00_0_copy_1.gz

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira