[ 
https://issues.apache.org/jira/browse/HAWQ-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15960282#comment-15960282
 ] 

Lili Ma edited comment on HAWQ-1426 at 4/7/17 5:12 AM:
-------------------------------------------------------

RCA:
When hawq extract tries to find the HDFS files information, it wrongly treated 
the pg_aoseg.pg_aoseg_$relid as the catalog table for storing those 
information. 
Here is the correct processing steps: 
1. Find the directory on HDFS which stores the actual data for the table. This 
can be achieved by following the column "relfilenode" in pg_class table. 
2. Find the detailed file name for the table under above directory. This can be 
achieved by searching the catalog table pg_aoseg.pg_aoseg(paqseg)_$. The table 
name suffix is neither $relid nor $relfilenode under some circumstances. We 
should get it by referring the column "segrelid" in catalog table 
"pg_appendonly", and then looking up the table "pg_class" to get the accurate 
table name.


was (Author: lilima):
RCA:
When hawq extract tries to find the HDFS files information, it wrongly treated 
the pg_aoseg.pg_aoseg_$relid as the catalog table for storing those 
information. 
When determining the file path of a table, hawq extract should follow below 
steps:
1. Find the directory on HDFS which stores the actual data for the table. This 
can be achieved by following the column "relfilenode" in pg_class table. 
2. Find the detailed file name for the table under above directory. This can be 
achieved by searching the catalog table pg_aoseg.pg_aoseg(paqseg)_$. The table 
name suffix is neither $relid nor $relfilenode under some circumstances. We 
should get it by referring the column "segrelid" in catalog table 
"pg_appendonly", and then looking up the table "pg_class" to get the accurate 
table name.

> hawq extract meets error after the table was reorganized.
> ---------------------------------------------------------
>
>                 Key: HAWQ-1426
>                 URL: https://issues.apache.org/jira/browse/HAWQ-1426
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: Command Line Tools
>            Reporter: Lili Ma
>            Assignee: Ed Espino
>             Fix For: 2.3.0.0-incubating
>
>
> After one table is reorganized, hawq extract the table will meet error.
> Reproduce Steps:
> 1. create an AO table
> 2. insert into several records into it
> 3. Get the table reorganized.  "alter table a set with (reorganize=true);"
> 4. run hawq extract, error thrown out.
> For the bug fix, we should also guarantee that hawq extract should work if 
> the table is truncated and re-inserted.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to