[ 
https://issues.apache.org/jira/browse/HIVE-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13861962#comment-13861962
 ] 

Lefty Leverenz commented on HIVE-5795:
--------------------------------------

Got it, thanks [~shuainie].  One doc question:  can "skip.footer.line.count" 
and "skip.header.line.count" be changed, or specified for the first time, with 
"ALTER TABLE tbl SET TBLPROPERTIES" and if so would any problems ensue?  (Hm, 
that's two or three questions.  Here's another:  can the values vary by 
partition?)

[~thejas], a followup jira isn't needed to get the doc task done, because it's 
already on my to-do list.  I'll post a comment here when the doc is ready for 
review.

TL;DR:  This jira has a doc release note, so that covers the record-keeping 
requirement.  The new config parameter and table properties are named here, so 
search capability is covered.  The only question is whether we want all doc 
tasks to have separate jiras.  I don't see any immediate advantage to that 
policy although we might want to move in that direction eventually.

> Hive should be able to skip header and footer rows when reading data file for 
> a table
> -------------------------------------------------------------------------------------
>
>                 Key: HIVE-5795
>                 URL: https://issues.apache.org/jira/browse/HIVE-5795
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Shuaishuai Nie
>            Assignee: Shuaishuai Nie
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5795.1.patch, HIVE-5795.2.patch, HIVE-5795.3.patch, 
> HIVE-5795.4.patch, HIVE-5795.5.patch
>
>
> Hive should be able to skip header and footer lines when reading data file 
> from table. In this way, user don't need to processing data which generated 
> by other application with a header or footer and directly use the file for 
> table operations.
> To implement this, the idea is adding new properties in table descriptions to 
> define the number of lines in header and footer and skip them when reading 
> the record from record reader. An DDL example for creating a table with 
> header and footer should be like this:
> {code}
> Create external table testtable (name string, message string) row format 
> delimited fields terminated by '\t' lines terminated by '\n' location 
> '/testtable' tblproperties ("skip.header.line.count"="1", 
> "skip.footer.line.count"="2");
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to