[ 
https://issues.apache.org/jira/browse/IMPALA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16534184#comment-16534184
 ] 

Gevorg commented on IMPALA-801:
-------------------------------

This would be really nice to have, asĀ others already mentioned file names may 
contain metadata which we'd like to use in subsequent operations.

> Add function or virtual column for file name
> --------------------------------------------
>
>                 Key: IMPALA-801
>                 URL: https://issues.apache.org/jira/browse/IMPALA-801
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Catalog
>    Affects Versions: Impala 1.2.3
>            Reporter: Udai Kiran Potluri
>            Priority: Minor
>              Labels: built-in-function, ramp-up
>
> Hive can list the data files in a table. For eg the following query lists all 
> the data files for the table or partition:
> {noformat}
> select INPUT__FILE__NAME, count(*) from <table_name> where dt='20140210' 
> group by INPUT__FILE__NAME;
> {noformat}
> This has two advantages over the existing "show files" functionality:
> * The output can be used in arbitrary SQL statements.
> * You can see which record came from which file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to