GitHub user JDrit opened a pull request:

    https://github.com/apache/spark/pull/7743

    [SPARK-8005][SQL] Input file name

    Users can now get the file name of the partition being read in. A thread 
local variable is in `SQLNewHadoopRDD` and is set when the partition is 
computed. `SQLNewHadoopRDD` is moved to core so that the catalyst package can 
reach it.
    
    This supports:
    
    `df.select(inputFileName())`
    
    and
    
    `sqlContext.sql("select input_file_name from table")`

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/JDrit/spark input_file_name

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/7743.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #7743
    
----
commit c7f7b5a6ca12934c212b5e808a68014eb96db6e3
Author: Joseph Batchik <[email protected]>
Date:   2015-07-29T03:46:03Z

    addeding input file name to Spark SQL

commit 75313f5c47ccfbcfd78c3f9edd2cf75771da1766
Author: Joseph Batchik <[email protected]>
Date:   2015-07-29T03:54:30Z

    small fixes

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to