[GitHub] spark pull request: [SPARK-4361][Doc] Add more docs for Hadoop Con...

andrewor14 Fri, 06 Feb 2015 11:26:17 -0800

Github user andrewor14 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/3225#discussion_r24264861
  
    --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala ---
    @@ -630,7 +634,10 @@ class SparkContext(config: SparkConf) extends 
SparkStatusAPI with Logging {
        * necessary info (e.g. file name for a filesystem-based dataset, table 
name for HyperTable),
        * using the older MapReduce API (`org.apache.hadoop.mapred`).
        *
    -   * @param conf JobConf for setting up the dataset
    +   * @param conf JobConf for setting up the dataset. Note: This will be 
put into a Broadcast.
    --- End diff --
    
    Nice find. It seems perfectly reasonable from the user's perspective to 
just save `sc.hadoopConfiguration` into a val and use it for many things. 
That's probably what I would have done if I didn't know about the nuances here.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: [SPARK-4361][Doc] Add more docs for Hadoop Con...

Reply via email to