Github user nico-pappagianis commented on a diff in the pull request:

    https://github.com/apache/phoenix/pull/221#discussion_r87291460
  
    --- Diff: 
phoenix-spark/src/main/scala/org/apache/phoenix/spark/ProductRDDFunctions.scala 
---
    @@ -16,19 +16,20 @@ package org.apache.phoenix.spark
     import org.apache.hadoop.conf.Configuration
     import org.apache.hadoop.io.NullWritable
     import org.apache.phoenix.mapreduce.PhoenixOutputFormat
    -import 
org.apache.phoenix.mapreduce.util.{ColumnInfoToStringEncoderDecoder, 
PhoenixConfigurationUtil}
    +import org.apache.phoenix.mapreduce.util.PhoenixConfigurationUtil
     import org.apache.spark.Logging
     import org.apache.spark.rdd.RDD
    +
     import scala.collection.JavaConversions._
     
     class ProductRDDFunctions[A <: Product](data: RDD[A]) extends Logging with 
Serializable {
     
       def saveToPhoenix(tableName: String, cols: Seq[String],
    -                    conf: Configuration = new Configuration, zkUrl: 
Option[String] = None)
    +                    conf: Configuration = new Configuration, zkUrl: 
Option[String] = None, tenantId: Option[String] = None)
    --- End diff --
    
    Looking at this closer it appears the only way to pass the tenantId to 
PairRDDFunctions.saveAsNewAPIHadoopFile is through the Configuration parameter. 
    
    I was able to avoid setting TenantId in the partitionConfig by overloading 
getUpsertColumnMetadataList to take in a Properties object. However 
PairRDDFunctions is part of spark-core, so I don't know if we want to make a 
change to spark-core so we can avoid setting the tenantId in the outputConfig. 
    
    Let me know what you think.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to