[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

HyukjinKwon Thu, 25 Oct 2018 18:07:44 -0700

Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22814#discussion_r228380951
  
    --- Diff: 
external/avro/src/main/scala/org/apache/spark/sql/avro/package.scala ---
    @@ -31,10 +32,32 @@ package object avro {
        * @since 2.4.0
        */
       @Experimental
    -  def from_avro(data: Column, jsonFormatSchema: String): Column = {
    -    new Column(AvroDataToCatalyst(data.expr, jsonFormatSchema))
    +  def from_avro(
    +      data: Column,
    +      jsonFormatSchema: String): Column = {
    +    new Column(AvroDataToCatalyst(data.expr, jsonFormatSchema, Map.empty))
    +  }
    +
    +  /**
    +   * Converts a binary column of avro format into its corresponding 
catalyst value. The specified
    +   * schema must match the read data, otherwise the behavior is undefined: 
it may fail or return
    +   * arbitrary result.
    +   *
    +   * @param data the binary column.
    +   * @param jsonFormatSchema the avro schema in JSON string format.
    +   * @param options options to control how the Avro record is parsed.
    +   *
    +   * @since 3.0.0
    +   */
    +  @Experimental
    +  def from_avro(
    +      data: Column,
    +      jsonFormatSchema: String,
    +      options: Map[String, String]): Column = {
    --- End diff --
    
    One thing I am less sure is tho, what do you think about we just expose one 
Java specific one? This will only be usable in Scala side.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

Reply via email to