Kailash Gupta created SPARK-26379:
-------------------------------------

             Summary: Structured Streaming - Exception on adding column to 
Dataset
                 Key: SPARK-26379
                 URL: https://issues.apache.org/jira/browse/SPARK-26379
             Project: Spark
          Issue Type: Bug
          Components: Structured Streaming
    Affects Versions: 2.3.0
            Reporter: Kailash Gupta


While using withColumn to add a column to a structured streaming Dataset, I am 
getting following exception: 

org.apache.spark.sql.catalyst.analysis.UnresolvedException: Invalid call to 
dataType on unresolved object, tree: 'timestamp

Following is sample code
{code:java}
final String path = "path_to_input_directory";

final StructType schema = new StructType(new StructField[] { new 
StructField("word", StringType, false, Metadata.empty()), new 
StructField("count", DataTypes.IntegerType, false, Metadata.empty()) });

SparkSession sparkSession = 
SparkSession.builder().appName("StructuredStreamingIssue").master("local").getOrCreate();
Dataset<Row> words = sparkSession.readStream().option("sep", 
",").schema(schema).csv(path);

Dataset<Row> wordsWithTimestamp = words.withColumn("timestamp", 
functions.current_timestamp());

// wordsWithTimestamp.explain(true);

StreamingQuery query = 
wordsWithTimestamp.writeStream().outputMode("update").option("truncate", 
"false").format("console").trigger(Trigger.ProcessingTime("2 seconds")).start();

query.awaitTermination();{code}
Following are the contents of the file present at _path_
{code:java}
a,2
c,4
d,2
r,1
t,9
{code}
This seems working with 2.2.0 release, but not with 2.3.0 and 2.4.0



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to