nandini57 edited a comment on issue #1582:
URL: https://github.com/apache/incubator-hudi/issues/1582#issuecomment-624789029


   Probably switching to parquet format instead of hudi and doing a 
spark.read.parquet(partitionpath).dropduplicates where commit_time= X is an 
option? The following works if i want to go back to commit X and have a view of 
data.However,the same with hudi format doesn't provide me the right view as of 
commit X
   
    def audit(spark: SparkSession, partitionPath: String, tablePath: String, 
commitTime: String): Unit = {
       val hoodieROViewDF = spark.read.option("inferSchema", 
true).parquet(tablePath +  "/" + partitionPath)
       hoodieROViewDF.createOrReplaceTempView("hoodie_ro")
       spark.sql("select * from hoodie_ro where _hoodie_commit_time =" + 
commitTime).dropDuplicates().show()
     }
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to