[GitHub] spark pull request #21805: [SPARK-24850][SQL] fix str representation of Cach...

gatorsmile Thu, 19 Jul 2018 23:06:01 -0700

Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21805#discussion_r203945605
  
    --- Diff: 
sql/core/src/test/scala/org/apache/spark/sql/DatasetCacheSuite.scala ---
    @@ -206,4 +206,19 @@ class DatasetCacheSuite extends QueryTest with 
SharedSQLContext with TimeLimits
         // first time use, load cache
         checkDataset(df5, Row(10))
       }
    +
    +  test("SPARK-24850 InMemoryRelation string representation does not 
include cached plan") {
    +    val dummyQueryExecution = spark.range(0, 1).toDF().queryExecution
    +    val inMemoryRelation = InMemoryRelation(
    +      true,
    +      1000,
    +      StorageLevel.MEMORY_ONLY,
    +      dummyQueryExecution.sparkPlan,
    +      Some("test-relation"),
    +      dummyQueryExecution.logical)
    +
    +    
assert(!inMemoryRelation.simpleString.contains(dummyQueryExecution.sparkPlan.toString))
    +    assert(inMemoryRelation.simpleString.contains(
    +      "CachedRDDBuilder(true, 1000, StorageLevel(memory, deserialized, 1 
replicas))"))
    --- End diff --
    
    `true` and `1000` look confusing to end users. Can we improve it?



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #21805: [SPARK-24850][SQL] fix str representation of Cach...

Reply via email to