[GitHub] spark pull request: [SPARK-7155] [CORE] Allow newAPIHadoopFile to ...

srowen Mon, 27 Apr 2015 04:33:10 -0700

Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5708#discussion_r29138679
  
    --- Diff: core/src/test/scala/org/apache/spark/SparkContextSuite.scala ---
    @@ -213,4 +215,78 @@ class SparkContextSuite extends FunSuite with LocalSparkContext {
           sc.stop()
         }
       }
    +
    +  test("Comma separated paths for newAPIHadoopFile/wholeTextFiles/binaryFiles (SPARK-7155)") {
    +    // Regression test for SPARK-7155
    +    // dir1 and dir2 are used for wholeTextFiles and binaryFiles
    +    val dir1 = Utils.createTempDir()
    +    val dir2 = Utils.createTempDir()
    +
    +    val dirpath1 = dir1.getAbsolutePath
    +    val dirpath2 = dir2.getAbsolutePath
    +
    +    // file1 and file2 are placed inside dir1, they are also used for
    +    // textFile, hadoopFile, and newAPIHadoopFile
    +    // file3, file4 and file5 are placed inside dir2, they are used for
    +    // textFile, hadoopFile, and newAPIHadoopFile as well
    +    val file1 = new File(dir1, "part-00000")
    +    val file2 = new File(dir1, "part-00001")
    +    val file3 = new File(dir2, "part-00000")
    +    val file4 = new File(dir2, "part-00001")
    +    val file5 = new File(dir2, "part-00002")
    +
    +    val filepath1 = file1.getAbsolutePath
    +    val filepath2 = file2.getAbsolutePath
    +    val filepath3 = file3.getAbsolutePath
    +    val filepath4 = file4.getAbsolutePath
    +    val filepath5 = file5.getAbsolutePath
    +
    +
    +    try {
    +      // Create 5 text files.
    +      Files.write("someline1 in file1\nsomeline2 in file1\nsomeline3 in file1", file1, UTF_8)
    +      Files.write("someline1 in file2\nsomeline2 in file2", file2, UTF_8)
    +      Files.write("someline1 in file3", file3, UTF_8)
    +      Files.write("someline1 in file4\nsomeline2 in file4", file4, UTF_8)
    +      Files.write("someline1 in file5\nsomeline2 in file5", file5, UTF_8)
    +
    +      sc = new SparkContext(new SparkConf().setAppName("test").setMaster("local"))
    +
    +      // Test textFile, hadoopFile, and newAPIHadoopFile for file1 and file2
    +      assert(sc.textFile(filepath1 + "," + filepath2).count() == 5L)
    +      assert(sc.hadoopFile(filepath1 + "," + filepath2,
    +        classOf[TextInputFormat], classOf[LongWritable], classOf[Text]).count() == 5L)
    +      assert(sc.newAPIHadoopFile(filepath1 + "," + filepath2,
    +        classOf[NewTextInputFormat], classOf[LongWritable], classOf[Text]).count() == 5L)
    +
    +      // Test textFile, hadoopFile, and newAPIHadoopFile for file3, file4, and file5
    +      assert(sc.textFile(filepath3 + "," + filepath4 + "," + filepath5).count() == 5L)
    +      assert(sc.hadoopFile(filepath3 + "," + filepath4 + "," + filepath5,
    +               classOf[TextInputFormat], classOf[LongWritable], classOf[Text]).count() == 5L)
    +      assert(sc.newAPIHadoopFile(filepath3 + "," + filepath4 + "," + filepath5,
    +               classOf[NewTextInputFormat], classOf[LongWritable], classOf[Text]).count() == 5L)
    +
    +      // Test wholeTextFiles, and binaryFiles for dir1 and dir2
    +      assert(sc.wholeTextFiles(dirpath1 + "," + dirpath2).count() == 5L)
    +      assert(sc.binaryFiles(dirpath1 + "," + dirpath2).count() == 5L)
    +
    +    } finally {
    +      sc.stop()
    +      if (file1.exists()) {
    --- End diff --
    
    Simpler to just delete the parent dirs (dir1, dir2) recursively rather than checking each file, right?
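
A rough sketch of the cleanup being suggested, written against plain JDK file APIs so it runs on its own. The `CleanupSketch` object and its `deleteRecursively` helper are hypothetical names for illustration; inside Spark's own tests an existing utility such as `Utils.deleteRecursively` would presumably be used instead. The point is only that removing the two temp dirs in `finally` makes the per-file `exists()` checks unnecessary.

```scala
import java.io.File
import java.nio.file.Files

object CleanupSketch {
  // Hypothetical helper (not from the PR): delete a file, or a directory
  // together with everything under it. Does not special-case symlinks.
  def deleteRecursively(f: File): Unit = {
    if (f.isDirectory) {
      // listFiles can return null on an I/O error, so guard against that.
      Option(f.listFiles).getOrElse(Array.empty[File]).foreach(deleteRecursively)
    }
    f.delete()
  }

  def main(args: Array[String]): Unit = {
    // Stand-ins for dir1 and dir2 from the test under review.
    val dir1 = Files.createTempDirectory("spark-7155-a").toFile
    val dir2 = Files.createTempDirectory("spark-7155-b").toFile
    try {
      Files.write(new File(dir1, "part-00000").toPath, "someline1 in file1".getBytes("UTF-8"))
      Files.write(new File(dir2, "part-00000").toPath, "someline1 in file3".getBytes("UTF-8"))
      // ... assertions on the input paths would go here ...
    } finally {
      // One call per parent dir replaces the per-file exists()/delete() checks.
      Seq(dir1, dir2).foreach(deleteRecursively)
    }
    assert(!dir1.exists() && !dir2.exists())
  }
}
```

Deleting the parents also covers any files the test adds later, so the cleanup does not need to be kept in sync with the list of fixtures.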


