[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

dakirsa Thu, 19 Oct 2017 05:36:11 -0700

Github user dakirsa commented on the issue:

    https://github.com/apache/spark/pull/19439
  
    @hhbyyh, @thunterdb 
    > Not sure about the reason to include "origin" info into the image data. 
Based on my experience, path info 
    > serves better as a separate column in the DataFrame. (E.g. prediction)
    
    One of the main reasons is MLlib pipelines: transformers/estimators work on 
a single dataframe column; so it is much easier when "origin" is a part of this 
column too.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

Reply via email to