Keiji Yoshida created SPARK-26335:
-------------------------------------

             Summary: Add an option for Dataset#show not to care about wide 
characters when padding them
                 Key: SPARK-26335
                 URL: https://issues.apache.org/jira/browse/SPARK-26335
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.4.0
            Reporter: Keiji Yoshida


https://issues.apache.org/jira/browse/SPARK-25108 makes Dataset#show care about 
wide characters when padding them. That is useful for humans to read a result 
of Dataset#show. On the other hand, that makes it impossible for programs to 
parse a result of Dataset#show because each cell's length can be difference 
from its header's length. My company develops and manages a Jupyter/Apache 
Zeppelin-like visualization tool named "OASIS" 
([https://databricks.com/session/oasis-collaborative-data-analysis-platform-using-apache-spark]).
 On this application, a result of Dataset#show is parsed to visualize it as an 
HTML table format.

So, it is preferable to add an option for Dataset#show not to care about wide 
characters when padding them by adding a parameter such as "fixedColLength" to 
Dataset#show.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to