[jira] [Commented] (SPARK-26335) Add an option for Dataset#show not to care about wide characters when padding them

Hyukjin Kwon (JIRA) Wed, 12 Dec 2018 21:43:16 -0800


    [ https://issues.apache.org/jira/browse/SPARK-26335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16719775#comment-16719775 ]


Hyukjin Kwon commented on SPARK-26335:
--------------------------------------

Hm, I don't think the output of Dataset#show is meant to be parsed. It is 
only for pretty-printing. To build an HTML table, I think you should use 
collect, or copy the relevant methods from Spark into your project and do 
the formatting there.
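
As a rough illustration of the collect-based route, here is a minimal sketch in Python. The helper name rows_to_html is hypothetical and not part of Spark. It only assumes that column names and rows are available as plain sequences, which is what df.columns and df.collect() return in PySpark. Because no padding is involved, wide characters cannot affect the result.

```python
from html import escape


def rows_to_html(columns, rows):
    """Render column names and row sequences as a minimal HTML table.

    Hypothetical helper, not a Spark API. `columns` is a list of names and
    `rows` is an iterable of tuples (PySpark Row objects are tuples too).
    """
    def cell(value):
        # Render nulls as empty cells; escape everything else for HTML.
        return "" if value is None else escape(str(value))

    head = "".join(f"<th>{escape(str(c))}</th>" for c in columns)
    body = "".join(
        "<tr>" + "".join(f"<td>{cell(v)}</td>" for v in row) + "</tr>"
        for row in rows
    )
    return f"<table><tr>{head}</tr>{body}</table>"


# Plain tuples stand in for collected rows so the sketch runs without Spark.
sample = rows_to_html(["id", "name"], [(1, "日本語"), (2, "a<b"), (3, None)])
```

With a real DataFrame the call would look like rows_to_html(df.columns, df.limit(20).collect()). The limit is there because collect brings every row to the driver.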

> Add an option for Dataset#show not to care about wide characters when padding 
> them
> ----------------------------------------------------------------------------------
>
>                 Key: SPARK-26335
>                 URL: https://issues.apache.org/jira/browse/SPARK-26335
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.4.0
>            Reporter: Keiji Yoshida
>            Priority: Major
>         Attachments: Screen Shot 2018-12-11 at 17.53.54.png
>
>
> h2. Issue
> https://issues.apache.org/jira/browse/SPARK-25108 makes Dataset#show take 
> wide characters into account when padding cells. That makes the result of 
> Dataset#show easier for humans to read. On the other hand, it makes it 
> impossible for programs to parse the result of Dataset#show, because each 
> cell's length can differ from its header's length. My company develops and 
> manages a Jupyter/Apache Zeppelin-like visualization tool named "OASIS" 
> ([https://databricks.com/session/oasis-collaborative-data-analysis-platform-using-apache-spark]).
> In this application, the result of Dataset#show in a Scala or Python process 
> is parsed and rendered as an HTML table. (A screenshot of OASIS is attached 
> to this ticket as a file named "Screen Shot 2018-12-11 at 17.53.54.png".)
> h2. Solution
> Add an option for Dataset#show not to care about wide characters when padding 
> them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

