[
https://issues.apache.org/jira/browse/BEAM-12533?focusedWorklogId=615927&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-615927
]
ASF GitHub Bot logged work on BEAM-12533:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 29/Jun/21 00:16
Start Date: 29/Jun/21 00:16
Worklog Time Spent: 10m
Work Description: ibzib commented on a change in pull request #15089:
URL: https://github.com/apache/beam/pull/15089#discussion_r660189622
##########
File path: sdks/python/apache_beam/dataframe/frames.py
##########
@@ -143,6 +143,14 @@ def wrapper(self, *args, **kwargs):
class DeferredDataFrameOrSeries(frame_base.DeferredFrame):
+ def _render_indexes(self):
Review comment:
It may be simpler to always emit `indexes={...}`, even for a single index.
Though TBH I'm not sure why `index.name` and `index.names` are separate
attributes (does this differ between a pandas index and a deferred index?).
Related question: why can't we use `repr(index)`? Is there more to an index
than just its name? For example, if there are different types of index (which
I gather there are), could that information be useful to the user?
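For reference on the questions above, here is a minimal sketch of how plain pandas behaves. It says nothing about the deferred index in this PR, and the index values and names are made up for illustration:

```python
import pandas as pd

# Illustrative indexes; the values and names are assumptions, not from the PR.
single = pd.Index([1, 2, 3], name="a")
multi = pd.MultiIndex.from_tuples([(1, "x"), (2, "y")], names=["num", "letter"])

# Every pandas index exposes `names`; a single-level index has one entry.
single_names = list(single.names)
multi_names = list(multi.names)

# `name` is the shorthand for the single-level case.
single_name = single.name

# repr() also reports the index type and, where applicable, the dtype.
for index in (single, multi, pd.RangeIndex(3)):
    print(repr(index))
```

In pandas, `names` works uniformly for single-level and multi-level indexes, which is one argument for always rendering `indexes={...}`. `repr(index)` carries type information but also prints the index values, which a deferred frame would not have available.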
Issue Time Tracking
-------------------
Worklog Id: (was: 615927)
Time Spent: 4h (was: 3h 50m)
> DeferredSeries and DeferredDataFrame should have a useful repr
> --------------------------------------------------------------
>
> Key: BEAM-12533
> URL: https://issues.apache.org/jira/browse/BEAM-12533
> Project: Beam
> Issue Type: Improvement
> Components: dsl-dataframe
> Reporter: Brian Hulette
> Assignee: Brian Hulette
> Priority: P2
> Fix For: 2.32.0
>
> Time Spent: 4h
> Remaining Estimate: 0h
>
> DeferredSeries and DeferredDataFrame currently use the default __repr__
> implementation, so displaying them in a notebook is not useful at all.
> Users have to inspect columns, dtypes, index, name, etc. manually.
> We should include basic information about the frames in a simple
> __repr__ implementation.
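
A minimal sketch of the kind of `__repr__` the description asks for, under stated assumptions: `DeferredFrameSketch` and its `_proxy` attribute are hypothetical stand-ins, not Beam's actual classes, and the output format is illustrative only. It assumes the deferred object can consult an empty pandas frame that carries the schema:

```python
import pandas as pd


class DeferredFrameSketch:
    """Hypothetical stand-in for a deferred frame; not Beam's actual class."""

    def __init__(self, proxy: pd.DataFrame):
        # An empty frame that carries only schema information
        # (columns, dtypes, index names), never real data.
        self._proxy = proxy

    def __repr__(self) -> str:
        columns = list(self._proxy.columns)
        dtypes = {name: str(dtype) for name, dtype in self._proxy.dtypes.items()}
        # Report index names as a list, even for a single-level index.
        indexes = list(self._proxy.index.names)
        return (
            f"{type(self).__name__}(columns={columns!r}, "
            f"dtypes={dtypes!r}, indexes={indexes!r})"
        )


# Build an empty, typed frame to act as the schema proxy.
proxy = pd.DataFrame(
    {"a": pd.Series(dtype="int64"), "b": pd.Series(dtype="float64")}
).rename_axis("id")
frame = DeferredFrameSketch(proxy)
print(repr(frame))
```

Because the repr is built from schema alone, it can be shown in a notebook without running any computation.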