timsaucer commented on PR #1041:
URL: https://github.com/apache/datafusion-python/pull/1041#issuecomment-2702330600

   We have 3 PRs that all affect `__repr__` and `_repr_html_`. We 
have:
   
   - This one which does the additional data checking with a collect()
   - #1015 which collects until we get to 10 rows
   - #1036 which collects 2MB or 20 rows but just for the html rendering
   
   I suggest we consolidate. My proposal is:
   
   - we merge in #1015 as it is
   - I update #1036 to combine the collecting operations to be either by 
minimum number of rows or data size
   - We close #1041 in favor of the truncation message from #1036 (I'll add it 
to `__repr__` also).
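
To make the combined collection rule concrete, here is a minimal sketch of one possible reading: keep pulling batches until either a minimum row count or a size budget is reached, and report whether anything was left behind so a truncation message can be shown. The function name `collect_preview`, its parameters, and the `size_of` estimator are all hypothetical and are not the datafusion-python API; the defaults only echo the 20 rows and 2MB mentioned above.

```python
from typing import Callable, Iterable, List, Sequence, Tuple

Batch = Sequence[object]  # stand-in for a record batch: a sequence of rows


def _rough_size(batch: Batch) -> int:
    # Hypothetical size estimate: length of each row's repr, in characters.
    return sum(len(repr(row)) for row in batch)


def collect_preview(
    batches: Iterable[Batch],
    min_rows: int = 20,
    max_bytes: int = 2 * 1024 * 1024,
    size_of: Callable[[Batch], int] = _rough_size,
) -> Tuple[List[Batch], bool]:
    """Collect batches until min_rows or max_bytes is reached.

    Returns (collected_batches, truncated), where truncated is True when
    at least one more batch was available after collection stopped.
    """
    collected: List[Batch] = []
    rows = 0
    used = 0
    it = iter(batches)
    for batch in it:
        collected.append(batch)
        rows += len(batch)
        used += size_of(batch)
        if rows >= min_rows or used >= max_bytes:
            break
    else:
        # The input ran out before either limit was hit: nothing truncated.
        return collected, False

    # A limit was hit; peek once to see whether more data exists.
    # (Assumption: a trailing empty batch still counts as "more".)
    sentinel = object()
    truncated = next(it, sentinel) is not sentinel
    return collected, truncated
```

With this shape, both `__repr__` and `_repr_html_` could share the same collection step and decide independently how to render the `truncated` flag.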
   
   Does this sound reasonable?
   
   Also, it's incredible to have so many people pitching in at the same time. I 
will try to spend some time this weekend to organize some of the open issues to 
make it easier to not duplicate effort.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

