[GitHub] [spark] d80tb7 opened a new pull request #25939: [SPARK-27463][PYTHON] Tidy Up

GitBox Thu, 26 Sep 2019 01:22:43 -0700

d80tb7 opened a new pull request #25939: [SPARK-27463][PYTHON] Tidy Up
URL: https://github.com/apache/spark/pull/25939
 
 
   Follow up from https://github.com/apache/spark/pull/24981 incorporating some 
comments from @HyukjinKwon.
   
   Specifically:
   
   - Adding `CoGroupedData` to `pyspark/sql/__init__.py __all__` so that 
documentation is generated.
   - Added pydoc, including example, for the use case whereby the user supplies 
a cogrouping function including a key.
   - Added the boilerplate for doctests to cogroup.py.  Note that cogroup.py 
only contains the apply() function which has doctests disabled as per the  
other Pandas Udfs.
   - Restricted the newly exposed RelationalGroupedDataset constructor 
parameters to access only by the sql package.
   - Some minor  formatting tweaks.
   
   This was tested by running the appropriate unit tests.  I'm unsure as to how 
to check that my change will cause the documentation to be generated correctly, 
but it someone can describe how I can do this I'd be happy to check.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] d80tb7 opened a new pull request #25939: [SPARK-27463][PYTHON] Tidy Up

Reply via email to