HyukjinKwon commented on PR #37437: URL: https://github.com/apache/spark/pull/37437#issuecomment-1208831610
Hey @Transurgeon, thanks for taking a look. > Does self-contained simply mean that we need to initialise the dataframe in each shell example? Yes. Plus, we should add some description for each example. Basiclaly I would like to follow what the pandas do. > Also what do you mean by more realistic examples? Something meaningful. The operation has to do something. For example, `spark.createDataFrame([1]).count()` doesn't much make sense. > Just to make sure for building the docs, I need to run these two commands right? Yes. As long as the format is consistent, you might not need to build and validate it by yourself though. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
