This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
     new bacd34398aaa [SPARK-53182][PYTHON][DOCS] Fix broken and missing links 
in PySpark DataFrames user guide
bacd34398aaa is described below

commit bacd34398aaaf97f4a97cf8574fc490ed96b3e06
Author: Jonny Comes <jonnyco...@gmail.com>
AuthorDate: Mon Sep 15 08:00:11 2025 +0900

    [SPARK-53182][PYTHON][DOCS] Fix broken and missing links in PySpark 
DataFrames user guide
    
    ### What changes were proposed in this pull request?
    
    This PR fixes two small issues in the PySpark DataFrames user guide:
    
    1. **Replaces a broken external link** to a section on data manipulation. 
The previous link pointed to an outdated Databricks-hosted page. It is now 
replaced with a working internal link to Chapter 3 of the PySpark user guide:
       [Chapter 3: Function Junction - Data manipulation with 
PySpark](https://spark.apache.org/docs/latest/api/python/user_guide/dataprep.html)
    
    2. **Adds a missing link** for the section on saving DataFrames to 
persistent storage. The text previously said “TODO: add link.” This has been 
replaced with a correct reference to:
       [Chapter 7: Load and Behold - Data loading, storage, file 
formats](https://spark.apache.org/docs/latest/api/python/user_guide/loadandbehold.html)
    
    ### Why are the changes needed?
    
    These changes improve the quality and usability of the documentation by 
fixing a broken link and completing a placeholder that may confuse users. It 
ensures readers are directed to up-to-date, relevant internal documentation 
instead of an outdated or unavailable external resource.
    
    ### Does this PR introduce _any_ user-facing change?
    
    Yes. It updates two markdown cells in the `dataframes.ipynb` user guide 
notebook, affecting how users navigate to related documentation when reading 
the generated HTML docs.
    
    ### How was this patch tested?
    
    The documentation was built locally using `make html` in the `python/docs` 
directory. The rendered output for the notebook was reviewed in a browser to 
confirm the links appear and function correctly.
    
    <img width="1451" height="225" alt="Screenshot 2025-08-07 at 11 53 19 AM" 
src="https://github.com/user-attachments/assets/41d3342d-0637-48e5-b010-e2b6e5bc5630";
 />
    <img width="1435" height="231" alt="Screenshot 2025-08-07 at 11 52 54 AM" 
src="https://github.com/user-attachments/assets/466be073-1f90-4cf1-b1e7-ad03f5bbe21a";
 />
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #51851 from jonnycomes/fix-doc-user-guide-dataframes-links.
    
    Authored-by: Jonny Comes <jonnyco...@gmail.com>
    Signed-off-by: Hyukjin Kwon <gurwls...@apache.org>
---
 python/docs/source/user_guide/dataframes.ipynb | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

diff --git a/python/docs/source/user_guide/dataframes.ipynb 
b/python/docs/source/user_guide/dataframes.ipynb
index 7b02055367a5..49f0fbcdab27 100644
--- a/python/docs/source/user_guide/dataframes.ipynb
+++ b/python/docs/source/user_guide/dataframes.ipynb
@@ -528,7 +528,9 @@
                 "\n",
                 "Let's look at some ways we can transform our DataFrames.\n",
                 "\n",
-                "For more detailed information, please see the section about 
data manipulation, [Chapter 6: Function Junction - Data manipulation with 
PySpark](https://databricks-eng.github.io/pyspark-cookbook/07-dataprep.html).\n",
+                "For more detailed information, please see the section about 
data manipulation, ",
+                "[Chapter 3: Function Junction - Data manipulation with 
PySpark](https://spark.apache.org/docs/latest/api/python/user_guide/dataprep.html).\n",
+                "\n",
                 "\n",
                 "### Rename columns\n",
                 "\n",
@@ -686,8 +688,8 @@
                 "## Save DataFrame to Persistent Storage\n",
                 "\n",
                 "There are several ways to save a DataFrame to persistent 
storage in PySpark.\n",
-                "For more detailed information about saving data to your local 
environment,\n",
-                "please see the section about Data Loading (TODO: add link)."
+                "For more detailed information about saving data locally, see 
",
+                "[Chapter 7: Load and Behold - Data loading, storage, file 
formats](https://spark.apache.org/docs/latest/api/python/user_guide/loadandbehold.html).\n"
             ]
         },
         {
@@ -795,3 +797,4 @@
     "nbformat": 4,
     "nbformat_minor": 5
 }
+


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

Reply via email to