This is an automated email from the ASF dual-hosted git repository.
dongjoon pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/orc.git
The following commit(s) were added to refs/heads/main by this push:
new f50375478 ORC-2057: Add `Pandas` page at `Using in Python` section
f50375478 is described below
commit f50375478d5c435e84a70af72cc637bc62b8dc26
Author: Dongjoon Hyun <[email protected]>
AuthorDate: Mon Jan 5 11:54:02 2026 +0900
ORC-2057: Add `Pandas` page at `Using in Python` section
### What changes were proposed in this pull request?
This PR aims to add `Pandas` page at `Using in Python` section.
### Why are the changes needed?
To helps `Pandas` users.
### How was this patch tested?
Manually generate the page and check.
<img width="627" height="706" alt="Screenshot 2026-01-05 at 10 32 58"
src="https://github.com/user-attachments/assets/816c20e2-8682-491a-90f2-3614da188aa8"
/>
### Was this patch authored or co-authored using generative AI tooling?
Yes (`Gemini 3 Pro` on `Antigravity`)
Closes #2480 from dongjoon-hyun/ORC-2057.
Authored-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
---
site/_data/docs.yml | 1 +
site/_docs/pandas.md | 41 +++++++++++++++++++++++++++++++++++++++++
2 files changed, 42 insertions(+)
diff --git a/site/_data/docs.yml b/site/_data/docs.yml
index a3346b133..745d2c231 100644
--- a/site/_data/docs.yml
+++ b/site/_data/docs.yml
@@ -18,6 +18,7 @@
- title: Using in Python
docs:
- pyarrow
+ - pandas
- dask
- title: Using in Hive
diff --git a/site/_docs/pandas.md b/site/_docs/pandas.md
new file mode 100644
index 000000000..ad1b3c697
--- /dev/null
+++ b/site/_docs/pandas.md
@@ -0,0 +1,41 @@
+---
+layout: docs
+title: Pandas
+permalink: /docs/pandas.html
+---
+
+## How to install
+
+[Pandas](https://pandas.pydata.org/) is a fast, powerful, flexible and easy to
use open source data analysis and manipulation tool.
+Since Pandas relies on [pyarrow](https://pypi.org/project/pyarrow/) for ORC
support, it is required.
+
+```
+pip3 install pandas==2.3.3
+pip3 install pyarrow
+```
+
+## How to write and read an ORC file
+
+```
+In [1]: import pandas as pd
+
+In [2]: df = pd.DataFrame({"col1": [1, 2, 3], "col2": ["a", "b", None]})
+
+In [3]: df.to_orc("test.orc")
+
+In [4]: pd.read_orc("test.orc")
+Out[4]:
+ col1 col2
+0 1 a
+1 2 b
+2 3 None
+
+In [5]: pd.read_orc("test.orc", columns=["col1"])
+Out[5]:
+ col1
+0 1
+1 2
+2 3
+```
+
+[Pandas](https://pandas.pydata.org/docs/) page provides more information.