H0TB0X420 commented on issue #1227:
URL: 
https://github.com/apache/datafusion-python/issues/1227#issuecomment-3330305485

   I used these commands to find where pyarrow is used for input parameters and 
return types.
   
   `grep -r "import pyarrow"` -> 
[pyarrow_imports.txt](https://github.com/user-attachments/files/22521550/pyarrow_imports.txt):
 All files importing pyarrow
   `grep -r "pa\."` -> 
[pyarrow_usage.txt](https://github.com/user-attachments/files/22521551/pyarrow_usage.txt):
 All usage of `pa.` objects in the codebase
   
   
   
   ### Input Parameters
   **SessionContext methods:**
   - `from_arrow_table(data: pa.Table)` 
   - `create_dataframe(partitions: list[list[pa.RecordBatch]])`
   - `register_csv(..., schema: pa.Schema)`
   - `register_parquet(..., schema: pa.Schema)` 
   - `register_json(..., schema: pa.Schema)`
   - `register_dataset(dataset: pa.dataset.Dataset)`
   
   **DataFrame methods:**
   - `cast(mapping: dict[str, pa.DataType])`
   
   **User-defined functions:**
   - Various methods accepting `pa.DataType`, `pa.Array`, `pa.Scalar`
   
   ### Return Types
   
   **Core API methods:**
   - `DataFrame.collect() -> List[pa.RecordBatch]`
   - `DataFrame.collect_partitioned() -> List[List[pa.RecordBatch]]`
   - `DataFrame.schema() -> pa.Schema`
   - `DataFrame.to_arrow_table() -> pa.Table`
   - `RecordBatch.to_pyarrow() -> pa.RecordBatch`
   - `DataFrame.into_view() -> pa.Table`
   
   **Convenience methods:**
   - `DataFrame.to_pandas()` 
   - `DataFrame.to_polars()` 
   
   What's the recommended approach for tackling this?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to