wjones127 commented on issue #12416:
URL: https://github.com/apache/arrow/issues/12416#issuecomment-1043287951
When I run this, here is the output I get for the "bad_df" run:
```
reading csv
shape: (3, 4)
┌──────┬───────┬─────────────────────┬──────────┐
│ pid1 ┆ pid2 ┆ intCol ┆ strCol │
│ --- ┆ --- ┆ --- ┆ --- │
│ i64 ┆ i64 ┆ i64 ┆ str │
╞══════╪═══════╪═════════════════════╪══════════╡
│ 2010 ┆ 10000 ┆ 23455555508999 ┆ Peaceful │
├╌╌╌╌╌╌┼╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 2015 ┆ 15000 ┆ 7753285016841556620 ┆ Happy │
├╌╌╌╌╌╌┼╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 2020 ┆ 25000 ┆ null ┆ World │
└──────┴───────┴─────────────────────┴──────────┘
Table Schema..
pid1: int64
pid2: int64
intCol: int64
strCol: large_string
Retrived table schema
pyarrow.Table
intCol: int64
strCol: large_string
pid1: int64
pid2: int64
shape: (3, 4)
┌─────────────────────┬──────────┬──────┬───────┐
│ intCol ┆ strCol ┆ pid1 ┆ pid2 │
│ --- ┆ --- ┆ --- ┆ --- │
│ i64 ┆ str ┆ i64 ┆ i64 │
╞═════════════════════╪══════════╪══════╪═══════╡
│ 23455555508999 ┆ Peaceful ┆ 2010 ┆ 10000 │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌┼╌╌╌╌╌╌╌┤
│ 7753285016841556992 ┆ Happy ┆ 2015 ┆ 15000 │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌┼╌╌╌╌╌╌╌┤
│ null ┆ World ┆ 2020 ┆ 25000 │
└─────────────────────┴──────────┴──────┴───────┘
```
My versions:
```python
import pyarrow as pa
import polars as pl
print(pa.__version__) # 4.0.0
print(pl.__version__) # 0.13.3
```
So I don't think I am seeing the same behavior as you are; the output looks
good to me. Could you share what you are seeing and also exactly which versions
of polars and pyarrow you are using?