Hello all.

I'm confused about the current level of integration between pandas and pyarrow. 
Am I correct in understanding that currently I'll need to convert pyarrow 
Tables to pandas DataFrames in order to use most of the pandas features?  By 
"pandas features" I mean everyday slicing and dicing of data: merge, 
filtering, melt, spread, etc.

My data starts out as small files (< 1 GB) but quickly explodes into dozens 
of gigabytes of memory as a pandas DataFrame. I'm interested in whether 
Arrow can provide a better, optimized dataframe.

Thanks.
