Jung Kim created ARROW-16698:
--------------------------------
Summary: Read First N rows of feather file through Python
Key: ARROW-16698
URL: https://issues.apache.org/jira/browse/ARROW-16698
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Reporter: Jung Kim
It would be helpfully to be able to read first N rows of feather file.
e.g.
[read_feather|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_feather.html]
or
[read_table|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_table.html]
could have "nrows" argument that behaves like "nrows" argument in
[pd.read_csv|https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html]
my particular use case is to development and test my data jobs. Say I saved a
large data file(s) in feather format. For development, I don't want to read all
rows but just first few rows of data to make sure my code runs.
I suppose there are many other use-cases.
Reference: https://github.com/wesm/feather/issues/158
--
This message was sent by Atlassian Jira
(v8.20.7#820007)