Jung Kim created ARROW-16698:
--------------------------------

             Summary: Read First N rows of feather file through Python 
                 Key: ARROW-16698
                 URL: https://issues.apache.org/jira/browse/ARROW-16698
             Project: Apache Arrow
          Issue Type: New Feature
          Components: Python
            Reporter: Jung Kim


It would be helpfully to be able to read first N rows of feather file.
e.g.  
[read_feather|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_feather.html]
 or 
[read_table|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_table.html]
 could have "nrows" argument that behaves like "nrows" argument in 
[pd.read_csv|https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html] 

my particular use case is to development and test my data jobs. Say I saved a 
large data file(s) in feather format. For development, I don't want to read all 
rows but just first few rows of data to make sure my code runs.

I suppose there are many other use-cases.

Reference: https://github.com/wesm/feather/issues/158



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to