[jira] [Commented] (ARROW-13939) how to do resampling of arrow table using cython

krishna deepak (Jira) Thu, 09 Sep 2021 02:46:21 -0700


    [ 
https://issues.apache.org/jira/browse/ARROW-13939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17412481#comment-17412481
 ]


krishna deepak commented on ARROW-13939:
----------------------------------------

Hi Will,

Yes, resampling timeseries table, eg 1min buckets to 5 min buckets table etc. 
Same as dataframe.resample. Does arrow provide this functionality already?


So how to go about iterating the table. from documentation all I could use is 
only *Slice* function. But does not feel like a proper iterator of sorts. Is 
there anything better to iterate properly?

By "build arrays", you mean arrow chunkedarrays, arraybuilders or cpp vectors?

 

Thanks

 

 

> how to do resampling of arrow table using cython
> ------------------------------------------------
>
>                 Key: ARROW-13939
>                 URL: https://issues.apache.org/jira/browse/ARROW-13939
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Python
>            Reporter: krishna deepak
>            Priority: Minor
>
> Please can someone point me to resources, how to write a resampling code in 
> cython for Arrow table.
>  # Will iterating the whole table be slow in cython?
>  # which is the best to use to append new elements to. Is there a way i 
> create an empty table of same schema and keep appending to it. Or should I 
> use vectors/list and then pass them to create a table.
> Performance is very important for me. Any help is highly appreciated.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (ARROW-13939) how to do resampling of arrow table using cython

Reply via email to