[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-12-18 Thread taotao li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16999645#comment-16999645 ] taotao li commented on ARROW-7043: -- [~apitrou]  Hi, Antonie, sorry for updating this issue so late,

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-12-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16999054#comment-16999054 ] Antoine Pitrou commented on ARROW-7043: --- [~taotao] Could you give an update here? > [Python]

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-11-08 Thread taotao li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970704#comment-16970704 ] taotao li commented on ARROW-7043: -- thanks [~wesm] and [~apitrou] for your detailed test, let me try on

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-11-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16966785#comment-16966785 ] Wes McKinney commented on ARROW-7043: - The jemalloc changes in 0.15.1 could be a factor here >

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-11-04 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16966682#comment-16966682 ] Antoine Pitrou commented on ARROW-7043: --- I've tried to reproduce using pyarrow 0.15.1, compiled

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-11-01 Thread taotao li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16965218#comment-16965218 ] taotao li commented on ARROW-7043: -- [~apitrou] thanks, Antonie, I've update a google driver link.   *

[jira] [Commented] (ARROW-7043) [Python] pyarrow.csv.read_csv, memory consumed much larger than raw pandas.read_csv

2019-11-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16965089#comment-16965089 ] Antoine Pitrou commented on ARROW-7043: --- Yes, can you upload the file? > [Python]