Re: [C++] CSV string column category to dictionary/indices?

2019-12-03 Thread ntfs hard
Hello Thank you for your advice! I'll try to adapt it to my code. Best, -- вт, 3 дек. 2019 г. в 17:16, Antoine Pitrou : > > Agreed. I've opened https://issues.apache.org/jira/browse/ARROW-7302 to > track it. > > Regards > > Antoine. > > > Le 03/12/2019 à 04:55, Wes McKinney a écrit : > > An

Re: [C++] CSV string column category to dictionary/indices?

2019-12-03 Thread Antoine Pitrou
Agreed. I've opened https://issues.apache.org/jira/browse/ARROW-7302 to track it. Regards Antoine. Le 03/12/2019 à 04:55, Wes McKinney a écrit : > An option was recently added to dictionary encode all string columns > >

Re: [C++] CSV string column category to dictionary/indices?

2019-12-02 Thread Wes McKinney
An option was recently added to dictionary encode all string columns https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/options.h#L82 I think it would be useful to be able to hard-opt-in to dictionary-encode a particular column (regardless of the what cardinality ends up being).