Hello, I would like to contribute pandas support in the python API.
There is a jira ticket <https://issues.apache.org/jira/browse/KUDU-1276> regarding this however the level is quite technical and beyond my current abilities. I would like to get consensus if you are open to simpler solutions in the interim. To give you an idea, I was looking at doing something along the lines of: import pandas as pd scanner = table.scanner() scanner.open() data = scanner.read_all_tuples() pd.DataFrame(data, columns=table.schema.names).set_index(table.schema.primary_keys()) Please let me know if such solutions are welcome. Kind Regards, Greg
