Dears, I would like to get more information from you in order for me to use Arrow and be able to contribute in the near future.
What i see in Arrow that i can read and write Arrow files (from the vector test classes), i did not see tests for sending data over a network. As i understood from the project proposal (correct me if i am wrong.), that i can write Arrow Array from somewhere and read from somewhere else, this means that Arrow would be such a centralised server that hold a state and engines will connect to it to write Arrow Arrays and other engines will read (like in the picture bellow). How far Arrow from having this centralised system, where we are now? I am working on an application which is about moving data while changing the schema in between the source and the destination. Like moving the data from Apache Spark to Apache Flink and in between change the schema. Regards, [cid:76C48A02-6D9A-4B48-9952-F992A981414B] ------------------------------------------------------ Abdulrahman Kaitoua Ph.D. Candidate at Politecnico di Milano Department of Electronics, Information and Bioengineering Piazza Leonardo da Vinci 32 - 20133 Milano, Italy Tel. Lab: +39 02 2399 3631
