rahil-c opened a new issue, #17622:
URL: https://github.com/apache/hudi/issues/17622

   ### Task Description
   
   Goal: We should be able to construct a writer that takes in Spark's 
InternalRow and outputs Lance files.
   Interface:
   
   Exit criteria: We should be able to take an iterator of rows and write those 
out to a Lance file. The file should then be readable and return back the same 
data that was written.
   
   The Lance writer takes in data as Arrow formatted columns (vectors). We 
should create a writer that takes in all of the Lance specific configurations 
and then a converter of type T to Arrow. This allows us to easily make a writer 
compatible with other engines in the future but for now we only need an 
implementation for InternalRow to Arrow.
   
   ### Task Type
   
   Code improvement/refactoring
   
   ### Related Issues
   
   **Parent feature issue:** (if applicable )
   **Related issues:**
   NOTE: Use `Relationships` button to add parent/blocking issues after issue 
is created.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to