2bethere commented on pull request #9449:
URL: https://github.com/apache/druid/pull/9449#issuecomment-625975288


   Thanks for the contribution, I have a few questions.
   
   1. If the SQL table has a timestamp like column, is there a way for me to 
specify this as a parameter so that not the entire table gets pulled?
   2. Is there a way for me to specify which column to split this on? Because 
the user might already know how the table is sharded/partitioned to make it 
more efficient in parallel ingestion
   3. If incremental loads are supported, how are duplicates handled? Do I 
specify a key or this is handled downstream?
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to