2bethere commented on pull request #9449: URL: https://github.com/apache/druid/pull/9449#issuecomment-625975288
Thanks for the contribution, I have a few questions. 1. If the SQL table has a timestamp like column, is there a way for me to specify this as a parameter so that not the entire table gets pulled? 2. Is there a way for me to specify which column to split this on? Because the user might already know how the table is sharded/partitioned to make it more efficient in parallel ingestion 3. If incremental loads are supported, how are duplicates handled? Do I specify a key or this is handled downstream? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
