NetsanetGeb commented on issue #714: Performance Comparison of 
HoodieDeltaStreamer and DataSourceAPI
URL: https://github.com/apache/incubator-hudi/issues/714#issuecomment-512140516
 
 
   Yes, you can extract data from [IPUMS USA](https://usa.ipums.org/usa/) to 
run the workload locally. I am not allowed to share the files I downloaded 
from there. Instead, you can extract the dataset from their site by selecting 
the column fields that you want in CSV format, and then convert it to JSON so 
that it can be read with a JSON source class. 
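
A minimal sketch of the CSV-to-JSON step, assuming the extract has a header row and that the JSON source reads one JSON object per line. The function name `csv_to_json_lines` and the file names in the usage note are placeholders for illustration, not part of Hudi or IPUMS.

```python
import csv
import json


def csv_to_json_lines(csv_path, json_path):
    """Convert a CSV file with a header row into newline-delimited JSON.

    Returns the number of records written. Every value is kept as a string,
    exactly as it appears in the CSV (assumption: types are handled later).
    """
    count = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(json_path, "w", encoding="utf-8") as dst:
        # DictReader uses the header row as the keys of each record.
        for row in csv.DictReader(src):
            dst.write(json.dumps(row) + "\n")
            count += 1
    return count
```

For example, `csv_to_json_lines("ipums_extract.csv", "ipums_extract.json")` would write one JSON record per CSV row. Since all values come out as strings, numeric columns may need casting if the schema you use expects numeric types.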
    I am also glad to do a video call at a time that is convenient for both of 
us, maybe on a weekend or next week, to debug it together. Thanks,

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
