Proposal:

DIL (LinkedIn internal project name) is a generic multi-stage Gobblin connector 
library. The code can be accessed here:  
https://github.com/linkedin/gobblin-connectors. Its core features and a
high-level description are shared here:
https://engineering.linkedin.com/blog/2021/data-integration-library.

Following initial discussion with members of the Gobblin community, we are
proposing a separate sub-repo for this library.

Why:
Some justifications for a sub-repo rather than a module in the main Gobblin
repo:


  1.  Gobblin connectors are an important part of the Gobblin ecosystem, but
their development is relatively independent of Gobblin core.
  2.  Connectors are where open source communities can contribute the most,
and they will grow much faster than Gobblin core.
  3.  The new connector library is a comprehensive package of unique design 
patterns. This is where the data integration diversity challenge will be 
addressed. The importance of this code base grows by the day as more
integration scenarios become supported.
  4.  The new connector library evolves and replaces many prior Gobblin 
connectors under the “gobblin-modules” module. A separate repo will help avoid 
confusion.
  5.  Separating core and ecosystem modules can help improve isolation and 
reduce the number of defects.

Regards,
Chris



