xgdyp commented on issue #953: URL: https://github.com/apache/incubator-devlake/issues/953#issuecomment-1144533817
I see. So the main steps: 1. extract the data using gitextractor 2. for each file of git commits, process the data using some functions to get the fields of the `commits_files` table 3. meanwhile, calculate the filed `file_components` by parsing the `file_path` field. (in lake, what we need to do is parse its first-level-directory , for some other projects we need to define the rule of components manually) 4. store the results into `commits_files` table which add a new column `components` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
