mkaufmaner opened a new issue, #6344:
URL: https://github.com/apache/incubator-devlake/issues/6344

   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/incubator-devlake/issues?q=is%3Aissue) and found no similar issues.
   
   
   ### What happened
   
   The GitHub plugin pipeline task fails for a GitHub repository with:
   * ~ 125,000 commits
   * ~ 40,000 pull requests (open & closed)
   
   Simplified task JSON:
   ```json
   {
       "id": 125907,
       "createdAt": "2023-10-25T19:15:17.77Z",
       "updatedAt": "2023-10-25T19:19:15.662Z",
       "plugin": "github",
       "subtasks": [
           "collectApiPullRequests",
           "extractApiPullRequests",
           "collectApiComments",
           "extractApiComments",
           "collectApiPullRequestCommits",
           "extractApiPullRequestCommits",
           "collectApiPullRequestReviews",
           "extractApiPullRequestReviews",
           "collectApiPrReviewCommentsMeta",
           "extractApiPrReviewComments",
           "collectAccounts",
           "extractAccounts",
           "collectAccountOrg",
           "ExtractAccountOrg",
           "enrichPullRequestIssues",
           "convertRepo",
           "convertPullRequestCommits",
           "convertPullRequests",
           "convertPullRequestReviews",
           "convertPullRequestLabels",
           "convertPullRequestIssues",
           "convertPullRequestComments",
           "convertAccounts"
       ],
       "options": "{\"connectionId\":5,\"githubId\":509,\"name\":\"xxxx/xxxx-xxxx-xxx-xxxxxx-xxx\",\"timeAfter\":\"2023-04-24T00:00:00-04:00\"}",
       "status": "TASK_FAILED",
       "message": "subtask collectApiPullRequestCommits ended unexpectedly\nWraps: (2)\n  | combined messages: \n  | {\n  | \terror inserting raw rows into _raw_github_api_pull_request_commits (500)\n  | \t=====================\n [...] \terror inserting raw rows into _raw_github_api_pull_request_commits (500)\n  | }\nError types: (1) *hintdetail.withDetail (2) *errors.errorString",
       "errorName": "subtask collectApiPullRequestCommits ended unexpectedly\ncaused by: error inserting raw rows into _raw_github_api_pull_request_commits (500), [...] error inserting raw rows into _raw_github_api_pull_request_commits (500)",
       "progress": 0.17391305,
       "progressDetail": null,
       "failedSubTask": "collectApiPullRequestCommits",
       "pipelineId": 57,
       "pipelineRow": 6,
       "pipelineCol": 14,
       "beganAt": "2023-10-25T19:15:18.344Z",
       "finishedAt": "2023-10-25T19:19:15.655Z",
       "spentSeconds": 237
   }
   ```
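
For anyone triaging from the simplified JSON: the `options` field is itself a JSON-encoded string, so it has to be decoded a second time before the connection and `timeAfter` values can be read. The following is a minimal Python sketch using only field values copied from the task above; the `task` dict and the `parse_ts` helper are illustrative names, not part of DevLake.

```python
import json
from datetime import datetime

# Trimmed copy of the task record above; only the fields used here are kept.
task = {
    "failedSubTask": "collectApiPullRequestCommits",
    "options": "{\"connectionId\":5,\"githubId\":509,\"name\":\"xxxx/xxxx-xxxx-xxx-xxxxxx-xxx\",\"timeAfter\":\"2023-04-24T00:00:00-04:00\"}",
    "beganAt": "2023-10-25T19:15:18.344Z",
    "finishedAt": "2023-10-25T19:19:15.655Z",
}


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (hypothetical helper)."""
    # Older Python versions of datetime.fromisoformat reject 'Z', so swap it
    # for an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


# "options" is a string containing JSON, so decode it separately.
options = json.loads(task["options"])

# Wall-clock time between the task starting and failing.
elapsed = parse_ts(task["finishedAt"]) - parse_ts(task["beganAt"])

print("failed subtask:", task["failedSubTask"])
print("connection id:", options["connectionId"])
print("collecting data after:", options["timeAfter"])
print("seconds before failure:", int(elapsed.total_seconds()))
```

The elapsed time computed this way agrees with the `spentSeconds` value in the task, which suggests the subtask failed a few minutes into collection rather than at startup.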
   
   Full Task JSON (1.2MB):
   
[devlake-github-500.json](https://github.com/apache/incubator-devlake/files/13176855/devlake-github-500.json)
   
   Screenshot:
   
![devlake-github-500-errors](https://github.com/apache/incubator-devlake/assets/2073135/e33a8483-264d-44a5-bf0e-a1784215346a)
   
   
   ### What do you expect to happen
   
   I expected this task to be successful.
   
   ### How to reproduce
   
   I am not certain. My guess is that it can be reproduced by running the GitHub plugin against a repository with a large number of PRs and commits.
   
   ### Anything else
   
   Possibly related to https://github.com/apache/incubator-devlake/issues/6320
   
   ### Version
   
   v0.18.0
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   

