michelengelen opened a new issue, #8817:
URL: https://github.com/apache/incubator-devlake/issues/8817

   ### Search before asking
   
   - [x] I had searched in the 
[issues](https://github.com/apache/incubator-devlake/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### What happened
   
   The gitextractor started to fail for one of our repos recently.
   It seems the reason is that it tries to collect data for a PR that does not 
exist (anymore) and after 3 retries exits with a failure message:
   
   ```
   time="2026-03-29 12:33:22" level=error msg=" [pipeline service] [pipeline 
#828] [task #16537] attached stack trace\n\t  -- stack trace:\n\t  | 
github.com/apache/incubator-devlake/core/runner.RunPluginSubTasks\n\t  | 
\t/app/core/runner/run_task.go:333\n\t  | [...repeated from below...]\n\tWraps: 
(2) subtask Collect Pull Requests ended unexpectedly\n\tWraps: (3) attached 
stack trace\n\t  -- stack trace:\n\t  | 
github.com/apache/incubator-devlake/helpers/pluginhelper/api.(*WorkerScheduler).WaitAsync\n\t
  | \t/app/helpers/pluginhelper/api/worker_scheduler.go:173\n\t  | 
github.com/apache/incubator-devlake/helpers/pluginhelper/api.(*ApiCollector).Execute\n\t
  | \t/app/helpers/pluginhelper/api/api_collector.go:206\n\t  | 
github.com/apache/incubator-devlake/helpers/pluginhelper/api.(*StatefulApiCollector).Execute\n\t
  | \t/app/helpers/pluginhelper/api/api_collector_stateful.go:97\n\t  | 
github.com/apache/incubator-devlake/plugins/github/tasks.CollectApiPullRequests\n\t
  | \t/app/plugins/g
 ithub/tasks/pr_collector.go:140\n\t  | 
github.com/apache/incubator-devlake/core/runner.runSubtask\n\t  | 
\t/app/core/runner/run_task.go:425\n\t  | 
github.com/apache/incubator-devlake/core/runner.RunPluginSubTasks\n\t  | 
\t/app/core/runner/run_task.go:330\n\t  | 
github.com/apache/incubator-devlake/core/runner.RunPluginTask\n\t  | 
\t/app/core/runner/run_task.go:165\n\t  | 
github.com/apache/incubator-devlake/core/runner.RunTask\n\t  | 
\t/app/core/runner/run_task.go:139\n\t  | 
github.com/apache/incubator-devlake/server/services.runTaskStandalone\n\t  | 
\t/app/server/services/task_runner.go:114\n\t  | 
github.com/apache/incubator-devlake/server/services.RunTasksStandalone.func1\n\t
  | \t/app/server/services/task.go:187\n\t  | runtime.goexit\n\t  | 
\t/usr/local/go/src/runtime/asm_amd64.s:1598\n\tWraps: (4) Retry exceeded 3 
times calling repos/mui/material-ui/pulls/47788. The last error was: Http 
DoAsync error calling [method:GET path:repos/mui/material-ui/pulls/47788 
query:map[]]. Response
 : {\"message\":\"Not 
Found\",\"documentation_url\":\"https://docs.github.com/rest/pulls/pulls#get-a-pull-request\",\"status\":\"404\"}
 (404)\n\tError types: (1) *withstack.withStack (2) *errutil.withPrefix (3) 
*withstack.withStack (4) *errutil.leafError"
   ```
   
   I don't know how it even gets the id for this PR, since the list call I 
tried does not even return it.
   
   `gh pr list --limit 1000 --json number,title,state,author,createdAt` does 
not include the PR in question.
   
   `gh pr view 47788 --json 
number,title,state,author,body,createdAt,mergedAt,files` returns: `GraphQL: 
Could not resolve to a PullRequest with the number of 47788. 
(repository.pullRequest)`
   
   ### What do you expect to happen
   
   The extractor should finish and skip `404` status silently (maybe with a 
warning), but not fail the collection altogether
   
   ### How to reproduce
   
   ---
   
   ### Anything else
   
   every time
   
   ### Version
   
   v1.0.3-beta10
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [x] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to