dosubot[bot] commented on issue #8529:
URL: 
https://github.com/apache/incubator-devlake/issues/8529#issuecomment-3163442439

   <!-- Greeting -->
   Hi @zia-devsecops! I'm [Dosu](https://go.dosu.dev/dosubot) and I’m helping 
the incubator-devlake team.
   
   <!-- Answer -->
   This is a known limitation in DevLake's GitLab integration: user data 
collection uses offset pagination (with `page` and `per_page` parameters), 
which is subject to GitLab's maximum offset of 5000. When the user count 
exceeds this, data collection fails—keyset pagination is not implemented for 
GitLab users in DevLake, so the error message about switching to keyset 
pagination can't be addressed by configuration or environment variables right 
now. This has been discussed in detail in open issues 
([#8523](https://github.com/apache/incubator-devlake/issues/8523), 
[#8283](https://github.com/apache/incubator-devlake/issues/8283)), and no 
recent changes or merged PRs resolve it.
   
   Workarounds include periodically deduplicating or cleaning up the 
`_raw_gitlab_api_users` table, or limiting how often you collect all users. 
There is a proposed (but not merged) solution to collect users only once per 
pipeline and/or use an environment variable (`GITLAB_SERVER_COLLECT_ALL_USERS`) 
to reduce duplication and table size, but this is not available in 1.0.2-beta5. 
The underlying code for user collection is here and confirms only offset 
pagination is used 
([source](https://github.com/apache/incubator-devlake/blob/2c15ad880dce830c6512fa58039527cbf4745b8b/backend/plugins/gitlab/tasks/account_collector.go)).
 
   
   For now, if your user base is large, you will hit this limit until a new 
pagination strategy is implemented. Keep an eye on the linked issues for 
updates.
   
   
   <!-- Dosu Comment Footer -->
   *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).*
   
   ---
   <sup>How did I do? 
[Good](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=great_response)
 | 
[Irrelevant](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=irrelevant_answer)
 | 
[Incorrect](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=incorrect_sources)
 | 
[Verbose](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=too_verbose)
 | 
[Hallucination](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=hallucination)
 | [Report 
🐛](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=bug_report)
 | 
[Other](https://app.dosu.dev/response-feedback/ddcb4cee-f3be-4abd-8204-14a07a94effd?feedback_type=other)</sup>&nbsp;&nbsp;[![Chat
 with 
Dosu](https://dosu.dev/dosu-chat-badge.svg)](https://app.dosu.dev/b4e8e847-d479-4541-83a8-d88d83fea5c9/ask?utm_source=githu
 b)&nbsp;[![Join 
Discord](https://img.shields.io/badge/join-5865F2?logo=discord&logoColor=white&label=)](https://go.dosu.dev/discord-bot)&nbsp;[![Share
 on 
X](https://img.shields.io/badge/X-share-black)](https://twitter.com/intent/tweet?text=%40dosu_ai%20helped%20me%20solve%20this%20issue!&url=https%3A//github.com/apache/incubator-devlake/issues/8529)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@devlake.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to