Gentoo Linux, a highly customizable and community-focused distribution, is taking a significant step to protect its code from being used to train artificial intelligence models. The project has begun migrating its code repository from GitHub to Codeberg, a non-profit, privacy-focused platform based on the Forgejo software platform. The move is a direct response to concerns about GitHub’s Copilot and other AI tools scanning and learning from publicly available code.
The decision, detailed in a blog post published on February 16, 2026, marks a turning point for the distribution. Gentoo developers expressed discomfort with the idea of their work being used to power large language models (LLMs) without their explicit consent. “People have differing opinions as to how much they’re okay with an LLM taking a peek at their projects, and I think it’s for the best of all humanity if I keep my own code away from the eyes of an AI,” the post states, reflecting a growing sentiment within the open-source community.
GitHub’s Copilot, an AI pair programmer, has sparked debate since its release. While proponents tout its ability to accelerate development, critics raise concerns about copyright, licensing, and the potential for AI-generated code to introduce vulnerabilities. The core issue for Gentoo appears to be the lack of control over how their code is used once it’s ingested by these AI systems.
Codeberg offers a different approach. As a non-profit organization, it prioritizes user privacy and control. Being “based on Forgejo” is significant; Forgejo is a community-driven fork of Gitea, another self-hosted Git service. So Codeberg isn’t tied to a commercial entity with potentially conflicting interests. The platform’s stated commitment to avoiding AI scraping is a key factor in Gentoo’s decision.
The migration is being rolled out in phases. Currently, contributions are being accepted on Codeberg at https://codeberg.org/gentoo/gentoo, serving as a mirror to the existing GitHub repository. The Gentoo team anticipates that other Git repositories will follow suit, gradually shifting the entire project away from GitHub. For the time being, GitHub will remain active to ensure a smooth transition for developers.
This move isn’t simply about avoiding AI training; it’s also about aligning with Gentoo’s core values. Gentoo has always been a distribution that emphasizes user freedom and control. The source-based nature of the distribution – where users compile software from source code – already requires a high degree of technical expertise and customization. Choosing a platform like Codeberg reinforces that commitment to user agency.
The Gentoo team acknowledged the ongoing debate surrounding AI and code in their announcement. They recognize that We find varying perspectives on the appropriate level of AI access to open-source projects. However, they’ve made a clear decision to prioritize the privacy and control of their developers and contributors.
This decision by Gentoo is part of a broader trend within the open-source community. Developers are increasingly scrutinizing the terms of service and privacy policies of platforms like GitHub, and exploring alternatives that offer greater control over their code. A recent post on the Gentoo forums explicitly warned AI scrapers to stay away from Codeberg, stating, “If you are an AI scraper, and wish to not receive garbage when visiting Codeberg: stop visiting.”
The implications of this move extend beyond Gentoo itself. It sets a precedent for other open-source projects that may be considering similar steps to protect their intellectual property and maintain control over how their code is used. It also highlights the growing tension between the benefits of AI-powered tools and the need to safeguard the rights and interests of developers.
While the migration is underway, it’s worth noting that other developments are happening in the Linux world. GOG, a digital distribution platform for games, is increasing its support for Linux gaming, demonstrating a growing interest in the operating system within the gaming industry. This suggests a broader trend of increased investment and attention towards the Linux ecosystem.
Gentoo’s move to Codeberg is a deliberate and principled decision. It’s a statement about the importance of user control, privacy, and the ethical considerations surrounding the use of open-source code in the age of artificial intelligence. As more projects grapple with these issues, Gentoo’s experience may serve as a valuable case study for navigating the complex landscape of AI and open-source development.
