ArXiv implements a one-year prohibition for researchers who submit papers containing unverified AI-generated material.
TL;DR: ArXiv will impose a one-year ban on researchers who submit papers showing clear signs of unchecked AI generation, such as incorrect references or leftover chatbot instructions. This policy, announced by computer science section chair Thomas Dietterich, marks the first formal consequence from a major preprint platform for poorly generated AI content.
ArXiv, the open-access repository that has been the main distribution channel for preprint research in computer science, mathematics, and physics for over thirty years, will now ban authors for a year if they submit papers with evident signs of unchecked AI generation. On Thursday, Thomas Dietterich, chair of arXiv’s computer science section, revealed that submissions containing “incontrovertible evidence” of unvetted output from large language models indicate that “we can’t trust anything in the paper.”
This rule is not a total ban on AI tool usage. Researchers are still allowed to use language models for drafting, editing, or analysis. The penalty is triggered by evidence that an author copied LLM output into a paper without verifying it, a type of negligence that leads to fabricated references, instructions from the chatbot lingering in the text, or false data tables with notes saying “fill in with the real numbers from your experiments.” If moderators find such evidence and a section chair corroborates it, the author will face a one-year suspension from arXiv, after which all future submissions must first be approved by a peer-reviewed journal before being posted on the platform.
Why this is important:
ArXiv functions differently from a journal and does not conduct peer reviews. However, it has become the primary way research is disseminated in several rapidly advancing fields like machine learning and artificial intelligence. Papers shared on arXiv are read, cited, and built upon long before they are published formally, if they are published at all. This makes the platform’s quality standards especially significant: a fabricated citation on arXiv can spread through the research literature as effectively—and often more quickly—than one in a peer-reviewed journal.
The extent of the issue is considerable. A study published in The Lancet in May 2026 by Columbia University researchers audited 2.5 million biomedical papers and 126 million references indexed on PubMed Central. It found that the occurrence of fabricated citations had increased twelvefold since 2023. In that year, roughly one in 2,828 papers contained at least one fake reference; by 2025, it was one in 458; and in the initial seven weeks of 2026, it rose to one in 277. The researchers attributed this surge to the increased use of AI writing tools, noting that previous studies estimate that 30 to 69 percent of LLM-generated references in biomedical contexts are false.
ArXiv has solid reasons to view this threat seriously. The platform receives thousands of submissions each month, and its volunteer moderation system was not originally designed to handle large-scale screening for machine-generated content. Dietterich’s announcement describes the new penalty as a “one-strike” rule, though decisions can be appealed and require a section chair's confirmation before being enforced.
What constitutes evidence:
The policy is purposely focused on specific issues. Dietterich identified clear examples of “incontrovertible evidence”: fabricated references that do not correspond to any actual publication, comments from the language model left in the text (such as “here is a 200-word summary; would you like me to make any changes?”), and placeholder data with instructions to the author that were never deleted. These are not subtle quality issues; they demonstrate that the author did not read the paper before submitting it.
This distinction is significant as it avoids the more complex question of whether AI-assisted writing should be allowed at all. ArXiv's current policy already states that authors hold “full responsibility” for their content “regardless of how it is generated.” The new penalty reinforces this principle by targeting the most blatant violations, where the author's lack of oversight can be proven from the text itself.
This approach has practical benefits. It is currently unreliable to determine if a well-prepared paper was created with the help of an LLM using existing detection tools, and enforcing a broader ban would be technically challenging and could unjustly penalize researchers using AI tools responsibly. By concentrating on clear errors, arXiv can implement the rule without needing to develop or acquire an AI-detection system, a technology that still has its own limitations.
A wider problem:
ArXiv is not the only organization grappling with these issues. Academic conferences in computer science, such as NeurIPS and ICML, have reported increases in submissions that seem to be generated with minimal human oversight. Nature published an article in late 2025 detailing how AI-generated content is causing a crisis in computer science, where the influx of low-quality submissions is overwhelming reviewers and muddling the quality of the field’s output.
Peer-reviewed journals are facing a similar dilemma. The Lancet study
Other articles
ArXiv implements a one-year prohibition for researchers who submit papers containing unverified AI-generated material.
The preprint platform will impose penalties on authors whose manuscripts include fictional references or remnants of LLM directives. A study published in The Lancet revealed that instances of fabricated citations in biomedical research increased twelve times since 2023.
