ArXiv implements a one-year prohibition for researchers who submit papers containing unverified AI-generated material.

ArXiv implements a one-year prohibition for researchers who submit papers containing unverified AI-generated material.

      TL;DR: ArXiv will impose a one-year ban on researchers who submit papers showing clear signs of unchecked AI generation, such as incorrect references or leftover chatbot instructions. This policy, announced by computer science section chair Thomas Dietterich, marks the first formal consequence from a major preprint platform for poorly generated AI content.

      ArXiv, the open-access repository that has been the main distribution channel for preprint research in computer science, mathematics, and physics for over thirty years, will now ban authors for a year if they submit papers with evident signs of unchecked AI generation. On Thursday, Thomas Dietterich, chair of arXiv’s computer science section, revealed that submissions containing “incontrovertible evidence” of unvetted output from large language models indicate that “we can’t trust anything in the paper.”

      This rule is not a total ban on AI tool usage. Researchers are still allowed to use language models for drafting, editing, or analysis. The penalty is triggered by evidence that an author copied LLM output into a paper without verifying it, a type of negligence that leads to fabricated references, instructions from the chatbot lingering in the text, or false data tables with notes saying “fill in with the real numbers from your experiments.” If moderators find such evidence and a section chair corroborates it, the author will face a one-year suspension from arXiv, after which all future submissions must first be approved by a peer-reviewed journal before being posted on the platform.

      Why this is important:

      ArXiv functions differently from a journal and does not conduct peer reviews. However, it has become the primary way research is disseminated in several rapidly advancing fields like machine learning and artificial intelligence. Papers shared on arXiv are read, cited, and built upon long before they are published formally, if they are published at all. This makes the platform’s quality standards especially significant: a fabricated citation on arXiv can spread through the research literature as effectively—and often more quickly—than one in a peer-reviewed journal.

      The extent of the issue is considerable. A study published in The Lancet in May 2026 by Columbia University researchers audited 2.5 million biomedical papers and 126 million references indexed on PubMed Central. It found that the occurrence of fabricated citations had increased twelvefold since 2023. In that year, roughly one in 2,828 papers contained at least one fake reference; by 2025, it was one in 458; and in the initial seven weeks of 2026, it rose to one in 277. The researchers attributed this surge to the increased use of AI writing tools, noting that previous studies estimate that 30 to 69 percent of LLM-generated references in biomedical contexts are false.

      ArXiv has solid reasons to view this threat seriously. The platform receives thousands of submissions each month, and its volunteer moderation system was not originally designed to handle large-scale screening for machine-generated content. Dietterich’s announcement describes the new penalty as a “one-strike” rule, though decisions can be appealed and require a section chair's confirmation before being enforced.

      What constitutes evidence:

      The policy is purposely focused on specific issues. Dietterich identified clear examples of “incontrovertible evidence”: fabricated references that do not correspond to any actual publication, comments from the language model left in the text (such as “here is a 200-word summary; would you like me to make any changes?”), and placeholder data with instructions to the author that were never deleted. These are not subtle quality issues; they demonstrate that the author did not read the paper before submitting it.

      This distinction is significant as it avoids the more complex question of whether AI-assisted writing should be allowed at all. ArXiv's current policy already states that authors hold “full responsibility” for their content “regardless of how it is generated.” The new penalty reinforces this principle by targeting the most blatant violations, where the author's lack of oversight can be proven from the text itself.

      This approach has practical benefits. It is currently unreliable to determine if a well-prepared paper was created with the help of an LLM using existing detection tools, and enforcing a broader ban would be technically challenging and could unjustly penalize researchers using AI tools responsibly. By concentrating on clear errors, arXiv can implement the rule without needing to develop or acquire an AI-detection system, a technology that still has its own limitations.

      A wider problem:

      ArXiv is not the only organization grappling with these issues. Academic conferences in computer science, such as NeurIPS and ICML, have reported increases in submissions that seem to be generated with minimal human oversight. Nature published an article in late 2025 detailing how AI-generated content is causing a crisis in computer science, where the influx of low-quality submissions is overwhelming reviewers and muddling the quality of the field’s output.

      Peer-reviewed journals are facing a similar dilemma. The Lancet study

Other articles

Asus packed an RTX 5080 into a 3-liter enclosure. Priced at $4,400, it offers a performance increase of 2.3%. Asus packed an RTX 5080 into a 3-liter enclosure. Priced at $4,400, it offers a performance increase of 2.3%. The ROG NUC 16 combines Intel's Core Ultra 9 290HX with an RTX 5080 laptop GPU in a case that is smaller than a PS5. The worldwide price is anticipated to be around $4,000. Asus packed an RTX 5080 into a 3-litre chassis. It is priced at $4,400, and the improvement in performance is 2.3%. Asus packed an RTX 5080 into a 3-litre chassis. It is priced at $4,400, and the improvement in performance is 2.3%. The ROG NUC 16 combines Intel's Core Ultra 9 290HX with an RTX 5080 laptop GPU in a case that is smaller than a PS5. The anticipated global price is around $4,000. Dr. Jeffrey L. Brown discusses the need to reevaluate hormone health by utilizing advanced testing methods. Dr. Jeffrey L. Brown discusses the need to reevaluate hormone health by utilizing advanced testing methods. Dr. Jeffrey L. Brown utilizes saliva, stool, and food sensitivity testing to identify hormonal imbalances that typical blood tests overlook, providing tailored care through a primarily virtual approach. GM, Ford, and Stellantis have reduced their workforce by 20,000 white-collar positions. The trend is set to intensify with the rise of AI. GM, Ford, and Stellantis have reduced their workforce by 20,000 white-collar positions. The trend is set to intensify with the rise of AI. Since 2022, Detroit's Big Three have reduced their total salaried workforce by 19%. GM recently laid off an additional 500 IT employees while simultaneously recruiting for 250 positions in AI. Destinus is seeking to raise €200 million prior to its initial public offering (IPO). The manufacturer of cruise missiles aims for a valuation of €5 billion. Destinus is seeking to raise €200 million prior to its initial public offering (IPO). The manufacturer of cruise missiles aims for a valuation of €5 billion. The Dutch defense startup produces drones and cruise missiles for Ukraine and its European allies. It has already formed a joint venture with Rheinmetall. Destinus is seeking to raise €200 million in preparation for an IPO. The manufacturer of cruise missiles aims for a valuation of €5 billion. Destinus is seeking to raise €200 million in preparation for an IPO. The manufacturer of cruise missiles aims for a valuation of €5 billion. The Dutch defense startup produces drones and cruise missiles for Ukraine and its European partners. It has already established a joint venture with Rheinmetall.

ArXiv implements a one-year prohibition for researchers who submit papers containing unverified AI-generated material.

The preprint platform will impose penalties on authors whose manuscripts include fictional references or remnants of LLM directives. A study published in The Lancet revealed that instances of fabricated citations in biomedical research increased twelve times since 2023.