

ArXiv has moved to tighten its rules on careless use of large language models in research papers. The open-access preprint platform said authors must check all content before submitting work, even when they use AI tools during writing or research.
The policy does not block researchers from using large language models. However, it makes authors fully responsible for errors, fake citations, copied text, or unchecked AI output. The move comes as research platforms face rising pressure to control low-quality papers linked to automated writing tools.
ArXiv serves as a major preprint repository for fields such as computer science, mathematics, physics, and related research areas. Researchers use the platform to share papers before formal peer review, which makes its moderation process important for early research circulation.
Thomas Dietterich, chair of arXiv’s computer science section, said submissions with clear signs of unchecked large language model output create trust concerns. He wrote that if a paper contains 'incontrovertible evidence' that authors failed to review AI-generated results, arXiv cannot trust the paper’s content.
That evidence may include 'hallucinated references' or leftover comments written to or from an AI model. These signs suggest that authors may have copied AI output without proper review. ArXiv said such cases can damage the reliability of research hosted on the platform.
Under the new rule, authors may face a one-year ban if arXiv finds clear proof that they submitted unchecked AI-generated content. After the ban, future submissions from those authors must first receive acceptance from a reputable peer-reviewed venue before appearing on arXiv.
Dietterich described the rule as a 'one-strike' policy. However, arXiv will not apply the penalty through an automatic process. Moderators must first flag the issue, and section chairs must then confirm the evidence before any ban takes effect.
Authors will also have the right to appeal the decision. This process gives researchers a way to challenge penalties if they believe arXiv made an error. Still, the policy sends a direct message that researchers cannot shift responsibility to AI tools.
ArXiv has not banned large language models from the research process. Researchers may still use AI tools to edit text, improve structure, or support their work. However, they must check every claim, reference, and result before submission.
Dietterich said authors carry 'full responsibility' for paper content, 'irrespective of how the contents are generated.' That means researchers remain accountable for plagiarized text, biased content, errors, incorrect references, or misleading claims if they submit them.
This rule places responsibility on authors rather than on the tools. Therefore, researchers who use AI must treat the output as draft material that needs careful checking. They cannot submit AI-generated material without review and expect arXiv to treat it as reliable.
ArXiv has already taken steps to control low-quality submissions. One measure requires first-time submitters to receive an endorsement from an established author before posting papers. The system aims to reduce spam and weak submissions before they enter public circulation.
The repository has also started a transition into an independent nonprofit after more than two decades under Cornell University. This change may help arXiv raise more funding for moderation, technology upgrades, and quality controls.
Recent peer-reviewed research has also reported more fabricated citations in biomedical papers, with large language models seen as a likely factor. As AI tools become more common, research platforms now face growing pressure to protect the accuracy of scientific records.
Also Read: OnePlus Halts OxygenOS 16 Rollout: What Went Wrong