Scientific Production in the Era of Large Language Models: Early Evidence from Large-scale Preprint Data
The rapid adoption of AI across disciplines is reshaping the landscape of scientific production. While both enthusiasm and concern about generative AI in research are rising, systematic empirical evidence on the impact of large language models (LLMs) remains limited. In this talk, I draw on several large-scale analyses to examine how LLM use affects the productivity of individual scientists, reshapes attention to prior work, introduces hallucinated content into the scientific record, and creates new challenges for peer review. Taken together, these findings provide macro-level evidence on the impact of generative AI on science, highlighting the need for institutions, journals, funding agencies, and the broader public to rethink how scientific work should be evaluated in this new era.
Reading List
Scientific Production in the Era of Large Language Models: Early Evidence from Large-scale Preprint Data
The rapid adoption of AI across disciplines is reshaping the landscape of scientific production. While both enthusiasm and concern about generative AI in research are rising, systematic empirical evidence on the impact of large language models (LLMs) remains limited. In this talk, I draw on several large-scale analyses to examine how LLM use affects the productivity of individual scientists, reshapes attention to prior work, introduces hallucinated content into the scientific record, and creates new challenges for peer review. Taken together, these findings provide macro-level evidence on the impact of generative AI on science, highlighting the need for institutions, journals, funding agencies, and the broader public to rethink how scientific work should be evaluated in this new era.
Reading List