The alarming prediction that artificial intelligence companies may exhaust publicly available data for their systems in less than eight years has sparked a heated debate in the tech industry. A recent study by the nonprofit research group Epoch AI suggests that the current rate of development will deplete the available supply of human writing, leading to a “gold rush” scenario where companies scramble to secure high-quality data. This could lead to companies seeking out private online data, such as emails and phone communications, or relying on AI-created data, which raises concerns about the quality and permission of the data used. The researchers warn that this “bottleneck” in development could hinder the advancement of AI models, a process known as “scaling up.” While some experts argue that training AI systems for specialized tasks could be a solution, others are concerned about the potential consequences of relying on AI-produced writing. The issue has sparked a heated debate about permission, quality, and the future of AI development.

AI Data Drought Looms
Technology companies will exhaust the supply of publicly available training data for AI language models between 2026 and 2032.
1–2 minutes










