The landscape of generative AI is undergoing significant changes as data owners, artists, and record labels push back against the unregulated use of their content for AI training. Major AI companies have historically scraped data from the internet with little regard for copyright, leading to a series of legal challenges and changing industry practices. Recently, three major record labels—Sony Music, Warner Music Group, and Universal Music Group—filed lawsuits against AI music companies Suno and Udio for allegedly using copyrighted music in their training data. These legal battles highlight a critical shift in the generative AI sector, where the scarcity of high-quality data gives data owners newfound leverage.
AI companies are now exploring licensing deals and partnerships to legally obtain the data they need. OpenAI has made agreements with several news publishers, while YouTube is offering licensing deals to top record labels. However, these arrangements raise concerns about the accuracy of AI-generated content and the potential concentration of power among the wealthiest companies. Despite these challenges, the introduction of consent and fair use into the AI training process could lead to a more equitable data economy, benefiting rights holders and the general public alike.











