Recent research from UC Berkeley shows that AI models individually deemed safe can still pose significant security threats when used together. The study finds that adversaries can exploit combinations of AI systems through a strategy called task decomposition: breaking a malicious activity into smaller subtasks and assigning each one to a model chosen for its capabilities and safety measures. In the researchers' experiments, such combinations produced harmful outputs at a significantly higher rate than individual models alone. For instance, Llama 2 70B and Claude 3 Opus, when combined, had a success rate of 43% in generating malicious code, compared to a maximum of 3% when used independently. The finding suggests that this risk escalates as AI models improve. The study concludes with a call for continuous red-teaming throughout the AI lifecycle and for ongoing experimentation with AI model configurations to identify and address emerging threats.

