LLMs’ “Bullsh*t” problem, DARPA, and testing for nonsense

Defence research agency looks to challenge the "fundamental gaps between state-of-the-art AI systems and national security applications"

Catherine Sarisky

Jun 18, 2024 - 4 min read

The tendency of large language models (LLMs) to “hallucinate” continues to trouble CIOs eyeing production use cases, even as efforts around fine-tuning and retrieval-augmented generation (RAG) optimisations persist.
