Bullshitbench
Extracting target signal
Bullshitbench
Arcmira media summary
Browse BullshitBench reviews, demos & launch coverage — 1 indexed from AI Engineer, updated Apr 2026.
A benchmark created by Peter Gostev to test if LLMs push back against nonsense questions.
Arcmira tracks 1 indexed media appearances or mentions for BullshitBench, tied to source videos, channels, and transcript-derived context.
Arcmira uses indexed YouTube videos and transcripts. Representative source evidence on this page includes "What Do Models Still Suck At? - Peter Gostev, Arena.ai, BullshitBench" with transcript-derived context and links when available.
BullshitBench is connected to Model Reasoning, LLM benchmarking in Arcmira's media graph.
1
Mentions
446
Views

“A benchmark created by Peter Gostev to test if LLMs push back against nonsense questions.”