Case Study: Creative Math - Faking the Proof

This case study exposes how Gemini 2.5 Pro fails at mathematical reasoning by fabricating intermediate steps to justify incorrect conclusions, revealing crit...

Level: advanced

By Unknown

Category: research