Language models: flawlessly reasoning or confidently hallucinating?

Phone