Claude Opus 4.5 is now OUTPERFORMING Claude Opus 4.6 on BridgeBench Hallucination.
Read that again.
The legacy model is beating the current flagship.
We benchmarked Opus 4.5 this morning to confirm what we saw yesterday.
Claude Opus 4.6 fell from #2 to #10 with a 98%
Grok 4.20 Reasoning just took #1 on the new BridgeBench Reasoning benchmark.
Beating GPT-5.4 and Claude Opus 4.6.
This model keeps climbing every single week.
Hallucination #1.
Now Reasoning #1.
While Anthropic is throwing 500 errors, xAI is quietly building the most
Grok 4.20 takes the #1 spot on BridgeBench
Outperforming GPT-5.4, Claude Opus 4.6, and Gemini.
It just keeps climbing