Reasoning Models

OpenAI’s New Scorecard Measures AI’s Business Value by Completed Work

Jul 20, 2026

•

13 min read

OpenAI’s New Scorecard Measures AI’s Business Value by Completed Work

A low token price can become expensive when employees still have to review, correct, retry, rework, or finish the work AI was meant to complete.

Alicia Shapiro

Reasoning Models

ARC-AGI-3 Benchmark Shows Top AI Models Fail at General Reasoning, Scoring Below 1%

Mar 26, 2026

•

15 min read

ARC-AGI-3 Benchmark Shows Top AI Models Fail at General Reasoning, Scoring Below 1%

A new benchmark is forcing a hard question: are today’s most advanced AI models actually capable of reasoning—or just very good at pattern recognition?

Alicia Shapiro

Reasoning Models

Gemini 3.1 Pro Improves AI Reasoning Across Google’s Ecosystem, Targeting Complex Enterprise and Developer Workflows

Feb 20, 2026

•

8 min read

Gemini 3.1 Pro Improves AI Reasoning Across Google’s Ecosystem, Targeting Complex Enterprise and Developer Workflows

Google’s newest Gemini model focuses on deeper reasoning — not just faster answers — as AI competition shifts toward intelligence quality.

Alicia Shapiro

Reasoning Models

Google and OpenAI Models Outperform Humans at ICPC Coding Finals

Sep 18, 2025

•

11 min read

Google and OpenAI Models Outperform Humans at ICPC Coding Finals

OpenAI’s GPT-5 achieved a perfect score of 12/12, while Google’s Gemini 2.5 Deep Think solved 10/12, ranking second overall at the 2025 ICPC World Finals.

Alicia Shapiro

Reasoning Models

OpenAI’s BrowseComp Tests AI Browsing Skills on Hard-to-Find Questions

Apr 11, 2025

•

8 min read

OpenAI’s BrowseComp Tests AI Browsing Skills on Hard-to-Find Questions

Alicia Shapiro

Reasoning Models

Amazon to Launch ‘Reasoning’ AI Model to Rival OpenAI & Anthropic

Mar 6, 2025

•

4 min read

Amazon to Launch ‘Reasoning’ AI Model to Rival OpenAI & Anthropic

Alicia Shapiro

Reasoning Models

OpenAI’s New Scorecard Measures AI’s Business Value by Completed Work

ARC-AGI-3 Benchmark Shows Top AI Models Fail at General Reasoning, Scoring Below 1%

Gemini 3.1 Pro Improves AI Reasoning Across Google’s Ecosystem, Targeting Complex Enterprise and Developer Workflows

Google and OpenAI Models Outperform Humans at ICPC Coding Finals

OpenAI’s BrowseComp Tests AI Browsing Skills on Hard-to-Find Questions

Amazon to Launch ‘Reasoning’ AI Model to Rival OpenAI & Anthropic

AiNews.com