
Insights on AI, machine learning, and technology strategy

Storing more preferences in ChatGPT and Claude sounds like a productivity win. In practice, contradictory saved memories quietly sabotage your results. Here is what context rot is, why it happens, and a three-step fix you can apply in 30 minutes.

The Trump administration designated Anthropic a national security supply-chain risk, OpenAI signed the Pentagon deal within hours, Google's Gemini 3.1 Pro doubled its ARC-AGI-2 score, and OpenAI closed a $110B funding round. Here's what these moves mean if you're building on any of these platforms.

A game developer cut his LLM runner overhead 10x this week by attacking the roundtrip problem. Here is the teardown, and what it means for anyone building agentic pipelines.

Google’s new Developer Knowledge API and MCP server give small teams a direct path to official docs in AI workflows. Here’s how SMBs can use it to cut rework, ship faster, and reduce support risk.

Imbue has open-sourced Darwinian Evolver, a framework for automatically improving code and prompts. Their ARC-AGI-2 report claims up to 95.1% with Gemini 3.1 Pro and a near-3x lift for open-weight Kimi K2.5. Here is what small and mid-sized businesses can actually do with that signal.

SemiAnalysis projects Claude Code will hit 20%+ of daily GitHub commits by end of 2026. Before you jump in, here is a decision memo on where it earns its cost and where it burns your budget.

Anthropic is shipping two new Claude Code skills that automate PR shepherding and parallel code migrations. One runs after every commit. The other handles work that used to take a week.

A single day delivered an $840B OpenAI valuation move, explicit AI-driven headcount cuts, and migration deadlines that force near-term workflow decisions for agency operators.

A burst of same-day Codex releases turned a noisy model week into a practical operations question: which endpoints should your team trust for production, and which should stay in staging?

Model quality is climbing fast, but operator teams are still shipping fragile systems. The gap is not model intelligence. It is rollout design, latency budgets, and migration hygiene.

The strongest AI teams in 2026 are not picking a winner once and calling it done. They are designing migration windows, model retirement playbooks, and latency-aware routing as core operating muscle.

The last seven days delivered meaningful model upgrades across reasoning, coding, multimodal, and video stacks. The headline is not benchmark theater; it is where teams can cut spend, avoid migration risk, and pick faster pilot lanes.
Dive deeper into the subjects that matter to you

Best practices, tools, and frameworks for building AI applications

News and updates from BaristaLabs

Analysis of AI trends, market developments, and future predictions

Deep dives into ML algorithms, training techniques, and model optimization

Practical AI advice for small and medium enterprises

Step-by-step guides and hands-on coding tutorials