LIVE — UPDATED EVERY 2 HOURS

Everything happening in AI, tuned for you

Models · Research · Tools · Safety — one feed, daily

41
stories today
5+
sources tracked
41
articles (last 30d)
0
archived
Boston Children’s uses AI to unlock new diagnoses· 2d agoHow Braintrust turns customer requests into code with Codex· 2d agoStrengthening societal resilience with Rosalind Biodefense· 3d agoProfiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler· 3d agoA shared playbook for trustworthy third party evaluations· 3d agoHow Endava builds an agentic organization with Codex· 3d agoOpenAI’s Frontier Governance Framework· 4d agoMUFG aims to become AI-native with OpenAI· 4d agoITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM· 4d agoCisco and OpenAI redefine enterprise engineering with Codex· 4d agoBuilding self-improving tax agents with Codex· 4d agoReachy Mini goes fully local· 5d agoBoston Children’s uses AI to unlock new diagnoses· 2d agoHow Braintrust turns customer requests into code with Codex· 2d agoStrengthening societal resilience with Rosalind Biodefense· 3d agoProfiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler· 3d agoA shared playbook for trustworthy third party evaluations· 3d agoHow Endava builds an agentic organization with Codex· 3d agoOpenAI’s Frontier Governance Framework· 4d agoMUFG aims to become AI-native with OpenAI· 4d agoITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM· 4d agoCisco and OpenAI redefine enterprise engineering with Codex· 4d agoBuilding self-improving tax agents with Codex· 4d agoReachy Mini goes fully local· 5d ago
General

Boston Children’s uses AI to unlock new diagnoses

2d ago · OpenAI
General

How Braintrust turns customer requests into code with Codex

2d ago · OpenAI
General

Strengthening societal resilience with Rosalind Biodefense

3d ago · OpenAI
General

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

3d ago · Hugging Face
General

A shared playbook for trustworthy third party evaluations

3d ago · OpenAI
General

How Endava builds an agentic organization with Codex

3d ago · OpenAI
General

OpenAI’s Frontier Governance Framework

4d ago · OpenAI
General

MUFG aims to become AI-native with OpenAI

4d ago · OpenAI
General

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

4d ago · Hugging Face
General

Cisco and OpenAI redefine enterprise engineering with Codex

4d ago · OpenAI
General

Building self-improving tax agents with Codex

4d ago · OpenAI
General

Reachy Mini goes fully local

5d ago · Hugging Face
General

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

5d ago · Hugging Face
General

Election information and safeguards in 2026

5d ago · OpenAI
General

Warp’s big bet on building open source with GPT-5.5

5d ago · OpenAI
Model

Claude Sonnet 4.6 tops GPQA Diamond with 81.2%

Anthropic's latest model beats human experts on graduate-level science questions across physics, chemistry and biology. The model achieves state-of-the-art results without sacrificing speed.

7d ago · Anthropic
General

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

7d ago · Hugging Face
General

OpenAI, Grupo Folha and Grupo UOL announce strategic content partnership

7d ago · OpenAI
Research

Stanford: AI agents now match senior radiologists on CT reads

A multi-agent system evaluated 12,000 chest CTs achieving 94.3% accuracy, on par with board-certified radiologists. The system uses vision models and domain-specific reasoning chains.

7d ago · Stanford HAI
Tool

Cursor 1.0 — full codebase awareness + MCP support ships

After 18 months in beta, Cursor hits 1.0 with whole-repo indexing, natural language refactors, and MCP tool integration. The release includes a redesigned agent mode.

7d ago · Cursor
Safety

DeepMind: Constitutional RL cuts harmful outputs by 73%

New training method embeds 58 constitutional principles directly into the reward model, reducing policy violations without sacrificing capability. The approach generalises across model sizes.

7d ago · DeepMind
Model

Mistral 7B v3 — 128k context, fully open weights

Mistral drops their best open model yet: 128k token context, Apache 2.0 license, and scores that rival GPT-4o-mini. Optimised for long-document tasks and RAG applications.

7d ago · Mistral AI
Research

Emergent reasoning appears at 30B params: new scaling law

MIT CSAIL paper identifies a sharp phase transition in reasoning ability at ~30B parameters, challenging previous compute-optimal training assumptions. Key implications for resource allocation.

7d ago · MIT CSAIL
Tool

OpenAI launches Realtime API for voice assistants

The Realtime API enables sub-300ms voice-to-voice latency, making responsive voice assistants practical. Supports interruption handling and speaker diarization out of the box.

7d ago · OpenAI
Safety

EU AI Act: first enforcement actions filed against three vendors

European regulators filed the first formal enforcement actions under the EU AI Act against vendors of high-risk AI systems. Cases involve medical AI and recruitment screening tools.

8d ago · EU Commission
General

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

9d ago · Hugging Face
General

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

9d ago · Hugging Face
General

OpenAI named a Leader in enterprise coding agents by Gartner

10d ago · OpenAI
General

How Virgin Atlantic ships faster with Codex

10d ago · OpenAI
General

AdventHealth advances whole-person care with OpenAI

10d ago · OpenAI
General

An OpenAI model has disproved a central conjecture in discrete geometry

12d ago · OpenAI
General

The next phase of OpenAI’s Education for Countries

12d ago · OpenAI
General

How Ramp engineers accelerate code review with Codex

12d ago · OpenAI
General

Introducing OpenAI for Singapore

12d ago · OpenAI
General

OlmoEarth v1.1: A more efficient family of Earth observation models

12d ago · Hugging Face
General

Advancing content provenance for a safer, more transparent AI ecosystem

12d ago · OpenAI
General

Introducing the Ettin Reranker Family

13d ago · Hugging Face
General

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

13d ago · Hugging Face
General

The Open Agent Leaderboard

13d ago · Hugging Face
General

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

17d ago · Hugging Face
General

Unlocking asynchronicity in continuous batching

18d ago · Hugging Face