LIVE — UPDATED EVERY 2 HOURS

Everything happening in AI, tuned for you

Models · Research · Tools · Safety — one feed, daily

stories today

sources tracked

articles (last 30d)

archived

Boston Children’s uses AI to unlock new diagnoses· 2d agoHow Braintrust turns customer requests into code with Codex· 2d agoStrengthening societal resilience with Rosalind Biodefense· 3d agoProfiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler· 3d agoA shared playbook for trustworthy third party evaluations· 3d agoHow Endava builds an agentic organization with Codex· 3d agoOpenAI’s Frontier Governance Framework· 4d agoMUFG aims to become AI-native with OpenAI· 4d agoITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM· 4d agoCisco and OpenAI redefine enterprise engineering with Codex· 4d agoBuilding self-improving tax agents with Codex· 4d agoReachy Mini goes fully local· 5d agoBoston Children’s uses AI to unlock new diagnoses· 2d agoHow Braintrust turns customer requests into code with Codex· 2d agoStrengthening societal resilience with Rosalind Biodefense· 3d agoProfiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler· 3d agoA shared playbook for trustworthy third party evaluations· 3d agoHow Endava builds an agentic organization with Codex· 3d agoOpenAI’s Frontier Governance Framework· 4d agoMUFG aims to become AI-native with OpenAI· 4d agoITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM· 4d agoCisco and OpenAI redefine enterprise engineering with Codex· 4d agoBuilding self-improving tax agents with Codex· 4d agoReachy Mini goes fully local· 5d ago

Everything happening in AI, tuned for you

Boston Children’s uses AI to unlock new diagnoses

How Braintrust turns customer requests into code with Codex

Strengthening societal resilience with Rosalind Biodefense

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

A shared playbook for trustworthy third party evaluations

How Endava builds an agentic organization with Codex

OpenAI’s Frontier Governance Framework

MUFG aims to become AI-native with OpenAI

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Cisco and OpenAI redefine enterprise engineering with Codex

Building self-improving tax agents with Codex

Reachy Mini goes fully local

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Election information and safeguards in 2026

Warp’s big bet on building open source with GPT-5.5

Claude Sonnet 4.6 tops GPQA Diamond with 81.2%

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

OpenAI, Grupo Folha and Grupo UOL announce strategic content partnership

Stanford: AI agents now match senior radiologists on CT reads

Cursor 1.0 — full codebase awareness + MCP support ships

DeepMind: Constitutional RL cuts harmful outputs by 73%

Mistral 7B v3 — 128k context, fully open weights

Emergent reasoning appears at 30B params: new scaling law

OpenAI launches Realtime API for voice assistants

EU AI Act: first enforcement actions filed against three vendors

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

OpenAI named a Leader in enterprise coding agents by Gartner

How Virgin Atlantic ships faster with Codex

AdventHealth advances whole-person care with OpenAI

An OpenAI model has disproved a central conjecture in discrete geometry

The next phase of OpenAI’s Education for Countries

How Ramp engineers accelerate code review with Codex

Introducing OpenAI for Singapore

OlmoEarth v1.1: A more efficient family of Earth observation models

Advancing content provenance for a safer, more transparent AI ecosystem

Introducing the Ettin Reranker Family

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

The Open Agent Leaderboard

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Unlocking asynchronicity in continuous batching