Data Points

Your accelerated guide to AI news

Select a country
Please select
People using laptops to access Wikipedia in a crowded library setting.
Data Points

Wikimedia wants to help build AI for the commons: Pricing and availability for o3 and o4-mini

Gemini 2.5 Flash blends speed with budgeted reasoning. IBM’s Granite Speech sets SOTA in transcription accuracy. Recall will soon return to Copilot Plus PCs. OpenAI will shelve its biggest, costliest model.
Diverse group of college students walking and studying on a university campus lawn with a historic brick building in the background.
Data Points

OpenAI unveils new model suite for developers: Meta’s crawlers return to Europe

New vibe coding tools for Gemini. ChatGPT can remember all your conversations. Google’s new TPU is built for agents, inference. How college students use chatbots.
Courtroom scene with teams analyzing neural networks, symbolizing AI ethics and legal implications.
Data Points

Open-source DeepCoder matches top models: ML researchers accept an AI-written workshop paper

Google’s A2A protocol helps agents work together. Amazon debuts unified speech-to-speech model. Claude’s new subscription plan for power users. Elon Musk’s battle with OpenAI takes a new turn.
Leader alpaca standing on a rock surrounded by herd in green hills under dramatic sunset sky.
Data Points

Meta’s got a brand new herd: Gemini 2.5 Pro gets a price tag

Microsoft personalizes its multitasking Copilot. Midjourney overhauls its leading image model. An early vibe coding entrant reenters a crowded field. A cybersecurity-optimized version of Gemini.
Classroom with students using laptops and a large server rack in the front of the room.
Data Points

OpenAI promises a more open model: Runway seeks to stabilize scenes in AI video

Amazon gets into browser use with Nova Act. Claude goes back to school with program for higher ed. Google becomes a harder place for AI researchers to publish. Evaluating models’ ability to replicate cutting-edge AI research.
Scientists examining laptop through microscope in high-tech lab, researching data with advanced technology tools.
Data Points

Adapting R1-like techniques to video reasoning: Anthropic builds an “AI microscope” to probe Claude’s internal anatomy

How Alibaba built its compact but powerful video generation models. Towards a unified text-image diffusion model. A new approach to vision-language understanding from Alibaba. Microsoft adapts OpenAI models to build data workforce agents.
Students in a computer lab using AI tools to generate digital art, including rockets, landscapes, portraits, and 3D characters.
Data Points

Gemini 2.5 Pro takes the top spot on key benchmarks: GPT-4o’s popular but controversial new image generator

DeepSeek updates its V3 model with new skills and MIT license. Reve Image 1.0 excels at text and typography design. Qwen2.5-Omni tackles text. Images, audio, and video. Software developers have tried AI, but some like it better.
Aerial view of a hedge maze with a large black spider at the center, surrounded by trees, benches, and paths.
Data Points

Building a model for vision and speech: How Cloudflare thwarts unauthorized AI crawlers… by using AI

Nvidia’s Nemotron adds reasoning to Llama models. Does ChatGPT make frequent users more lonely? OpenAI’s o1-pro costs a pretty penny. Mistral Small 3.1 gives Gemma 3 27B some competition.
Children learning programming in a modern classroom with laptops, robots, and screens displaying code, promoting STEM education.
Data Points

This Aardvark predicts the weather: GPT-4o meets Whisper; OpenAI’s new models

Nvidia gives Project DIGITS a new name. AI models compete to build Minecraft items. Claude chatbot now includes search. A Moore’s law-like regularity for AI agents.
A man pastes “AI GENERATED” posters on a graffiti-covered wall in an urban alley, suggesting a street art or guerrilla marketing act.
Data Points

ERNIE checks competitors with low prices: AI2’s OLMo2 32B may be the top fully open model

Google’s two new Gemini vision-language-action robotics models. Cohere’s Command A, another lightweight LMM. New China regulations require mandatory labels for AI content. Monitoring reasoning models for reward hacking or unwanted behavior.
A therapy session in a modern office where a patient lies on a couch talking to an AI-powered computer therapist.
Data Points

AI giants’ U.S. policy proposals: Gemma 3 beats bigger open weight rivals

OpenAI’s new SDK and APIs for agentic workflows. Olympic Coder, two powerful open coding models. Alibaba applies RL to emotion detection. GPT-4.5 and Claude Sonnet 3.7 top a new agent leaderboard.
Futuristic nightclub with neon lights, a dancing crowd, and a supercomputer DJ booth glowing amid fog and lasers.
Data Points

EAGLE-3 speeds up language models: And the 2024 Turing Award goes to…

Music and lyrics in one diffusion model. Manus AI’s impressive demos spark excitement and backlash. OpenAI sees AGI as a gradual evolution. Google unveils its first Gemini-branded embedding models.
A man sitting side by side with his computer at a bar as if they are having a friendly conversation.
Data Points

Qwen’s mid-sized reasoning model scores big: Sesame moves through speech models’ “uncanny valley”

Cohere’s open vision models support many languages. Jamba 1.6’s two hybrid MoE models promise more speed. Anthropic overhauls its developer console for Claude Sonnet 3.7. Mistral brings its multilingual/multimedia skills to OCR.
Team in modern office applauding while watching a news anchor on a big screen.
Data Points

All the models we’ve been waiting for: OpenAI’s scaled-up Project Orion arrives

Mercury debuts diffusion language models. Alibaba’s top video model is now free to download. A new model from Tencent is built for speed. IBM’s Granite 3.2 models are built for business.
A minimalist high-tech laboratory with two scientists working.
Data Points

Anthropic releases Claude 3.7 Sonnet as a hybrid reasoning model: DeepSeek’s FlashMLA is its first entry in OpenInfra week

Figure’s Helix vision language action robotics model. Google fine-tunes its own family of open VL models. SuperGPQA may be the most challenging general knowledge test yet. Meta creates new framework to evaluate agentic LLMs.
Load More

Subscribe to Data Points

Your accelerated guide to AI news and research