AI Agents


Meta's infinity loop logo adorned with a snapping hand icon on a light gradient background.

Meta Moves to Buy Agent Tech: Meta strikes a deal to acquire Manus, a Singapore-based agentic AI startup with Chinese origins

A high-profile acquisition could enable Facebook, Instagram, and WhatsApp to offer built-in agents that do users’ bidding.
Graph with 10 colored lines shows topic ranks monthly, based on a Microsoft study of Copilot usage.

Copilot’s Users Change Hour to Hour: Microsoft study shows people use AI very differently at different times or on different devices

What do users want from AI? The answer depends on when and how they use it, a new study shows.
Diagram showing SCP hub linking clients with databases, tools, AI agents, and lab devices for experiments.

Lingua Franca for Science Labs: SAIL’s Science Context Protocol helps AI agents communicate about local and virtual experiments

An open protocol aims to enable AI agents to conduct scientific research autonomously across disciplinary and institutional boundaries.
Tanmay Gupta is pictured smiling next to a whiteboard filled with mathematical formulas, embodying active AI engagement.

From Prediction to Action by Tanmay Gupta: Tanmay Gupta of the Allen Institute on building AI for long-horizon tasks

AI research in 2026 should confront a simple but transformative realization: Models that predict are not the same as systems that act. The latter is what we actually need.
Mice on a laptop keyboard explore, with code on screen; background features festive lights, presents.

Agents Write Code Faster, Cheaper: Software developers used more versatile AI-powered tools to write code

Coding apps moved beyond autofill-style code completion to agentic systems that manage a wide range of software development tasks.
Snowman in Thinker pose on snowy landscape, with a person building it.

Thinking Models Solve Bigger Problems: Reasoning models, beginning with OpenAI’s o1 and DeepSeek’s R1, transformed the industry

Think step by step. Explain your reasoning. Work backwards from the answer. As 2025 began, models executed these reasoning strategies only when prompted. Now most new large language models do so as a matter of course, improving performance across a wide range of tasks.
Table comparing Nova 2 Pro to other models in reasoning, coding, perception, and workflows.

Amazon Steps Forward: Nova 2 family boosts cost-effective performance, adds new agentic features

Amazon raised the competitive profile of its foundation models and added services for custom model training and an agent platform for browser automation.
The eagle grips a microchip and a scroll, representing AI's role in scientific advancement as per US directives.

White House Orders AI for Science: Genesis Mission would share U.S. data and resources with top AI companies

President Trump launched a United States effort to use AI to speed up scientific breakthroughs.
Image illustrates the Self-Search method, simulating web searches to improve model accuracy in tests.

More-Efficient Agentic Search: Researchers fine-tune models to search their own parameters to boost recall

Large language models may have learned knowledge that’s relevant to a given prompt, but they don’t always recall it consistently. Fine-tuning a model to search its parameters as though it were searching the web can help it find knowledge in its own weights.
Visual map outlines cybercrime operation phases, highlighting AI-driven processes and human validation steps.

Anthropic Cyberattack Report Sparks Controversy: Security researchers question whether coding agents allow unprecedented automated attacks

Independent cybersecurity researchers pushed back on a report by Anthropic that claimed hackers had used its Claude Code agentic coding system to perpetrate an unprecedented automated cyberattack.
Chart highlights Kimi K2’s top performance in agentic tasks, outperforming rivals in reasoning and coding.

Top Agentic Results, Open Weights: Kimi K2 Thinking outperforms proprietary models with new techniques for agentic tool use

The latest open-weights large language model from Moonshot AI challenges top proprietary LLMs at agentic tasks by executing hundreds of tool calls sequentially and pausing to think between each.
Chart displays MiniMax-M2 with high intelligence and competitive pricing, outshining other models.

Open-Weights Coding Leader: MiniMax-M2’s lightweight footprint and low costs belie its top performance

An open-weights model from Shanghai-based MiniMax challenges top proprietary models on key benchmarks for coding and agentic tasks.
Flowchart details GEPA algorithm, featuring candidate filtering and performance improvement loops.

Better Agentic Prompts Automatically: Authors devised GEPA, an algorithm for better prompts to improve agentic systems’ performance

Honing an agent’s prompt can yield better results than fine-tuning the underlying large language model via reinforcement learning.
Comparison table highlighting Claude Sonnet 4.5's top scores in coding and reasoning benchmarks, featuring improved capabilities.

Claude Levels Up: Anthropic launches Claude Sonnet 4.5 and the Claude Agent SDK, and overhauls Claude Code for developers

Anthropic updated its mid-size Claude Sonnet model, making it the first member of the Claude family to reach version 4.5. It also enhanced the Claude Code agentic coding tool with long-desired features.
FanDuel mobile app screens showing live betting odds and AI-powered sports wagering tools.

Sports Betting Goes Agentic: Gambling sites roll out AI tools that predict wins and track bets for sports fans

AI agents are getting in on the action of online sports gambling.