May 7, 2025

6 Posts

Blue performance gauge with needle pointing to maximum, indicating high level or peak performance.
May 7, 2025

Hot Tips for Speedy Startups: Speed is the most important factor in successful startups. Here are four ways to accelerate your company.

I’m delighted to announce that AI Fund has closed $190M for our new fund, in an oversubscribed round.
Blue performance gauge with needle pointing to maximum, indicating high level or peak performance.
May 7, 2025

ChatGPT Grovels, Qwen3 Takes on DeepSeek-R1, Johnson & Johnson Reveals AI Strategy, Easy Reasoning Hack

The Batch AI News and Insights: I’m delighted to announce that AI Fund has closed $190M for our new fund, in an oversubscribed round.
Chart showing LLM accuracy increasing with reasoning tokens across math and science benchmarks like AIME24 and GPQA.
May 7, 2025

One Weird Trick for Better Reasoning: Researchers fine-tune LLM for reasoning with only 1,000 examples

Researchers showed that supervised fine-tuning on as few as 1,000 examples can enable a pretrained large language model to reason — and a clever gambit can boost its performance to rival that of top reasoning models.
Gloved hand holds Johnson & Johnson vaccine vial with syringe, representing pharmaceutical and vaccination concepts.
May 7, 2025

AI Insights from Big Pharma: Johnson & Johnson reveals its revised AI strategy

The world’s biggest pharmaceutical company by revenue shed light on its AI strategy.
Man at desk overwhelmed by robot coworkers in office setting with city and tree views.
May 7, 2025

The User Is Always… a Genius!: OpenAI pulls GPT-4o update after users report sycophantic behavior

OpenAI’s most widely used model briefly developed a habit of flattering users, with laughable and sometimes worrisome results.
LLM performance benchmark table comparing Qwen, OpenAI, Gemini, and others on coding, math, and language tasks.
May 7, 2025

Qwen3 Takes On DeepSeek-R1: Alibaba releases the Qwen3 family of open LLMs with optional reasoning

Alibaba’s new model family may unseat DeepSeek-R1’s four-month reign as the top open-weights large language model.
