Machine Learning Research

376 Posts

Grounding DINO animation depicting object detection with bounding boxes on images.
Machine Learning Research

Object Detection for Small Devices: Grounding DINO 1.5, an edge device model built for faster, smarter object detection

An open source model is designed to perform sophisticated object detection on edge devices like phones, cars, medical equipment, and smart doorbells.
Bar charts comparing performance of AI models across six tasks.
Machine Learning Research

Reasoning Revealed: DeepSeek-R1, a transparent challenger to OpenAI o1

An up-and-coming Hangzhou AI lab unveiled a model that implements run-time reasoning similar to OpenAI o1 and delivers competitive performance. Unlike o1, it displays its reasoning steps.
Efficient Foundations animation showing layered AI model components.
Machine Learning Research

More-Efficient Training for Transformers: Researchers reduce transformer training costs by 20% with minimal performance loss

Researchers cut the processing required to train transformers by around 20 percent with only a slight degradation in performance.
Comparison of Minecraft terrain with and without player modifications.
Machine Learning Research

No Game Engine Required: AI creates an interactive Minecraft-like world in real time

A real-time video generator lets you explore an open-ended, interactive virtual world — a video game without a game engine.
Graph showing test loss decreases with more tokens and larger model sizes (103-109 parameters).
Machine Learning Research

Next-Gen Models Show Limited Gains: AI giants rethink model training strategy as scaling laws break down

Builders of large AI models have relied on the idea that bigger neural networks trained on more data and given more processing power would show steady improvements. Recent developments are challenging that idea.
OpenDevin animation illustrating open-source AI model collaboration.
Machine Learning Research

Free Agents: OpenHands launches as an open toolkit for advanced code generation and automation

An open source package inspired by the commercial agentic code generator Devin aims to automate computer programming and more.
Model performance comparison across English, Chinese, Math, and Code tasks, with Hunyuan-Large leading.
Machine Learning Research

Mixture of Experts Pulls Ahead: Hunyuan-Large outshines open competitors with high benchmark scores

A new open source large language model outperforms competitors, including the open-weights Llama 3.1 405B, on a variety of benchmarks.
MLE-Bench workflow showing competition steps for model training, testing, and leaderboard scoring.
Machine Learning Research

When Agents Train Algorithms: OpenAI’s MLE-bench tests AI coding agents

Coding agents are improving, but can they tackle machine learning tasks? 
COMPL-AI workflow diagram showing compliance steps for AI models under the EU AI Act.
Machine Learning Research

Does Your Model Comply With the AI Act?: COMPL-AI study measures LLMs’ compliance with EU’s AI act

A new study suggests that leading AI models may meet the requirements of the European Union’s AI Act in some areas, but probably not in others.
User retrieves vendor contact information to fill out a request form, verifying each entry.
Machine Learning Research

Claude Controls Computers: Anthropic empowers Claude Sonnet 3.5 to operate desktop apps, but cautions remain

API commands for Claude Sonnet 3.5 enable Anthropic’s large language model to operate desktop apps much like humans do. Be cautious, though: It’s a work in progress.
Green creatures with confused expressions surrounded by mirrors creating infinite reflections.
Machine Learning Research

Synthetic Data Distorts Models: Could training on generated output doom AI’s future?

Training successive neural networks on the outputs of previous networks gradually degrades performance. Will future models succumb to the curse of recursive training?
Cartoon of a ghost helping a professor answer Halloween trivia questions on a chalkboard, with students watching.
Machine Learning Research

Benchmark Tests Are Meaningless: The problem with training data contamination in machine learning

The universe of web pages includes correct answers to common questions that are used to test large language models. How can we evaluate new models if they’ve studied the answers before we give them the test?
Temporal pyramids in rows (left) and position encoding in space-time pyramid shown in the pyramidal flow matching process.
Machine Learning Research

Faster, Cheaper Video Generation: Pyramidal Flow Matching, a cost-cutting method for training video generators

Researchers devised a way to cut the cost of training video generators. They used it to build a competitive open source text-to-video model and promised to release the training code.
Comparison table of pre-trained models like Mistral, Llama, and Gemma, showcasing performance across evaluation metrics.
Machine Learning Research

Mistral AI Sharpens the Edge: Mistral AI unveils Ministral 3B and 8B models, outperforming rivals in small-scale AI

Mistral AI launched two models that raise the bar for language models with 8 billion or fewer parameters, small enough to run on many edge devices.
Diagram of a transformer model using Jina embeddings and LoRA adapters, tailored for tasks like sentiment classification.
Machine Learning Research

Better Text Embeddings: Jina AI launches jina-embeddings-v3, a text embedding model with task-specific adapters

Text embedding models are often used to retrieve text, cluster text, determine similarity between texts, and generate initial embeddings for text classifiers. A new embedding model comes with adapters that specialize it to each of these use cases.
Load More

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox