Large Language Models (LLMs)

67 Posts

Mini but Mighty: OpenAI's GPT-4o Mini offers big performance at a small price
Large Language Models (LLMs)

Mini but Mighty: OpenAI's GPT-4o Mini offers big performance at a small price

A slimmed-down version of Open AI’s multimodal flagship packs a low-price punch.
Hallucination Detector: Oxford scientists propose effective method to detect AI hallucinations
Large Language Models (LLMs)

Hallucination Detector: Oxford scientists propose effective method to detect AI hallucinations

Large language models can produce output that’s convincing but false. Researchers proposed a way to identify such hallucinations. 
How Open Are Open Models?: Radboud University study ranks AI models on openness
Large Language Models (LLMs)

How Open Are Open Models?: Radboud University study ranks AI models on openness

The word “open” can mean many things with respect to AI. A new paper outlines the variations and ranks popular models for openness.
Like LoRA, But for Pretraining: GaLore, a memory-saving method for pretraining and fine-tuning LLMs
Large Language Models (LLMs)

Like LoRA, But for Pretraining: GaLore, a memory-saving method for pretraining and fine-tuning LLMs

Low-rank adaptation (LoRA) reduces memory requirements when fine-tuning large language models, but it isn’t as conducive to pretraining.
Amazon Onboards Adept: Amazon adds majority of Adept AI staff to boost agentic AI capabilities
Large Language Models (LLMs)

Amazon Onboards Adept: Amazon adds majority of Adept AI staff to boost agentic AI capabilities

Amazon hired most of the staff of agentic-AI specialist Adept AI in a move that echoes Microsoft’s absorption of Inflection in March.
Claude Advances the LLM Interface: Claude 3.5 Sonnet’s Artifacts feature makes it easier to build and code on-site
Large Language Models (LLMs)

Claude Advances the LLM Interface: Claude 3.5 Sonnet’s Artifacts feature makes it easier to build and code on-site

Claude 3.5 Sonnet lets users work on generated outputs as though they were independent files — a step forward in large language model user interfaces.
Model Merging Evolves: Researchers developed automated system for efficient model merging
Large Language Models (LLMs)

Model Merging Evolves: Researchers developed automated system for efficient model merging

The technique of model merging combines separate models into a single, more capable model without further training, but it requires expertise and manual effort. Researchers automated the process.
Challenging Human-Level Models: Hugging Face overhauls open LLM leaderboard with tougher benchmarks
Large Language Models (LLMs)

Challenging Human-Level Models: Hugging Face overhauls open LLM leaderboard with tougher benchmarks

An influential ranking of open models revamped its criteria, as large language models approach human-level performance on popular tests.
Chatbot for Minority Languages: Startup Two AI launches SUTRA, a multilingual model for South Asian markets
Large Language Models (LLMs)

Chatbot for Minority Languages: Startup Two AI launches SUTRA, a multilingual model for South Asian markets

An AI startup that aims to crack markets in southern Asia launched a multilingual competitor to GPT-4.
Safety, Evaluations and Alignment Lab (SEAL) Leaderboards.
Large Language Models (LLMs)

Private Benchmarks for Fairer Tests: Scale AI launches SEAL leaderboards to benchmark model performance

Scale AI offers new leaderboards based on its own benchmarks.
Different results from new text-to-image models from Nvidia, Alibaba, and Stability AI
Large Language Models (LLMs)

More New Open Models: New models from Nvidia, Alibaba, and Stability AI expand open options

A trio of powerful open and semi-open models give developers new options for both text and image generation. 
Apple’s Gen AI Strategy Revealed: Apple unveils AI features in new iOS and MacOS update during WWDC
Large Language Models (LLMs)

Apple’s Gen AI Strategy Revealed: Apple unveils AI features in new iOS and MacOS update during WWDC

Apple presented its plan to imbue its phones and computers with artificial intelligence. 
Better Teachers Make Better Students: Microsoft‘s Orca 2 strengthens the native reasoning abilities of smaller models
Large Language Models (LLMs)

Better Teachers Make Better Students: Microsoft‘s Orca 2 strengthens the native reasoning abilities of smaller models

A relatively small student LLM that learns to mimic a larger teacher model can perform nearly as well as the teacher while using much less computation. It can come even closer if the teacher also teaches reasoning techniques.
Disinformation Documented: OpenAI takes action against misuse of its models in propaganda
Large Language Models (LLMs)

Disinformation Documented: OpenAI takes action against misuse of its models in propaganda

OpenAI models were used in five disinformation campaigns, the company said.
Windows Laptop displaying a colored wallpaper
Large Language Models (LLMs)

Rise of the AI PC: Microsoft launches AI-driven Copilot+ PCs

Generative AI plays a starring role in the latest Windows PCs.
Load More

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox