Large Language Models (LLMs)

129 Posts


One Weird Trick for Better Reasoning: Researchers fine-tune LLM for reasoning with only 1,000 examples

Researchers showed that supervised fine-tuning on as few as 1,000 examples can enable a pretrained large language model to reason — and a clever gambit can boost its performance to rival that of top reasoning models.
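
A minimal sketch of the general recipe (not the authors' exact pipeline): fine-tune an off-the-shelf instruction-tuned model on a small set of reasoning traces with Hugging Face Transformers. The base model, data format, and hyperparameters below are illustrative assumptions.

```python
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

BASE = "Qwen/Qwen2.5-7B-Instruct"  # assumption: any capable instruction-tuned base model
tokenizer = AutoTokenizer.from_pretrained(BASE)
model = AutoModelForCausalLM.from_pretrained(BASE)

# Stand-in for the ~1,000 curated (question, reasoning trace, answer) examples.
records = [
    {"text": "Question: What is 17 * 24?\n"
             "Reasoning: 17 * 24 = 17 * 20 + 17 * 4 = 340 + 68 = 408.\n"
             "Answer: 408"},
    # ... roughly 1,000 such examples in the real recipe
]
dataset = Dataset.from_list(records)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=2048)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="sft-reasoning",
                           num_train_epochs=3,
                           per_device_train_batch_size=1,
                           learning_rate=1e-5),
    train_dataset=tokenized,
    # Causal-LM collator: labels are built from the input ids (no masking objective).
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```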

Qwen3 Takes On DeepSeek-R1: Alibaba releases the Qwen3 family of open LLMs with optional reasoning

Alibaba’s new model family may end DeepSeek-R1’s four-month reign as the top open-weights large language model.
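
For readers who want to try the optional reasoning mode, here is a hedged sketch using Hugging Face Transformers; the enable_thinking flag follows Qwen3’s model card at release and should be treated as an assumption that may change between versions.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "Qwen/Qwen3-8B"  # one of the released sizes
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, device_map="auto")

messages = [{"role": "user", "content": "How many prime numbers are below 50?"}]

# enable_thinking toggles the chain-of-thought mode in Qwen3's chat template
# (per the model card at release; treat the flag name as an assumption).
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True, enable_thinking=True
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```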

Inferring Customer Preferences: LLMs boost shopping recommendations by decoding what users want

Large language models can improve systems that recommend items to purchase by inferring customer preferences.
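
The sketch below illustrates the general idea rather than the paper’s specific method: prompt a chat model to distill a purchase history into a preference profile, then use that profile to rank candidate items. The OpenAI client, model name, and prompts are assumptions for illustration.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

history = ["trail running shoes", "merino wool socks", "hydration vest"]
candidates = ["GPS sports watch", "office desk lamp", "energy gels", "dress shoes"]

# Step 1: infer a short preference profile from the purchase history.
profile = client.chat.completions.create(
    model="gpt-4o-mini",  # assumption: any capable chat model
    messages=[{
        "role": "user",
        "content": "Purchases: " + ", ".join(history) +
                   "\nIn one sentence, describe what this customer seems to want.",
    }],
).choices[0].message.content

# Step 2: rank candidate items against the inferred profile.
ranking = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{
        "role": "user",
        "content": f"Customer profile: {profile}\n"
                   f"Rank these items from most to least relevant: {candidates}",
    }],
).choices[0].message.content

print(profile)
print(ranking)
```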

OpenAI Launches Cost-Effective Alternatives: OpenAI replaces GPT-4.5 with the GPT-4.1 family plus o3 and o4-mini, new models focused on reasoning and coding

OpenAI refreshed its roster of models and scheduled the largest, most costly one for removal.

Toward LLMs That Understand Misspellings: New byte-based model beats Llama 3 on spelling, noise, and translation

Researchers built a model that’s more robust to noisy inputs like misspellings, smarter about character-level information like the number of R's in strawberry, and potentially better able to understand unfamiliar languages that might share groups of letters with familiar languages.
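
A quick illustration of why byte-level inputs help with character-level questions; the tiktoken encoder below is only for contrast and is unrelated to the paper’s model.

```python
import tiktoken

word, typo = "strawberry", "strawbery"

# Subword view: a BPE tokenizer may merge characters, so a one-letter typo
# can yield a quite different token sequence.
enc = tiktoken.get_encoding("cl100k_base")
print(enc.encode(word), enc.encode(typo))

# Byte view: the model sees every character, so the typo changes exactly one
# position, and character-level questions (e.g., counting the letter "r")
# stay answerable directly from the input.
print(list(word.encode("utf-8")))
print(word.count("r"))  # 3
```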

The Fall and Rise of Sam Altman: Inside Sam Altman’s brief ouster from OpenAI

A behind-the-scenes account provides new details about the abrupt firing and reinstatement of OpenAI CEO Sam Altman in November 2023.

Open Standard for Tool Use and Data Access Gains Momentum: OpenAI adopts Model Context Protocol to boost LLM tool integration

OpenAI embraced Model Context Protocol, providing powerful support for an open standard that connects large language models to tools and data.
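
A minimal sketch of an MCP server that exposes a single tool, assuming the FastMCP helper from the official Python SDK; module paths and decorator names follow the SDK’s README and may differ across versions.

```python
# A minimal MCP server that exposes one tool over stdio (pip install mcp).
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("weather-demo")

@mcp.tool()
def get_temperature(city: str) -> str:
    """Return a (canned) temperature reading for a city."""
    # A real server would call a weather API or read a local data source.
    return f"It is 21°C in {city}."

if __name__ == "__main__":
    mcp.run()  # defaults to the stdio transport, so an MCP client can launch it
```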

Google Unveils Gemini 2.5: Google’s Gemini 2.5 Pro Experimental outperforms top AI models

Google’s new flagship model raised the state of the art in a variety of subjective and objective tests.

Llama’s Mixture of Vision-Language Experts: Meta releases Llama 4 models, claims edge over AI competitors

Meta updated its popular open-weights models, claiming performance superior to closed competitors in three size classes.

LLM Support for Tutors: GPT-4 boosts remote tutors’ performance in real time, study finds

Students benefit from tutoring, but training tutors is expensive. A study shows that large language models can boost tutors’ effectiveness in real time.

Vision-Language, Compact and Open: Google releases Gemma 3 vision-language models with open weights

Google updated its open-weights family of large language models to include versions that handle image and video inputs.

Some AI-Generated Works Are Copyrightable: U.S. Copyright Office says that no new laws are needed for AI-generated works

The United States Copyright Office determined that existing laws are sufficient to decide whether a given AI-generated work is protected by copyright, making additional legislation unnecessary.

DeepSeek-R1 Uncensored: Perplexity launches uncensored version of DeepSeek-R1

Large language models built by developers in China may, in some applications, be less useful outside that country because they avoid topics its government deems politically sensitive. A developer fine-tuned DeepSeek-R1 to widen its scope without degrading its overall performance.

Compact Reasoning: QwQ-32B challenges DeepSeek-R1 and other larger reasoning models

Most models that have learned to reason via reinforcement learning have been huge. A much smaller model now competes with them.

Budget for Reasoning to the Token: Claude 3.7 Sonnet adds extended thinking mode

Anthropic’s Claude 3.7 Sonnet implements a hybrid reasoning approach that lets users decide how much thinking they want the model to do before it renders a response.
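
A hedged sketch of setting a thinking budget through the Anthropic Python SDK; the model ID and the thinking parameter follow Anthropic’s documentation at launch and should be treated as assumptions.

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-3-7-sonnet-20250219",      # model ID as published at launch
    max_tokens=4096,                          # must exceed the thinking budget
    thinking={"type": "enabled", "budget_tokens": 2048},  # cap on reasoning tokens
    messages=[{"role": "user", "content": "How many times does the digit 7 appear from 1 to 100?"}],
)

# The response interleaves "thinking" blocks with the final "text" answer.
for block in response.content:
    if block.type == "text":
        print(block.text)
```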