Large Language Models (LLMs)

55 Posts

Better Teachers Make Better Students: Microsoft’s Orca 2 strengthens the native reasoning abilities of smaller models

A relatively small student LLM that learns to mimic a larger teacher model can perform nearly as well as the teacher while using much less computation. It can come even closer if the teacher also teaches reasoning techniques.

Disinformation Documented: OpenAI takes action against misuse of its models in propaganda

OpenAI models were used in five disinformation campaigns, the company said.

Rise of the AI PC: Microsoft launches AI-driven Copilot+ PCs

Generative AI plays a starring role in the latest Windows PCs.

Richer Context for RAG: RAPTOR, a recursive summarizer, captures more relevant context for LLM inputs

Text excerpts used in retrieval augmented generation (RAG) tend to be short. Researchers used summarization to pack more relevant context into the same amount of text.

2 Million Tokens of Context & More: Google’s I/O developers’ conference reveals new AI models, features, and upgrades.

Google’s annual I/O developers’ conference brought a plethora of updates and new models. 

Why ChatGPT Acts That Way: OpenAI introduces guidelines for model behavior, seeks public feedback

OpenAI pulled back the curtain on revised rules that will guide its models. 

Streamlined Inference: Deja Vu, a method that boosts LLM speed by activating only essential neural parts

It’s not necessary to activate all parts of a large language model to process a given input. Using only the necessary parts saves computation.

Coding Assistance Start to Finish: GitHub previews Copilot Workspace for end-to-end software development

GitHub Copilot’s latest features are designed to help manage software development from plan to pull request. 

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.

Apple is thinking small — very small — with a new family of open large language models.

Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.

How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.

Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval

Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.

Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code

Language models can generate code that erroneously points to software packages, creating vulnerabilities that attackers can exploit.

More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback

Large language models sometimes generate false statements. New work makes them more likely to produce factual output.

Microsoft Absorbs Inflection: Microsoft pays Inflection AI $650 million, hires most of its staff

Microsoft took over most of the once high-flying chatbot startup Inflection AI in an unusual deal.

Cutting the Cost of Pretrained Models: FrugalGPT, a method to cut AI costs and maintain quality

Research aims to help users select large language models that minimize expenses while maintaining quality.