Neural Networks
Low Precision, High Performance: Microsoft and Tsinghua researchers propose a 1.58-bit AI model that rivals full-precision competitors
Reducing the number of bits used to represent each parameter in a neural network from, say, 16 bits to 8 bits shrinks the network’s size and boosts its speed. Researchers took this approach to an extreme: They built a competitive large language model whose weights are limited to three values (-1, 0, and +1), which works out to log2(3) ≈ 1.58 bits per weight.
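To make the idea concrete, here is a minimal sketch of absmean-style ternary quantization in the spirit of this work: scale each weight matrix by its mean absolute value, then round and clip to the three allowed values. The function name and epsilon are illustrative, not taken from the authors' code.

```python
import numpy as np

def ternary_quantize(weights: np.ndarray, eps: float = 1e-8):
    """Quantize a weight matrix to {-1, 0, +1} via absmean scaling (sketch)."""
    gamma = np.abs(weights).mean()                 # absmean scale factor
    scaled = weights / (gamma + eps)               # normalize by the scale
    quantized = np.clip(np.round(scaled), -1, 1)   # snap to {-1, 0, +1}
    return quantized.astype(np.int8), gamma        # keep gamma to rescale outputs

# Each ternary weight carries log2(3) ≈ 1.58 bits of information,
# hence the "1.58-bit" name.
w = np.random.randn(4, 4).astype(np.float32)
w_q, scale = ternary_quantize(w)
```

Beyond shrinking storage, restricting weights to -1, 0, and +1 means matrix multiplications reduce to additions and subtractions, which is a large part of the claimed speed and energy savings.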