Natural Language Processing (NLP)

8 Posts

Image shows MolmoAct system processing "Put the plate in the dishwasher" through spatial reasoning.
Natural Language Processing (NLP)

Better Spatial Perception for Robots: MolmoAct creates spatial maps for robots to plot their actions before executing text directions

Robot control systems that accept only text input struggle to translate words into motions in space. Researchers developed a system that enables robots to plan spatial paths before they execute text instructions.
Chart details ChatGPT conversations. Writing (28.1%), info-seeking (21.3%), and guidance (28.3%) lead.
Natural Language Processing (NLP)

What ChatGPT Users Want: ChatGPT users now more likely to be young, female, and seeking info, study shows

What do ChatGPT’s 700 million weekly active users do with it? OpenAI teamed up with a Harvard economist to find out.
Diagram of Walmart’s Element platform for AI app development, which unifies data and containerizes processing across multiple cloud providers.
Natural Language Processing (NLP)

Inside Walmart’s AI App Factory: Walmart’s Element platform for industrial-scale AI app development — a progress report

The world’s biggest retailer by revenue revealed new details about its cloud- and model-agnostic AI application development platform.
AYA Vision architecture diagram showing vision encoder, multimodal merging, and LLM backbone for image processing
Natural Language Processing (NLP)

Equally Fluent in Many Languages: Cohere’s Aya Vision beats multilingual rivals in text & image understanding

Multilingual AI models often suffer uneven performance across languages, especially in multimodal tasks. A pair of lean models counters this trend with consistent understanding of text and images across major languages.
Amazon smart display with widgets for recipes, calendar, weather, events, and streaming (Prime Video, Netflix, Disney+).
Natural Language Processing (NLP)

Amazon’s Next-Gen Voice Assistant: Alexa+ adds generative AI and agents, using Claude and other models

Amazon announced Alexa+, a major upgrade to its long-running voice assistant.
A participant types while an MEG scan decodes brain activity into text in real-time, showing typed vs. decoded text.
Natural Language Processing (NLP)

Reading Minds, No Brain Implant Required: Brain2Qwerty, a system that decodes thoughts using brain waves without surgery

To date, efforts to decode what people are thinking from their brain waves often relied on electrodes implanted in the cortex. New work used devices outside the head to pick up brain signals that enabled an AI system, as a subject typed, to accurately guess what they were typing.
Diagram showing GPT-4o with and without search, highlighting task execution success and failure.
Natural Language Processing (NLP)

Tree Search for Web Agents: How tree search improves AI agents’ ability to browse the web and complete tasks

Browsing the web to achieve a specific goal can be challenging for agents based on large language models and even for vision-language models that can process onscreen images of a browser.
Bar chart comparing active vs. random sampling effects on length, diversity, and toxicity after fine-tuning.
Natural Language Processing (NLP)

Fine-Tuning Fine Points: Active inheritance, a smarter way to fine-tune models on synthetic data

The practice of fine-tuning models on synthetic data is becoming well established. But synthetic training data, even if it represents the training task well, may include characteristics like toxicity that impart unwelcome properties in the trained model’s output...

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox