Generative AI

153 Posts

Collage with comic strip, concert poster, diagrams on water cycle and trash sorting, and movie poster.

Refining Words in Pictures: Z.ai’s GLM-Image blends transformer and diffusion architectures for better text in images

Image generators often mangle text. An open-weights model outperforms open and proprietary competitors in text rendering.

A warm-toned room features a sofa, a decorated shelf, and sunlight filtering through patterned curtains.

Generative AI

Detailed Text- or Image-to-3D, Pronto: FlashWorld generates 3D objects, scenes, and surfaces with photorealistic fidelity

Current methods that produce 3D scenes from text or images are slow and produce inconsistent results. Researchers introduced a technique that generates detailed, coherent 3D scenes seconds.

ChatGPT interface on a phone displays a conversation and a sponsored grocery ad at the bottom of the screen.

Generative AI

ChatGPT Shows Ads: OpenAI tests advertisements for U.S. chatbot users in free and lower-cost tiers

AI has a new revenue stream, and it looks a lot like old web banner ads.

Juan M. Lavista Ferres is pictured holding a laptop while students watch a video about AI on a screen, linking education and technology.

Generative AI

Education That Works With — Not Against — AI by Juan M. Lavista Ferres: Juan M. Lavista Ferres, Chief Data Scientist at Microsoft, on assignments that properly test students’ abilities

A little more than three years ago, OpenAI released ChatGPT, and education changed forever. For students, the ability to generate fluent, credible text on demand in seconds is an incredible new tool.

Generative AI

Disney Teams Up With OpenAI: OpenAI’s Sora video generator will include Disney characters, with fan videos on Disney+

Disney, the entertainment conglomerate that owns Marvel, Pixar, Lucasfilm and its own animated classics from 101 Dalmatians to Zootopia, licensed OpenAI to use its characters in generated videos.

GIF showing a robotic arm picking up glasses on a table and handling tools on a kitchen countertop.

Generative AI

Coherent, Interactive Worlds: Runway’s GWM-1 models generate videos with consistent physics for robots and entertainment

Runway’s GWM-1 family of video-generation models respond to user input in real time while producing scenes that remain consistent regardless of the camera’s position.

GIF showing a 360° walkthrough of a conference room with a wooden table, high-back chairs, wall screens, and ceiling lights.

Generative AI

Generated, Editable Virtual Spaces: World Labs makes Marble world model public, adds Chisel editing tool

Models that generate 3D spaces typically generate them as users move through them without generating a persistent world to be explored later. A new model produces 3D worlds that can be exported and modified.

GIF showing AI object detection tagging penguins on a beach, cars in traffic, and dancing people.

Generative AI

Open 3D Generation Pipeline: Meta’s SAM 3 image segmentation models can analyze and create bodies and other objects

Meta’s Segment Anything Model (SAM) image-segmentation model has evolved into an open-weights suite for generating 3D objects. SAM 3 segments images, SAM 3D turns the segments into 3D objects, and SAM 3D Body produces 3D objects of any people among the segments. You can experiment with all three.

Hands strum a guitar covered in labels from major record companies, symbolizing AI music innovation.

Generative AI

Record Labels Back AI-Music Startup: Klay Image emerges from relative obscurity to announce deals with Sony, Warner, and Universal

A music-generation newcomer emerged from stealth mode with licenses to train generative AI models on music controlled by the world’s biggest recording companies.

Bar chart shows HunyuanImage 3.0's performance against Nano Banana and Seedream 4.0, highlighting differences.

Generative AI

Better Images Through Reasoning: HunyuanImage-3.0 uses reinforcement learning and thinking tokens to better understand prompts

A new image generator reasons over prompts to produce outstanding pictures.

Generative AI

AI Music With Major-Label Support: Universal Music Group and music generator Udio struck a deal to settle a lawsuit and build a new platform to remix copyrighted music

Music-generation service Udio will build an AI streaming platform in collaboration with the world’s biggest record label.

Icons for files, pictures, and shopping connect through nodes to a dollar sign, illustrating AI-driven profit pathways.

Generative AI

OpenAI, Meta Diversify AI Product Lines: OpenAI and Meta launch social video apps while ChatGPT adds Pulse and Instant Checkout

OpenAI and Meta, which have been content to offer standalone chatbots or tuck them into existing products, introduced dueling social video networks and other initiatives designed to boost revenue and engagement.

Robots with lighters attend a live concert, underlining AI's role in music creation and performance.

Generative AI

Generating Music, Paying Musicians: Sweden’s STIM built an ecosystem for training AI models on copyrighted music and compensating original artists

A Swedish organization that collects royalties on behalf of songwriters and record companies has formed a technology-legal-business ecosystem designed to allow AI developers to use music legally while compensating publishers of recordings and compositions.

Electron microscope image of bacteriophages with distinct hexagonal heads and tails on a gray background.

Generative AI

AI Generates Viral Genomes: Researchers use genomic language models to create custom viruses

Researchers used AI models to create novel viruses from scratch.

Three AI-generated video clips: a man vaulting over a moving car, a gymnast flipping on a plane wing, and a rabbit ice skating in pink boots.

Generative AI

Mixture of Video Experts: Alibaba’s Wan 2.2 video models adopt a new architecture to sort noisy from less-noisy inputs

The mixture-of-experts approach that has boosted the performance of large language models may do the same for video generation.

Generative AI

Refining Words in Pictures: Z.ai’s GLM-Image blends transformer and diffusion architectures for better text in images

Detailed Text- or Image-to-3D, Pronto: FlashWorld generates 3D objects, scenes, and surfaces with photorealistic fidelity

ChatGPT Shows Ads: OpenAI tests advertisements for U.S. chatbot users in free and lower-cost tiers

Education That Works With — Not Against — AI by Juan M. Lavista Ferres: Juan M. Lavista Ferres, Chief Data Scientist at Microsoft, on assignments that properly test students’ abilities

Disney Teams Up With OpenAI: OpenAI’s Sora video generator will include Disney characters, with fan videos on Disney+

Coherent, Interactive Worlds: Runway’s GWM-1 models generate videos with consistent physics for robots and entertainment

Generated, Editable Virtual Spaces: World Labs makes Marble world model public, adds Chisel editing tool

Open 3D Generation Pipeline: Meta’s SAM 3 image segmentation models can analyze and create bodies and other objects

Record Labels Back AI-Music Startup: Klay Image emerges from relative obscurity to announce deals with Sony, Warner, and Universal

Better Images Through Reasoning: HunyuanImage-3.0 uses reinforcement learning and thinking tokens to better understand prompts

AI Music With Major-Label Support: Universal Music Group and music generator Udio struck a deal to settle a lawsuit and build a new platform to remix copyrighted music

OpenAI, Meta Diversify AI Product Lines: OpenAI and Meta launch social video apps while ChatGPT adds Pulse and Instant Checkout

Generating Music, Paying Musicians: Sweden’s STIM built an ecosystem for training AI models on copyrighted music and compensating original artists

AI Generates Viral Genomes: Researchers use genomic language models to create custom viruses

Mixture of Video Experts: Alibaba’s Wan 2.2 video models adopt a new architecture to sort noisy from less-noisy inputs

Subscribe to The Batch