Generative AI

147 Posts

GIF showing a 360° walkthrough of a conference room with a wooden table, high-back chairs, wall screens, and ceiling lights.

Generated, Editable Virtual Spaces: World Labs makes Marble world model public, adds Chisel editing tool

Models that generate 3D spaces typically render them on the fly as users move through them, without producing a persistent world that can be explored later. A new model produces 3D worlds that can be exported and modified.

GIF showing AI object detection tagging penguins on a beach, cars in traffic, and dancing people.

Open 3D Generation Pipeline: Meta’s SAM 3 image segmentation models can analyze and create bodies and other objects

Meta’s Segment Anything Model (SAM) for image segmentation has evolved into an open-weights suite for generating 3D objects. SAM 3 segments images, SAM 3D turns the segments into 3D objects, and SAM 3D Body produces 3D models of any people among the segments. You can experiment with all three.

Hands strum a guitar covered in labels from major record companies, symbolizing AI music innovation.

Record Labels Back AI-Music Startup: Klay Vision emerges from relative obscurity to announce deals with Sony, Warner, and Universal

A music-generation newcomer emerged from stealth mode with licenses to train generative AI models on music controlled by the world’s biggest recording companies.

Bar chart shows HunyuanImage 3.0's performance against Nano Banana and Seedream 4.0, highlighting differences.

Better Images Through Reasoning: HunyuanImage-3.0 uses reinforcement learning and thinking tokens to better understand prompts

A new image generator reasons over prompts to produce outstanding pictures.

AI Music With Major-Label Support: Universal Music Group and music generator Udio struck a deal to settle a lawsuit and build a new platform to remix copyrighted music

Music-generation service Udio will build an AI streaming platform in collaboration with the world’s biggest record label.

Icons for files, pictures, and shopping connect through nodes to a dollar sign, illustrating AI-driven profit pathways.

OpenAI, Meta Diversify AI Product Lines: OpenAI and Meta launch social video apps while ChatGPT adds Pulse and Instant Checkout

OpenAI and Meta, which have been content to offer standalone chatbots or tuck them into existing products, introduced dueling social video networks and other initiatives designed to boost revenue and engagement.

Robots with lighters attend a live concert, underlining AI's role in music creation and performance.

Generating Music, Paying Musicians: Sweden’s STIM built an ecosystem for training AI models on copyrighted music and compensating original artists

A Swedish organization that collects royalties on behalf of songwriters and record companies has formed a technology-legal-business ecosystem designed to allow AI developers to use music legally while compensating publishers of recordings and compositions.

Electron microscope image of bacteriophages with distinct hexagonal heads and tails on a gray background.

AI Generates Viral Genomes: Researchers use genomic language models to create custom viruses

Researchers used AI models to create novel viruses from scratch.

Three AI-generated video clips: a man vaulting over a moving car, a gymnast flipping on a plane wing, and a rabbit ice skating in pink boots.

Mixture of Video Experts: Alibaba’s Wan 2.2 video models adopt a new architecture to sort noisy from less-noisy inputs

The mixture-of-experts approach that has boosted the performance of large language models may do the same for video generation.

Man in suit holding AI book in destroyed office, from viral AI-generated video ad by The Dor Brothers.

AI Video Goes Mainstream: Meta, Google, and other giants slice up text-to-video

Generated video clips are capturing eyeballs in viral videos, ad campaigns, and a Netflix show.

Apple AI models outperform rivals in instruction accuracy and human text evaluations across devices and servers.

Apple Sharpens Its GenAI Profile: Apple updates its on-device and cloud AI models, introduces a new developer API

Apple revamped two vision-language models in a bid to catch up with fast-moving competitors.

Midjourney AI outputs mimic Disney characters, raising copyright concerns in lawsuit by Disney and Universal.

Hollywood Joins AI Copyright Fight: Disney and Universal sue Midjourney, alleging the image generator violates their intellectual property rights

Hollywood studios joined the record companies, publishers, and artists in the fight against companies that have trained AI models on their copyrighted works.

The FLUX.1 Kontext family of image generators from Black Forest Labs edits images to remove or add objects, apply art styles, and extract details.

More Consistent Characters and Styles: Black Forest Labs launches FLUX.1 Kontext for generating and altering images with consistent details

Same character, new background, new action. That’s the focus of the latest text-to-image models from Germany’s Black Forest Labs.

Duolingo owl mascots dressed in cultural costumes, representing global languages and cultures.

Machine Translation in Action: Duolingo turns to AI translation to expand its most popular courses to all 28 user languages

AI is bringing a massive boost in productivity to Duolingo, maker of the most popular app for learning languages.

AI music generation interface showing waveform and text prompts like deep house, djembe, and saxophone.

Music Generation for Pros: Google upgrades its AI music tools for professional use

Google refreshed its experimental tools for composers and producers.
