Nvidia’s new AI audio model can synthesize sounds that have never existed

November 25, 2024

At this point, anyone who has been following AI research is long familiar with generative models that can synthesize speech or melodic music from nothing but text prompting. Nvidia’s newly revealed “Fugatto” model looks to go a step further, using new synthetic training methods and inference-level combination techniques to “transform any mix of music, voices, and sounds,” including the synthesis of sounds that have never existed.

While Fugatto isn’t available for public testing yet, a sample-filled website showcases how Fugatto can be used to dial a number of distinct audio traits and descriptions up or down, resulting in everything from the sound of saxophones barking to people speaking underwater

→ Continue reading at Ars Technica

Comments

Supreme Court wants US input on whether ISPs should be liable for users’ piracy

Indigenous Pop-Up Javelina Is Gearing Up For a Winter Residency and a Full-Blown Restaurant

Nvidia’s new AI audio model can synthesize sounds that have never existed

Related articles

Comments

Share article

Latest articles

Keller Auditorium, Arlene Schnitzer Concert Hall, more face 19% budget cut amid rising costs

Trader Joe’s mini tote bags are back — and in new colors

B.C. immigration lawyer ‘very busy’ fielding calls for advice about travelling to U.S.

Hikers Now Smashing Their Own Car Windows to Embrace That Authentic Northwest Outdoor Vibe

Green candidate kicks off campaign in qathet region

Trump administration’s attack on university research accelerates