Nvidia introduces Fugatto, an AI model for audio generation and modification, with potential applications in music, film, and gaming.

Nvidia has unveiled a new artificial intelligence model, Fugatto (short for Foundational Generative Audio Transformer Opus 1), capable of generating music and audio, including modifying voices and creating novel sounds. This technology is primarily aimed at producers of music, films, and video games.

Nvidia’s Fugatto AI Model

Nvidia, the leading supplier of chips and software for AI systems, has no immediate plans to release Fugatto publicly. The model joins a growing field of AI tools for audio and video generation. Startups like Runway and larger companies like Meta Platforms are developing similar technologies, which often generate content from text prompts.

Unique Features and Capabilities

Nvidia’s Fugatto distinguishes itself through its ability to modify existing audio. It can transform a piano melody into a vocal line or alter the accent and mood of a spoken-word recording. This capability sets it apart from other AI technologies, which focus primarily on generating new content from scratch.

Potential Impact on the Entertainment Industry

Bryan Catanzaro, vice president of applied deep learning research at Nvidia, believes that generative AI like Fugatto will revolutionize how music and audio are created and experienced. He envisions new possibilities for music, video games, and everyday users alike.

Navigating the Intersection of AI and Entertainment

The emergence of AI in the entertainment industry has raised concerns and sparked debate. Companies like OpenAI are negotiating with Hollywood studios over whether and how AI can be used in content creation. The relationship between the technology industry and Hollywood has become strained, particularly after actress Scarlett Johansson accused OpenAI of imitating her voice.

About the author

Mahak Aggarwal

With a BA in Mass Communication from Symbiosis, Pune, and 5 years of experience, Mahak brings compelling tech stories to life. She won the 'Rising Star in Tech Journalism' award at a recent media conclave. Her in-depth research and engaging writing style make her pieces both informative and captivating, providing readers with valuable insights.