Nvidia has unveiled a new artificial intelligence model, Fugatto (short for Foundational Generative Audio Transformer Opus 1), capable of generating music and audio, including modifying voices and creating novel sounds. This technology is primarily aimed at producers of music, films, and video games.
Nvidia’s Fugatto AI Model
While Nvidia, the leading supplier of chips and software for AI systems, currently has no immediate plans for public release, Fugatto joins a growing field of AI tools for audio and video generation. Similar technologies are being developed by startups like Runway and larger companies like Meta Platforms, often utilizing text prompts to generate content.
Unique Features and Capabilities
Nvidia’s Fugatto distinguishes itself through its ability to modify existing audio. It can transform a piano melody into a vocal line or alter the accent and mood of a spoken word recording. This capability sets it apart from other AI technologies focused primarily on generating new content from scratch.
Potential Impact on the Entertainment Industry
Bryan Catanzaro, vice president of applied deep learning research at Nvidia, believes that generative AI like Fugatto will revolutionize how music and audio are created and experienced. He envisions new possibilities for music, video games, and everyday users alike.
Navigating the Intersection of AI and Entertainment
The emergence of AI in the entertainment industry has raised concerns and sparked debate. Companies like OpenAI are currently in negotiations with Hollywood studios to determine the acceptable use of AI in content creation. The relationship between technology and Hollywood has become strained, particularly after actress Scarlett Johansson’s lawsuit against OpenAI over the alleged imitation of her voice.
Add Comment