AI Sound Generation Capable of Creating Unique Sounds by Nvidia

by Inside Telecom Staff - November 26, 2024
Reading time: 2 min

Post Views: 185

In November, Nvidia introduced Fugatto, an AI sound generation music editor tool that can create entirely novel sounds, music, and speech from text and audio inputs.

Nvidia also pointed out that the tool can also alter human voices by distorting accents or tone and changing out instruments in a melody-a piano for an opera singer, for example.

Capable of producing new AI generated sounds effects and morphing audio flawlessly into one another, Fugatto may become the go-to piece of software for the entertainment, gaming, and advertising industries. Inasmuch as this technology gets better, it also raises some pertinent questions about intellectual property and the ethical limits, ushering in an era whereby AI reimagines the frontiers of creativity.

Take, for example, Fugatto creating scores of unusual sounds like a “saxophone howling, barking then electronic music with dogs barking” or generate effects from prompts like “deep, rumbling bass pulses paired with intermittent, high-pitched digital chirps.”

Innovative Features in Competition

While AI sound effect generator tools from Adobe, OpenAI, and Google DeepMind exist, Fugatto distinguishes itself by creating entirely unique sounds. Nvidia’s announcement on November 25, included a paper detailing the extensive datasets used to train the model, comprising millions of audio samples, including resources like the BBC’s sound effects library, according to The Verge.

The chip giant’s researchers created guidelines that allowed for AI sound generation such as Fugatto to substantially broaden its scope of work without needing new data, with far greater accuracy and the ability to create novel works.

It’s not clear if or when Nvidia will release the tool publicly.

The Controversy Over AI Music

Debates have emerged in the music industry with the rise of tools that generate AI sounds, technology opens creative opportunities, but it calls into question originality and copyright. Major record labels have filed lawsuits against AI companies like Udio and Suno, alleging unauthorized use of copyrighted material for training their model.

Investigations revealed that companies like Nvidia, Apple, and Google’s partner, Anthropic, have used subtitled data from YouTube videos to train their AI systems – a practice that increasingly comes under scrutiny.

Fugatto stands out with capabilities like isolating vocals from songs and creating never-before-heard soundscapes – highlighting on rapid evolution in AI generates sounds for creative industries, a wide diffusion of accessibility, or an apparatus for research. Fugatto moves one step further in the relationship between technology and artistry.

Blending Voice AI

Voice AI is an advancement from Voice Recognition, and it goes far beyond the integration of natural language processing, machine learning, and speech recognition to understand context, emotions, and accents.

Advances in AI sound generation drive innovations in virtual assistants, accessibility tools, and automated customer service for smoother interactions and greater inclusivity. However, challenges remain in accent recognition and privacy protection.

Fugatto focuses on creativity, while Voice AI emphasizes functionality, enabling natural human-machines interaction. Both leverage AI sounds generator to redefine their fields but simultaneously raise ethical questions about data use and originality. As AI generates sounds it continues to evolve, this convergence of creative and functional tools could lead to groundbreaking applications across industries.

Inside Telecom provides you with an extensive list of content covering all aspects of the tech industry. Keep an eye on our Tech sections to stay informed and up-to-date with our daily articles.

Group-IB Launches Next-Gen Fraud Matrix to Transform Fraud Detection and Response

Did Comium Set a Trap for Africell and Qcell or Did They Trap Themselves?

Kia PV5 Tech Day:technology for limitless mobility

Monty Holding Launches Its FinTech Academy with USJ as Its First Partner

Mastercard’s President Adam Jones: MyMonty's Partnership Will Unlock Lebanon’s Digital Finance Potential

Wi-Fi 8 Taking Connectivity to New Levels Starting 2028

Meta’s Under Sea Internet Cables Will Keep Us Connected

Is Ericsson’s 5G Uplink Speed Worth the Cybersecurity Risk?

Starlink’s Direct-to-Cell Service Goes Beyond Consumer Use

China Telecom Industry Open to Foreign Investors

OpenAI Teaching ChatGPT to Feel in a World It Can’t Understand

AI Schism of Hinton’s Doomsday Warnings and Zuck’s Superintelligence Utopia

Bid Farwell to Camera Bumps with Samsung’s Paper-Thin Smartphone Metalens

Google’s New AI Acts Like a Virtual Satellite

AI Disrupts Job Market for Young Professionals, Goldman Sachs

MyMonty: The New Era of Banking

Entering the Monty Multiverse at Seamless 2023

Seamless Dubai 2023 - From Concept to Reality: Shaffra Technologies Opens Doors to Metaverse Mastery

Take A Look in the Mirror. The Greatest Technology of All Will Stare Back at You

Monty Mobile Enters Multibillion-Dollar MNO Equipment Industry

Are We Addicted to Social Media? IG, TikTok Trigger Physical and Emotional Withdrawal

Meta's AI on Instagram, Facebook Helps Save Lives

US DoT’s New Safety Plan Introduces Car Communication

Little Girl Receives First Prosthetic Eye from MRI, CT Scans

DeepL’s AI Translation Software to Get Traditional Chinese

Nvidia’s Audio AI-Generator Tool Will Creating Unique Sounds

Innovative Features in Competition

The Controversy Over AI Music

Blending Voice AI