
DeepMind’s new AI generates soundtracks and dialogue for videos

Last updated: 2024/06/19 at 5:43 PM

DeepMind, Google’s AI research lab, is developing a new AI technology designed to create soundtracks for videos. Named V2A (video-to-audio), it is being positioned as a crucial component in the evolving landscape of AI-generated media. Video-generating AI models, including those developed by DeepMind, have advanced significantly, but many of them still cannot produce sound synchronized with the video they generate.

“Video generation models are advancing at an incredible pace, but many current systems can only generate silent output,” DeepMind stated in an official blog post. “V2A technology [could] become a promising approach for bringing generated movies to life.”

DeepMind’s V2A technology takes a description of a soundtrack (for example, “jellyfish pulsating under water, marine life, ocean”) paired with a video and produces music, sound effects, and dialogue that match the video’s characters and tone. The generated audio is watermarked with DeepMind’s deepfake-combating SynthID technology. The AI model driving V2A, a diffusion model, was trained on a diverse set of sounds, dialogue transcripts, and video clips.

“By training on video, audio, and additional annotations, our technology learns to associate specific audio events with various visual scenes, while responding to the information provided in the annotations or transcripts,” explained DeepMind. DeepMind has not disclosed whether any of the training data was copyrighted or if the data creators were informed about the use of their content. Clarification has been sought from DeepMind on this matter.

AI-powered sound-generating tools are not new. Stability AI recently released one, and ElevenLabs launched one in May. Models that create sound for video are not new either: a Microsoft project can produce talking and singing videos from still images, and platforms like Pika and GenreX generate music or effects for videos.

However, DeepMind claims that V2A technology is unique because it can understand the raw pixels of a video and automatically sync generated sounds with the video, even without a description. Despite its potential, V2A technology has its limitations. The model was not extensively trained on videos with artifacts or distortions, so it produces lower-quality audio for such videos. More generally, the generated audio does not sound entirely natural, with critics describing it as a “smorgasbord of stereotypical sounds.”

To prevent misuse, DeepMind has decided against releasing the technology to the public in the near future. “To ensure our V2A technology has a positive impact on the creative community, we’re gathering diverse perspectives and insights from leading creators and filmmakers and using this feedback to guide our ongoing research and development,” DeepMind noted. Before considering broader public access, the technology will undergo rigorous safety assessments and testing.

DeepMind envisions V2A technology as particularly beneficial for archivists and individuals working with historical footage. However, the broader implications of generative AI in the film and TV industry pose significant challenges. Ensuring that such tools do not eliminate jobs or entire professions will require strong labour protections and careful consideration of the technology’s impact.

In summary, while DeepMind’s V2A technology holds promise for enhancing AI-generated media, its development and deployment must be handled with caution to avoid unintended negative consequences.
