【Philippines Archives】
Everyone knows sound is Philippines Archivesa critical component to most films and videos. After all, even when films were silent, there was still a musical accompanist letting the audience know how to feel.
This natural law remains the same for the new crop of generative AI videos, which emerge eerily silent. That's part of why Google has been working on "video-to-audio" technology (V2A) which "makes synchronized audiovisual generation possible." On Monday, Google's AI lab, DeepMind, shared progress on generating such audio including soundtracks and dialogue that automatically match up with AI-generated videos.
Google has been hard at work developing multimodal generative AI technology to compete with rivals. OpenAI has its AI video generator Sora (yet to be publicly released) and GPT-4o, which creates AI voice responses. Companies like Meta and Suno have been exploring AI-generated audio and music, but pairing audio with video is relatively new. ElevenLabs has a similar tool that matches audio to text prompts, but DeepMind says V2A is different because it doesn't require text prompts.
You May Also Like
SEE ALSO: Luma AI Dream Machine: What it is, how to try it
V2A can be paired with AI video tools like Google Veo or existing archival footage and silent films. This can be used for soundtracks, sound effects, and even dialogue. It works by using a diffusion model trained with visual inputs, natural language prompts, and video annotations to gradually refine random noise into audio that fits the tone and context of videos.
Google DeepMind says V2A can "understand raw pixels" therefore you don't actually need a text prompt to generate the audio, but it does help with the accuracy. The model can also be prompted to make the tone of the audio sound positive or negative. Along with the announcement, DeepMind released some demo videos, including a video of a dark, creepy hallway accompanied by horror music, a lone cowboy at sunset scored to a mellow harmonica tune, and an animated figure talking about its dinner.
Related Stories
- Google apologises after Gemini AI generates images of Nazis as people of colour
- OpenAI, Google DeepMind insiders have serious warnings about AI
- Sora-created short films to screen at Tribeca Film Festival
V2A will include Google's SynthID watermarking as a safeguarding measure against misuse, and Deepmind's blog post says the feature is currently undergoing testing before it's released to the public.
Topics Artificial Intelligence Google
Search
Categories
Latest Posts
Your Faceprint Tomorrow
2025-06-26 02:34Best Black Friday streaming add
2025-06-26 01:24Early Black Friday drone deals [2024]
2025-06-26 00:58What Comes After Resistance?
2025-06-26 00:27Popular Posts
Memory Keepers
2025-06-26 02:29NYT Strands hints, answers for November 28
2025-06-26 02:22Three AI products that flopped in 2024
2025-06-26 02:06Early Black Friday drone deals [2024]
2025-06-26 01:17The Madness of King Musk
2025-06-26 00:26Featured Posts
American Mirage
2025-06-26 01:50Best Black Friday Hulu deal: $0.99 per month for 1 year
2025-06-26 01:48Shop the best Black Friday deals on tablets
2025-06-26 01:17Wordle today: The answer and hints for November 29
2025-06-26 00:44Carbon Omissions
2025-06-26 00:03Popular Articles
Yesterday’s Liberal
2025-06-26 02:03Best Black Friday TV deal: Save $600 on the Samsung Frame TV
2025-06-26 01:19Best Black Friday iPad deal: Save $90 on Apple iPad (10th Gen)
2025-06-26 01:01Underwriters of the World, Ideate!
2025-06-25 23:52Newsletter
Subscribe to our newsletter for the latest updates.
Comments (74988)
Follow Information Network
It’s Fun to Be in the DSA!
2025-06-26 02:26Co-creation Information Network
Early Black Friday Apple Watch deals: Series 10, 9, and SE down to record lows
2025-06-26 01:31Exploration Information Network
Early Black Friday Chromebook deals: Save on Asus, Lenovo, and more
2025-06-26 01:21Discovery Information Network
OpenAI Sora leak: What it was and what it wasn’t.
2025-06-26 01:17Information Information Network
Far Cry 5 Benchmarked: 50 GPUs Tested
2025-06-25 23:53