424
submitted 11 months ago by L4s@lemmy.world to c/technology@lemmy.world

Google is embedding inaudible watermarks right into its AI generated music::Audio created using Google DeepMind’s AI Lyria model will be watermarked with SynthID to let people identify its AI-generated origins after the fact.

you are viewing a single comment's thread
view the rest of the comments
[-] autotldr@lemmings.world 8 points 11 months ago

This is the best summary I could come up with:


Audio created using Google DeepMind’s AI Lyria model, such as tracks made with YouTube’s new audio generation features, will be watermarked with SynthID to let people identify their AI-generated origins after the fact.

In a blog post, DeepMind said the watermark shouldn’t be detectable by the human ear and “doesn’t compromise the listening experience,” and added that it should still be detectable even if an audio track is compressed, sped up or down, or has extra noise added.

President Joe Biden’s executive order on artificial intelligence, for example, calls for a new set of government-led standards for watermarking AI-generated content.

According to DeepMind, SynthID’s audio implementation works by “converting the audio wave into a two-dimensional visualization that shows how the spectrum of frequencies in a sound evolves over time.” It claims the approach is “unlike anything that exists today.”

The news that Google is embedding the watermarking feature into AI-generated audio comes just a few short months after the company released SynthID in beta for images created by Imagen on Google Cloud’s Vertex AI.

The watermark is resistant to editing like cropping or resizing, although DeepMind cautioned that it’s not foolproof against “extreme image manipulations.”


The original article contains 230 words, the summary contains 195 words. Saved 15%. I'm a bot and I'm open source!

[-] SuckMyWang@lemmy.world 10 points 11 months ago* (last edited 11 months ago)

it does this by converting the audio into a 2d visualisation that shows how the spectrum of frequencies evolves in a sound over time

Old school windows media player has entered the chat

Seriously fuck off with this jargon, it doesn’t explain anything

[-] Terminarchs@slrpnk.net 22 points 11 months ago

That's actually an accurate description of what is happening: an audio file turned into a 2d image with the x axis being time, the y axis being frequency and color being amplitude.

[-] RufusLoacker@feddit.it 9 points 11 months ago

That's literally a spectrograph

[-] Terminarchs@slrpnk.net 8 points 11 months ago
[-] Viking_Hippie@lemmy.world 6 points 11 months ago

Your mom's literally a spectrograph.

[-] SuckMyWang@lemmy.world 1 points 11 months ago

I know, it’s like the old windows media player visualisations.

load more comments (7 replies)
load more comments (7 replies)
this post was submitted on 17 Nov 2023
424 points (99.5% liked)

Technology

59081 readers
3211 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS