While almost everything sounded really good and well put together, the voice its...

lemoncookiechip · on April 11, 2024

That static is present in most generations, but how bad it is depends on the generation itself (you get 2 variations of the same song every generation). I've had some amazingly crisp-sounding generations in vastly different genres and languages, such as opera tenor, UK rap, reggae, metal, country, Broadway musical, rock 'n' roll, Japanese, Italian, Swedish, various English accents/dialects, etc. Suno is a technical masterpiece. I understand why some people dislike the idea, but the point stands that we are HERE now, and we started with most people not even imagining it possible, and those who did saying it wouldn't be this good.

Like many people have said, this tech will only get better.

orthoxerox · on April 11, 2024

Yes, it had this GLaDOS-like timbre.

stanac · on April 11, 2024

My thoughts exactly, like I just finished Portal.

jeffhuys · on April 11, 2024

It’s called a vocoder. It lets (for instance) a monotone-sung piece of text follow a set of MIDI notes, by modulating it using a carrier wave (I think. Please correct any inaccuracies!).

I use it sometimes in FL Studio when creating electronic music (plugin is called Vocodex).

Presumably they take the AI-generated voice, generate MIDI notes, and apply a vocoder to the voice, following the notes.
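
For anyone curious what "applying a vocoder" means mechanically, here is a rough sketch of a classic channel vocoder in plain Python. In that design the voice is usually the modulator: its loudness in each frequency band shapes a pitched carrier, and the carrier is what supplies the note. Everything here is an assumption for illustration (the sample rate, the band centers, the sawtooth carrier, the function names); it is not how Vocodex or Suno works internally.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed low so the pure-Python loops stay fast


def midi_to_hz(note):
    """Convert a MIDI note number to a frequency (A4 = note 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((note - 69) / 12.0)


def sawtooth(freq, n_samples):
    """Naive (aliasing) sawtooth carrier; rich in harmonics so bands have energy."""
    return [2.0 * ((i * freq / SAMPLE_RATE) % 1.0) - 1.0 for i in range(n_samples)]


def bandpass(signal, center_hz, q=4.0):
    """Second-order band-pass filter (RBJ cookbook biquad, 0 dB peak gain)."""
    w0 = 2.0 * math.pi * center_hz / SAMPLE_RATE
    alpha = math.sin(w0) / (2.0 * q)
    b0, b2 = alpha, -alpha  # b1 is zero for this filter type
    a0, a1, a2 = 1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in signal:
        y = (b0 * x + b2 * x2 - a1 * y1 - a2 * y2) / a0
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out


def envelope(signal, smoothing_hz=30.0):
    """Envelope follower: rectify, then smooth with a one-pole low-pass."""
    coeff = math.exp(-2.0 * math.pi * smoothing_hz / SAMPLE_RATE)
    level = 0.0
    out = []
    for x in signal:
        level = coeff * level + (1.0 - coeff) * abs(x)
        out.append(level)
    return out


def vocode(modulator, midi_note, band_centers=(300, 600, 1200, 2400)):
    """Impose the modulator's per-band loudness onto a carrier at midi_note."""
    carrier = sawtooth(midi_to_hz(midi_note), len(modulator))
    out = [0.0] * len(modulator)
    for center in band_centers:
        env = envelope(bandpass(modulator, center))  # how loud the voice is here
        car = bandpass(carrier, center)              # the carrier's share of this band
        for i in range(len(out)):
            out[i] += env[i] * car[i]
    return out
```

Calling `vocode(voice_samples, 57)` would return the voice's articulation "sung" at A3 (about 220 Hz). Real vocoders use many more bands and a band-limited carrier, so treat this as a toy.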

guitarlimeo · on April 11, 2024

I think that's just an artifact, as they can also produce heavy metal scream singing etc. It just mimics something that was in the training data.

My guess is that they train the vocals and the music separately; the training data is trivial to create from any tracks with tools like https://vocalremover.org/.

makeitshine · on April 11, 2024

You mean it sounds autotuned?

viraptor · on April 11, 2024

Not quite. I'm not skilled enough in mixing to know the right description for it, sorry. I can hear vibrato-like modulation/beating, but in the vocal part only.

tokai · on April 11, 2024

Yeah, surprised at the number of comments here about how good it sounds. The voice is full of artifacts, making it quite uncomfortable to listen to.

viraptor · on April 11, 2024

It's got elements which are great and elements which fail hard. I can complain about one bit specifically but still recognise the massive improvements in other areas over what we've seen so far.