A friend was a full time jazz musician for years (he’s not famous but played with many/most of the big names in jazz) and said it < https://google-research.github.io/seanet/musiclm/examples/> was really interesting to him, musically. Sort of like the ideal for a jazz band, because you’ve got all independent instruments but coming together from a single “mind”.
It seems like it's trained on entire soundwaves. I'm curious if you'd get a better result by training it on transcribed MIDI and then taking the output MIDI and plugging it into VST's.
Seems like you would still get that "central brain" compositional approach without the garbled sound quality and unidentifiable instrument noises.