I agree with your overall description of creation. But I do not agree that generative models are something else entirely. They are tools, and while their affordances do influence what people do with them, in the end the responsibility lies with the creator. You can make "soulless shit" or "thoughtful commentary" or anything else you put your mind to, by using these tools in combination with all the existing ones.
Models that are oriented around one-shot, text-only direction are pretty limiting in creative flow. This will hopefully continue to improve.
Making what I consider a halfway decent song with the current easiest-to-use services (like Suno and Udio) takes a few hours in my experience.
To get there, one has to work on the text and the song structure, find a decent style, and then correct the sections where the model goes off track.
To make something that is closer to "good", I would go and re-record all the lead vocals myself, and then mix this in a DAW.