As far as I know, recent models such as AudioLDM 1 and AudioLDM 2 cannot generate long music. The YouTube video
https://www.youtube.com/watch?v=1wAdQhFJy54 is 4 minutes long, and its music is high quality.
I recommend the InspireMusic model (https://github.com/FunAudioLLM/InspireMusic), which was announced in 2025. It can generate long music, supports complex text prompts, and produces high-quality output.