Only recently did I discover the text-to-music AI companies (udio.com, suno.com) and I was surprised about how good the results are. Both are under lawsuit from RIAA.

I am curious if there are any local ones I can experiment with or train myself. I know there is facebook/musicgen-large on HuggingFace. That model is over 1 year old and there might be others by now. Also, based on the card I get the feeling that model is not going to be good at doing specific song lyrics (maybe the lyrics just were absent from the training data?). I am most interested in trying my hand at writing songs and fine-tuning a model on specific types of music to get the sounds I am looking for.

  • Mechanize@feddit.it
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    The only text-to-audio model I can think of at the moment is Stable Audio Open, which AFAIK is rather underwhelming for your use-case, if it can even handle stuff more complex than basic sounds - and no lyrics.
    It is even under the “new” membership licensing of SAI.

    I remember reading about a more recent one, but I currently can’t find it, and I don’t think that that one too could handle lyrics.

    I suppose the Music industry is a lot harder to fight, so not a lot of people want to entangle themself with it.