
Audio AIGC Model

Currently, no strong base (foundation) models are available in the audio field.

| Base model | Release date | Publisher | Note |
| --- | --- | --- | --- |
| dance-diffusion | 2022.09 | harmonai | |
| audio-diffusion | 2022.08 | teticio | |
| riffusion | 2022.12 | Seth Forsgren | Generates a spectrogram with a diffusion model and converts it to music |
| audioldm | 2023.01 | haoheliu | |
| bark | 2023.04 | suno.ai | |
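
The riffusion note above mentions converting a generated spectrogram into music. Because a magnitude spectrogram carries no phase, that step needs some form of phase reconstruction. The sketch below shows one common way to do it, the Griffin-Lim algorithm, in plain NumPy. It is an illustration under stated assumptions, not necessarily the method riffusion uses: the function names, the FFT size, the hop length, and the normalization floor are all hypothetical choices made for this example.

```python
import numpy as np


def stft(signal, n_fft=512, hop=128):
    """Short-time Fourier transform with a Hann window (no edge padding)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)  # shape: (n_frames, n_fft // 2 + 1)


def istft(spec, n_fft=512, hop=128):
    """Inverse STFT by windowed overlap-add."""
    window = np.hanning(n_fft)
    n_frames = spec.shape[0]
    length = n_fft + hop * (n_frames - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(spec, n=n_fft, axis=1)
    for i in range(n_frames):
        start = i * hop
        out[start : start + n_fft] += frames[i] * window
        norm[start : start + n_fft] += window ** 2
    # Assumed floor: keeps the sparsely covered edges from being amplified.
    return out / np.maximum(norm, 0.1)


def griffin_lim(magnitude, n_iter=50, n_fft=512, hop=128, seed=0):
    """Estimate audio from a magnitude spectrogram by iterating on the phase."""
    rng = np.random.default_rng(seed)
    # Start from a random phase guess.
    phase = np.exp(2j * np.pi * rng.random(magnitude.shape))
    for _ in range(n_iter):
        # Go to the time domain and back, then keep only the new phase.
        audio = istft(magnitude * phase, n_fft, hop)
        rebuilt = stft(audio, n_fft, hop)
        phase = rebuilt / np.maximum(np.abs(rebuilt), 1e-8)
    return istft(magnitude * phase, n_fft, hop)
```

To try it, take `np.abs(stft(x))` of any mono signal `x` as a stand-in for a model-generated spectrogram and pass it to `griffin_lim`. A real pipeline would usually also need to map the model's image output (for example a mel-scaled, log-amplitude picture) back to linear magnitudes first, which this sketch leaves out.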

The open-source models currently available are still some way from being usable in real applications. If you want to hear better results, take a look at mubert.

Besides models that generate music or sound from scratch, another technology that is now close to being practical is voice conversion.

Voice conversion modifies a source speaker's speech so that it sounds like the voice of a different, target speaker, while keeping what is said unchanged.

A well-known project in this area at present is so-vits-svc, which is aimed at Chinese.