MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | IEEE Conference Publication | IEEE Xplore