Exploiting the Complementarity of Audio and Visual Data in Multi-speaker Tracking | IEEE Conference Publication | IEEE Xplore