Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation | IEEE Conference Publication | IEEE Xplore