POLO: Learning Explicit Cross-Modality Fusion for Temporal Action Localization | IEEE Journals & Magazine | IEEE Xplore