A Dataset and Exploration of Models for Understanding Video Data through Fill-in-the-Blank Question-Answering | IEEE Conference Publication