"Emanuele Bugliarello,Ryan Cotterell,Naoaki Okazaki,Desmond Elliott","Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs",,"Transactions of the Association for Computational Linguistics",,"Vol. 9",,"pp. 978-994",2021,Sept.