June 17, 2024
2024
Our new paper is now on arXiv Localizing Events in Videos with Multimodal Queries!