ImLoc: Revisiting Visual Localization with Image-based Representation
arxiv
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
arxiv
WS-IMUBench: Can Weakly Supervised Methods from Audio, Image, and Video Be Adapted for IMU-based Temporal Action Localization?
arxiv
Panoramic Affordance Prediction
arxiv
MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis
arxiv