SALMUBench: A Benchmark for Sensitive Association-Level Multimodal Unlearning
arxiv
Lingua-SafetyBench: A Benchmark for Safety Evaluation of Multilingual Vision-Language Models
arxiv
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
arxiv
HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network
arxiv
StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues
arxiv