
Beyond Modality Collapse: Representations Blending for Multimodal...
Multimodal Dataset Distillation (MDD) seeks to condense large-scale image-text datasets into compact surrogates while retaining their effectiveness for cross-modal learning. Despite recent...




