Rotary Encoder Code - Search News

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders

Abstract: Visual encoders are fundamental components in vision-language models (VLMs), each showcasing unique strengths derived from various pre-trained visual foundation models. To leverage the ...

GitHub

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

We propose an encoder-decoder for open-vocabulary semantic segmentation comprising a hierarchical encoder-based cost map generation and a gradual fusion decoder. We introduce a category early ...
