Vision Encoder/Decoder Model

Rethinking Encoder-Decoder Flow Through Shared Structures

Abstract: Dense prediction tasks have enjoyed a growing complexity of encoder architectures; decoders, however, have remained largely the same. They rely on individual blocks decoding intermediate ...

IEEE

SLADE: Shielding against Dual Exploits in Large Vision-Language Models

Abstract: Large Vision-Language Models (LVLMs) have emerged as transformative tools in multimodal tasks, seamlessly integrating pretrained vision encoders to align visual and textual modalities. Prior ...
