Align Text in One Note

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Abstract: Text-based Visual Question Answering (TextVQA) focuses on answering questions about the scene text in images. Most works in this field uses transformer based models to modeling the ...

IEEE

AMVLM: Alignment-Multiplicity Aware Vision-Language Model for Semi-Supervised Medical Image Segmentation

Abstract: Low-quality pseudo labels pose a significant obstacle in semi-supervised medical image segmentation (SSMIS), impeding consistency learning on unlabeled data. Leveraging vision-language model ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

AMVLM: Alignment-Multiplicity Aware Vision-Language Model for Semi-Supervised Medical Image Segmentation

Trending now