Align Text in One Note

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Abstract: Text-based Visual Question Answering (TextVQA) focuses on answering questions about the scene text in images. Most works in this field uses transformer based models to modeling the ...

IEEE

FAME: Fusion of Alignment and Multiview Enhancement for Remote Sensing Image–Text Retrieval

Abstract: Contemporary advancements in Earth observation technologies have generated substantial data resources for remote sensing image retrieval applications. However, existing models exhibit ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

FAME: Fusion of Alignment and Multiview Enhancement for Remote Sensing Image–Text Retrieval

Trending now