Abstract: In this paper, we focus on weakly supervised referring expression comprehension (REC), and identify that the lack of fine-grained visual capability greatly limits the upper performance bound ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results