Abstract: Integrating information from vision and language modalities has sparked interesting applications in the fields of computer vision and natural language processing. Existing methods, though ...