Addition of Large Numbers in C Programming in Visual Studio

Curriculum Learning aided Audio-Visual Speech Recognition with Arbitrary Speaker Number

Abstract: Recently, audio-visual speech recognition has attracted increasing attention. However, most existing works only focused on scenarios with two speakers. In this work, we study the effect of ...

GitHub

This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models".

We find a commonality of various dirty samples is visual-linguistic inconsistency between images and associated labels. To capture the semantic inconsistency between modalities, we propose versatile ...

IEEE

Hybrid-NeuroSLAM: A Neurobiologically Inspired Hybrid Visual-Inertial SLAM Method for Large Scale Environment

Abstract: Animals in nature exhibit remarkable spatial cognition abilities, enabling them to achieve long-distance autonomous navigation efficiently in unknown environments. Neurobiologically inspired ...
