YuanGongND

Follow

Yuan Gong YuanGongND

Follow

Research Scientist, MIT CSAIL

446 followers · 2 following

MIT
Cambridge, MA
15:38 (UTC -04:00)
yuangongnd.github.io

Achievements

Achievements

Pinned Loading

ltu ltu Public

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 472 41
whisper-at whisper-at Public

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 412 36
gopt gopt Public

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Python 201 38
cav-mae cav-mae Public

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 290 24
ssast ssast Public

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 419 67
ast ast Public

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.4k 245