Skip to content
View YuanGongND's full-sized avatar

Block or report YuanGongND

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ltu ltu Public

    Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

    Python 472 41

  2. whisper-at whisper-at Public

    Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

    Python 412 36

  3. gopt gopt Public

    Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

    Python 201 38

  4. cav-mae cav-mae Public

    Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

    Python 290 24

  5. ssast ssast Public

    Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

    Python 419 67

  6. ast ast Public

    Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

    Jupyter Notebook 1.4k 245