Skip to content

y-otn/mri-to-speech

Repository files navigation

mri-to-speech

This repository provides code for performing speech synthesis and F0 estimation from MRI videos.

Visit our demo website to listen to audio samples. Multi-speaker samples are also available here.

Requirements

  1. NVIDIA GPU with NVIDIA Driver and Docker
  2. TensorFlow NGC Container ~= nvcr.io/nvidia/tensorflow:23.02-tf2-py3

Setup

  1. Clone this repository: git clone https://github.com/y-otn/mri-to-speech.git

  2. Install the required packages:

    apt update
    apt install libsndfile1
  3. Install Python dependencies: pip install -r requirements.txt

Training & Inference

  1. python run.py (for speech synthesis)
  2. python run_f0.py (for F0 estimation)

Note

The code will be refactored in future updates to improve readability.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages