This repository provides code for performing speech synthesis and F0 estimation from MRI videos.
Visit our demo website to listen to audio samples. Multi-speaker samples are also available here.
- NVIDIA GPU with NVIDIA Driver and Docker
- TensorFlow NGC Container ~= nvcr.io/nvidia/tensorflow:23.02-tf2-py3
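One possible way to start the container before running the installation steps inside it is sketched below. The image tag is the one listed above. The mount point `/workspace/work` is a hypothetical choice, and `--gpus all` assumes the NVIDIA Container Toolkit is set up on the host; adjust both to your environment.

```shell
# Sketch only: launch the NGC TensorFlow container with GPU access.
IMAGE="nvcr.io/nvidia/tensorflow:23.02-tf2-py3"  # tag from the requirements list
MOUNT="/workspace/work"                          # hypothetical path inside the container

# Mount the current host directory so files persist after the container exits.
docker run --gpus all -it --rm \
  -v "$PWD:$MOUNT" \
  -w "$MOUNT" \
  "$IMAGE"
```

The remaining steps (cloning, `apt`, `pip`) would then be run from the shell that this command opens.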
- Clone this repository:
    git clone https://github.com/y-otn/mri-to-speech.git
- Install the required packages:
    apt update
    apt install libsndfile1
- Install Python dependencies:
    pip install -r requirements.txt
python run.py (for speech synthesis)
python run_f0.py (for F0 estimation)
The code will be refactored in future updates to improve readability.