mri-to-speech

This repository provides code for performing speech synthesis and F0 estimation from MRI videos.

Visit our demo website to listen to audio samples. Multi-speaker samples are also available here.

Requirements

NVIDIA GPU with NVIDIA Driver and Docker
TensorFlow NGC Container ~= nvcr.io/nvidia/tensorflow:23.02-tf2-py3

Setup

Clone this repository: git clone https://github.com/y-otn/mri-to-speech.git
Install the required packages:
```
apt update
apt install libsndfile1
```
Install Python dependencies: pip install -r requirements.txt

Training & Inference

python run.py (for speech synthesis)
python run_f0.py (for F0 estimation)

Note

The code will be refactored in future updates to improve readability.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
realTimeMRI_ATR503_dummy		realTimeMRI_ATR503_dummy
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dataset.py		dataset.py
inference.py		inference.py
models.py		models.py
requirements.txt		requirements.txt
run.py		run.py
run_f0.py		run_f0.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mri-to-speech

Requirements

Setup

Training & Inference

Note

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mri-to-speech

Requirements

Setup

Training & Inference

Note

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages