Commit 70e5e71

add OpenVoice (#1633)
* initial
* moved some parts to openvoice_utils
* move wrapper to separate classes
* successfully moved all infer into openvino_utils.py
* successfully run gradio
* ready for review
* add Readme, add steps explanations
* Update README.md
* fetch notebook_utils.py, delete quantization
* fix pip conflicts
* spellcheck fix
* remove webm
* update readme, install ffmpeg
* typo fix
* remove webm file
* typo fix
* apply comments
* fix enable_chinese_lang = False
* typos fix
* ignore openvoice on treon mac
* add -q to pip install
* remove redundant close for gradio
* make notebook independent of download utils
* cleanup and add metadata
* make linter happy
* make manually selected voice more natural: fix source speaker embeddings
1 parent de82a90 commit 70e5e71

5 files changed: +1080 -1 lines changed


.ci/ignore_treon_mac.txt

+1
@@ -47,4 +47,5 @@
 281-kosmos2-multimodal-large-language-model
 279-mobilevlm-language-assistant
 283-photo-maker
+284-openvoice
 404-style-transfer-webcam

.ci/spellcheck/.pyspelling.wordlist.txt

+8 -1
@@ -781,4 +781,11 @@ ZavyChromaXL
 Zongyuan
 ZeroScope
 zeroscope
-xformers
+xformers
+OpenVoice
+BaseSpeakerTTS
+ToneColorConverter
+nn
+lang
+OpenVoiceBaseClass
+processings

README.md

+1
@@ -55,6 +55,7 @@ Check out the latest notebooks that show how to optimize and deploy popular mode
 | [DepthAnything](notebooks/280-depth-anything)<br>[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/openvinotoolkit/openvino_notebooks/HEAD?filepath=notebooks%2F280-depth-anythingh%2F280-depth-anything.ipynb)<br>[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/openvinotoolkit/openvino_notebooks/blob/main/notebooks/280-depth-anything/280-depth-anything.ipynb) | Monocular Depth estimation with DepthAnything and OpenVINO | <img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/29454499/a9a16658-512f-470c-a33c-0e1f9d0ae72c width=300> |
 | [Kosmos-2: Grounding Multimodal Large Language Models](notebooks/281-kosmos2-multimodal-large-language-model)<br> | Kosmos-2: Grounding Multimodal Large Language Model and OpenVINO™ | <img src=https://huggingface.co/microsoft/kosmos-2-patch14-224/resolve/main/annotated_snowman.jpg width=225> |
 | [PhotoMaker](notebooks/283-photo-maker)<br> | Text-to-image generation using PhotoMaker and OpenVINO | <img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/91237924/88bccc4a-5789-42ca-8a68-f402c3e7c5a4 width=300> |
+| [OpenVoice](notebooks/284-openvoice)<br>[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/openvinotoolkit/openvino_notebooks/HEAD?filepath=notebooks%2F284-openvoice%2F284-openvoice.ipynb)[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/openvinotoolkit/openvino_notebooks/blob/main/notebooks/284-openvoice/284-openvoice.ipynb) | OpenVoice a versatile instant voice tone transferring and generating speech in various languages. |<img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/5703039/ca7eab80-148d-45b0-84e8-a5a279846b51 width=300> |
 
 ## Table of Contents
 