Skip to content

Commit 0b4b204

Browse files
Added quantization for OUTETTS (#2662)
- Added quantization via `nncf.quantize` to the notebook - Quantization with the `mixed` preset and `transformer` model type due to model architecture (LLM) - Quantization with the `ignored scope` due to OpenVINO issues with the optimized SDPA inference - Quality validation of the quantized model only with expert listening is possible because of the nature of the task - Performance validation only with the `generate` pipeline is possible because of the model architecture Ticket: 157133
1 parent 6361a6b commit 0b4b204

File tree

3 files changed

+380
-181
lines changed

3 files changed

+380
-181
lines changed

.ci/spellcheck/.pyspelling.wordlist.txt

+1
Original file line numberDiff line numberDiff line change
@@ -457,6 +457,7 @@ learnable
457457
LeViT
458458
LibriSpeech
459459
librispeech
460+
LibriTTS
460461
Lim
461462
LinearCameraEmbedder
462463
Lippipeline

notebooks/outetts-text-to-speech/README.md

+7-5
Original file line numberDiff line numberDiff line change
@@ -2,17 +2,19 @@
22

33
[OuteTTS-0.1-350M](https://huggingface.co/OuteAI/OuteTTS-0.1-350M) is a novel text-to-speech synthesis model that leverages pure language modeling without external adapters or complex architectures, built upon the LLaMa architecture. It demonstrates that high-quality speech synthesis is achievable through a straightforward approach using crafted prompts and audio tokens.
44

5-
More details about model can be found in [original repo](https://github.com/edwko/OuteTTS).
5+
More details about the model can be found in [original repo](https://github.com/edwko/OuteTTS).
66

7-
In this tutorial we consider how to run OuteTTS pipeline using OpenVINO.
7+
In this tutorial, we consider how to run the OuteTTS pipeline using OpenVINO.
88

99
## Notebook Contents
1010

1111
The tutorial consists of the following steps:
1212

13-
* Convert model to OpenVINO format using Optimum Intel
14-
* Run Text-to-Speech synthesis using OpenVINO model
15-
* Run Text-to-Speech synthesis with Voice Cloning using OpenVINO model
13+
* Convert the model to OpenVINO format using Optimum Intel
14+
* Run Text-to-Speech synthesis using the OpenVINO model
15+
* Run Text-to-Speech synthesis with Voice Cloning using the OpenVINO model
16+
* Optimize model using OpenVINO and NNCF
17+
* Compare original and quantized model Text-to-Speech synthesis and performance
1618
* Interactive demo
1719

1820
## Installation Instructions

notebooks/outetts-text-to-speech/outetts-text-to-speech.ipynb

+372-176
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)