Guide : W-Okada, realtime voice cloning (2024)

Table of Contents
Information Update :
Back to Articles
Community Article

PublishedFebruary 17, 2024

Upvote4

LenylvtLeny

Information

Before you begin, please note that footnotes are provided at the end of this guide. These footnotes are indicated by superscript numbers, such as ^0, corresponding to additional information or references found at the bottom of the document.

1 - To begin, download the corresponding archive, go on this site then selecte your version (name of the file) :

  • For Windows and Nvidia GPU : MMVCServerSIO_win_onnxgpu-cuda_v* ^1
  • Download For Windows and AMD GPU : MMVCServerSIO_win_onnxdirectML-cuda_v* ^1

Important Note for AMD GPU Users ^2

  • Download For macOS (Apple Silicon) : MMVCServerSIO_mac_onnxcpu-nocuda_v* ^1

2 - Next, you need to extract the archive to your main disk. To do this, right-click on it and select the extraction option. If you're on a Mac, simply double-click it to open.

Update :

During the update, delete everything EXCEPT the following:

  • The model_dir folder
  • Your shortcut to start_http.bat
  • The VBS script if you created one to prevent a command prompt from appearing.

Don't forget to reset your additional parameters, clipping, and audio to a value higher than the previous, as sometimes they display incorrect values.

This also includes the S.R. for server mode if you're using it.

VB-Cable is necessary to send sound to the virtual microphone for use with Discord or other software.

1 - Go to the official VB-Audio page for VB-Cable and click "Download"

2 - This will download a .zip archive that you'll need to extract into a new empty folder.

3 - Run setup_x64.exe (for 64-bit Windows), setup.exe (for 32-bit Windows) or VBCable_MACDriver_Pack*.dmg^1 (for MacOS).

4 - After installation, restart your PC/Mac so the OS can detect the VB-Cable audio device.

To use the voice changer, follow these steps:

1 - Open the folder you extracted earlier.

2 - Run the start_http.

3 - Models will start downloading. The duration of this process depends on your internet connection.

4 - After a few minutes, the application should open.

Here are the settings I recommend using for a better experience:

Hardwaref0ChunkExtra
GPU NVIDIARMVPE or CREPE_TINY1124096-16384
GPU AMD/INTELRMVPE_ONNX or CREPE_TINY1124096-16384
CPUDio or Harvest4484096-8192
Mac M2 Max and -Harvest or RMVPE_ONNX or CREPE_TINY448131072
Mac M2 Max and +RMVPE or RMVPE_ONNX or CREPE_TINY25665536
  • Please pay attention to the "Extra" option. A higher value will utilize more CPU processing power.
  • The quantity of "Chunk" affects the delay before the audio processed by the voice changer is transmitted to VB-Cable.

It's recommended to adjust these settings based on your needs and your system's power.

To achieve the best audio quality, follow these steps to configure the audio settings:

1 - Select the server audio option. It's faster than the client audio.

2 - Choose your audio devices:

  • Select your primary microphone for input.
  • Choose the VB-Cable audio device: "[MME] CABLE Input (VB-Audio Virtual Cable)" for output.
  • Use the monitor to listen to the output audio. Select your headset.

⚠️ Make sure your headphones are correctly configured as the default output device in the system settings.

If you're using other software like Discord, configure them as follows:

  • For input, select « CABLE Output ».
  • For output, choose your headset.

These configurations will ensure that the voice changer works properly with your other applications.

You can choose any you want, but we recommend the following in IV. Recommended Settings

⚠️ AMD GPU: rmvpe-onnx or crepe_tiny

The choice of the "f0Detector" model depends on how you intend to use it, whether it's for singing, speaking, rapping, etc. Here are some recommendations for different use cases:

  • RMVPE : It provides excellent quality and performance, suitable for all purposes.
  • Harvest : Suitable for basic conversations and rap with lower pitches.
  • Dio : Suitable for basic conversations and rap with medium/high pitches.
  • Crepe / Crepe-full : Recommended for speaking and singing with various pitches.
  • Crepe-tiny : A faster and less resource-intensive version of the Crepe model, ideal for many uses.

Select the model based on your specific needs to achieve the best possible results with voice conversion.

You have the option to enable or disable the noise suppression function. However, please note that this function is only available in "Client Device" mode. It's important to note that noise suppression in "Client Device" mode is slower compared to "Server Device" mode. To enable it, check the box next to "Sup1" or "Sup2". This option is effective in significantly reducing unwanted noise. However, keep in mind that it may impact audio quality and increase CPU processing load.

  • NVIDIA Broadcast can work extremely well. However, upon system restart, if you don't set its default settings separately from everything else, it might choose the virtual cable as the microphone and not work. To do this, open sound settings, scroll down to where you can press "App volume and device preferences," find the input area for this app, and choose your actual microphone. This solves any issues with the voice modifier getting stuck, according to my tests.
  • Steelseries Sonar it incorporates Clearcast, which is an excellent noise elimination feature, although not as effective as NVIDIA Broadcast. Anyone should be able to use it.

The following advanced settings are recommended for an optimal experience. Follow these recommendations to achieve the best results:

  • Protocol: sio
  • Crossfade: Overlap: 4096 Start: 0.1 End: 1
  • Truncate: 300
  • SilenceFront: on
  • Protect: 0.5
  • RVC Quality: low

After configuring all the settings, select the desired voice model from the list by clicking on it.

Click the "Start" button and wait for messages to appear in the command window output.

If you want to load your own audio models into the Voice Changer, follow these steps:

1 - Click the "Edit" button in the list of models. This will open up this menu.

2 - Click "Upload" and select the .pth/.onnx file of the model you want to use.

3 - Once the model is uploaded, click on the "no image" text on the left to set an image representing the model.

⚠️ Please note that you cannot delete already downloaded RVC models. To replace them, simply download a new model in their place.

For real-time voice conversion, you also have the option to use ONNX versions of RVC audio models.

  • When downloading a custom model, import an .onnx file instead of a .pth file.

There is limited confirmed information on whether .onnx is inherently better than .pth, but some tests suggest that .onnx may be faster than .pth for real-time voice conversion.

If you have a .pth file and want to convert it to .onnx, you can do so via W-Okada's Voice Changer:

  • Select the model you want to convert to .onnx, then click "Export to .onnx"

Using .onnx files may potentially improve the speed of real-time voice conversion. Experiment to see which option works best for you.

1 - Open Task Manager, click "Details"

2 - Right-click audiodg.exe and set the priority to "High"

3 - Right-click again and choose "Set affinity" then select only CPU 2.

Regarding the number of cores, choose an even number matching your actual processor core.

^1: The asterisk (*) means that numbers or letters can.

^2: Remember to convert all the models you use from PTH to ONNX. Your GPU will only support models in the ONNX format.

Guide : W-Okada, realtime voice cloning (2024)
Top Articles
100+ Soulmate Signs To Identify He Or She Is Your True Love (spiritual, Telepathic, Psychic, Zodiac, Psychological Signs From The Universe) - Breathe To Inspire
30 Spiritual Soulmate Connection Signs That Will Leave You To Ponder!" - Breathe To Inspire
Mvd Eagle Ranch Appointment
Health Stream Kaiser
Csuf Mail
Zavvi Discount Code → 55% Off in September 2024
Shadle Park big-play combo of Hooper-to-Boston too much for Mt. Spokane in 20-16 win
Lifestyle | Stewartstown-Fawn Grove Daily Voice
Steve Wallis Wife Age
Promiseb Discontinued
24/7 Walmarts Near Me
Seth Juszkiewicz Obituary
Kathy Carrack
Dr Paul Memorial Medical Center
Fy23 Ssg Evaluation Board Fully Qualified List
Seafood Restaurants Open Late Near Me
Kinoprogramm für Berlin und Umland
Northwell.myexperience
The Obscure Spring Watch Online Free
Test Nvidia GeForce GTX 1660 Ti, la carte graphique parfaite pour le jeu en 1080p
ZQuiet Review | My Wife and I Both Tried ZQuiet for Snoring
Gargoyle Name Generator
Krunker.io - Play Krunker io on Kevin Games
Staar English 2 2022 Answer Key
Pair sentenced for May 2023 murder of Roger Driesel
My Eschedule Greatpeople Me
Craigslist St. Paul
Lg Un9000 Review Rtings
Penn Foster 1098 T Form
0Gomovies To To
Gracex Rayne
De Chromecast met Google TV en stembediening instellen
How Much Does Hasa Pay For Rent 2022
Family Violence Prevention Program - YWCA Wheeling
No Compromise in Maneuverability and Effectiveness
Sallisaw Bin Store
Savannah Schultz Leaked
"Lebst du noch?" Roma organisieren Hilfe für die Ukraine – DW – 05.03.2022
Www Texaslottery Com
Middletown Pa Craigslist
Section 212 Metlife Stadium
Filmy4 Web Xyz.com
Okeeheelee Park Pavilion Rental Prices
Beacon Schneider La Porte
Bn9 Weather Radar
358 Edgewood Drive Denver Colorado Zillow
Cafepharma Message Boards
Gwcc Salvage
Water Temperature Robert Moses
4215 Tapper Rd Norton Oh 44203
3220 Nevada Terrace Ottawa Ks 66067
Only Partly Forgotten Wotlk
Latest Posts
Article information

Author: Allyn Kozey

Last Updated:

Views: 6471

Rating: 4.2 / 5 (63 voted)

Reviews: 94% of readers found this page helpful

Author information

Name: Allyn Kozey

Birthday: 1993-12-21

Address: Suite 454 40343 Larson Union, Port Melia, TX 16164

Phone: +2456904400762

Job: Investor Administrator

Hobby: Sketching, Puzzles, Pet, Mountaineering, Skydiving, Dowsing, Sports

Introduction: My name is Allyn Kozey, I am a outstanding, colorful, adventurous, encouraging, zealous, tender, helpful person who loves writing and wants to share my knowledge and understanding with you.