https://github.com/aryanvbw/aivoiceclonerpro

Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting-edge Tool Converts Text or Any Audio into Your Desired Voice – Your Voice, Your Way

https://github.com/aryanvbw/aivoiceclonerpro

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.9%) to scientific vocabulary

Keywords

ai artificial-intelligence artificial-neural-networks aryanvbw clone clonevoice pytorch voice-clone voice-cloneai voiceclone
Last synced: 6 months ago · JSON representation

Repository

Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting-edge Tool Converts Text or Any Audio into Your Desired Voice – Your Voice, Your Way

Basic Info
Statistics
  • Stars: 57
  • Watchers: 2
  • Forks: 9
  • Open Issues: 12
  • Releases: 1
Topics
ai artificial-intelligence artificial-neural-networks aryanvbw clone clonevoice pytorch voice-clone voice-cloneai voiceclone
Created over 2 years ago · Last pushed 11 months ago
Metadata Files
Readme License

README.md

AiVoiceClonerPRO


An easy-to-use Voice Conversion framework based on VITS, powered by python

[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb) [![Licence](https://img.shields.io/github/license/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=for-the-badge)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE) [![Huggingface](https://img.shields.io/badge/🤗%20-Spaces-yellow.svg?style=for-the-badge)](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/) [![Discord](https://img.shields.io/badge/RVC%20Developers-Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white)](https://discord.gg/HcsmBBGyVk)

Changelog | FAQ (Frequently Asked Questions)

Realtime Voice Conversion Software using RVC : w-okada/voice-changer

The dataset for the pre-training model uses nearly 50 hours of high quality VCTK open source dataset.

High quality licensed song datasets will be added to training-set one after another for your use, without worrying about copyright infringement.

Please look forward to the pretrained base model of RVCv3, which has larger parameters, more training data, better results, unchanged inference speed, and requires less training data for training.

Summary

This repository has the following features: + Reduce tone leakage by replacing the source feature to training-set feature using top1 retrieval; + Easy and fast training, even on relatively poor graphics cards; + Training with a small amount of data also obtains relatively good results (>=10min low noise speech recommended); + Supporting model fusion to change timbres (using ckpt processing tab->ckpt merge); + Easy-to-use Webui interface; + Use the UVR5 model to quickly separate vocals and instruments. + Use the most powerful High-pitch Voice Extraction Algorithm InterSpeech2023-RMVPE to prevent the muted sound problem. Provides the best results (significantly) and is faster, with even lower resource consumption than Crepe_full. + AMD/Intel graphics cards acceleration supported. + Intel ARC graphics cards acceleration with IPEX supported.

Preparing the environment

The following commands need to be executed in the environment of Python version 3.8 or higher.

(Windows/Linux) First install the main dependencies through pip: ```bash

Install PyTorch-related core dependencies, skip if installed

Reference: https://pytorch.org/get-started/locally/

pip install torch torchvision torchaudio

For Windows + Nvidia Ampere Architecture(RTX30xx), you need to specify the cuda version corresponding to pytorch according to the experience of https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/issues/21

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

For Linux + AMD Cards, you need to use the following pytorch versions:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.4.2

```

Then can use poetry to install the other dependencies: ```bash

Install the Poetry dependency management tool, skip if installed

Reference: https://python-poetry.org/docs/#installation

curl -sSL https://install.python-poetry.org | python3 -

Install the project dependencies

poetry install ```

You can also use pip to install them: ```bash

for Nvidia graphics cards pip install -r requirements.txt

for AMD/Intel graphics cards on Windows (DirectML): pip install -r requirements-dml.txt

for Intel ARC graphics cards on Linux / WSL using Python 3.10: pip install -r requirements-ipex.txt

for AMD graphics cards on Linux (ROCm): pip install -r requirements-amd.txt ```


Mac users can install dependencies via run.sh: bash sh ./run.sh

Preparation of other Pre-models

RVC requires other pre-models to infer and train.

You need to download them from our Huggingface space.

Here's a list of Pre-models and other files that RVC needs: ```bash ./assets/hubert/hubert_base.pt

./assets/pretrained

./assets/uvr5_weights

Additional downloads are required if you want to test the v2 version of the model.

./assets/pretrained_v2

If you want to test the v2 version model (the v2 version model has changed the input from the 256 dimensional feature of 9-layer Hubert+final_proj to the 768 dimensional feature of 12-layer Hubert, and has added 3 period discriminators), you will need to download additional features

./assets/pretrained_v2

If you are using Windows, you may also need these two files, skip if FFmpeg and FFprobe are installed

ffmpeg.exe

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffmpeg.exe

ffprobe.exe

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffprobe.exe

If you want to use the latest SOTA RMVPE vocal pitch extraction algorithm, you need to download the RMVPE weights and place them in the RVC root directory

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt

For AMD/Intel graphics cards users you need download:

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.onnx

```

Intel ARC graphics cards users needs to run source /opt/intel/oneapi/setvars.sh command before starting Webui.

Then use this command to start Webui: bash python infer-web.py

If you are using Windows or macOS, you can download and extract RVC-beta.7z to use RVC directly by using go-web.bat on windows or sh ./run.sh on macOS to start Webui.

ROCm Support for AMD graphic cards (Linux only)

To use ROCm on Linux install all required drivers as described here.

On Arch use pacman to install the driver: pacman -S rocm-hip-sdk rocm-opencl-sdk

You might also need to set these environment variables (e.g. on a RX6700XT): export ROCM_PATH=/opt/rocm export HSA_OVERRIDE_GFX_VERSION=10.3.0 Also make sure your user is part of the render and video group: sudo usermod -aG render $USERNAME sudo usermod -aG video $USERNAME After that you can run the WebUI: bash python infer-web.py

Credits

Thanks to all contributors for their efforts

Owner

  • Name: Vivek W
  • Login: AryanVBW
  • Kind: user
  • Location: india
  • Company: @TEch-Shop

🛰 𝐒𝐢𝐦𝐩𝐥𝐢𝐜𝐢𝐭𝐲 𝐒𝐩𝐚𝐫𝐤𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐢𝐨𝐧🛡️ 🌟 Dive into the Future: ✨𒆜🆅🅸🆅🅴🅺 🆆𒆜 Studying in Newton school of Technology🚀

GitHub Events

Total
  • Watch event: 29
  • Delete event: 3
  • Issue comment event: 2
  • Push event: 1
  • Pull request event: 7
  • Fork event: 3
  • Create event: 4
Last Year
  • Watch event: 29
  • Delete event: 3
  • Issue comment event: 2
  • Push event: 1
  • Pull request event: 7
  • Fork event: 3
  • Create event: 4

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 1
  • Total pull requests: 16
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 6 months
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.19
  • Merged pull requests: 1
  • Bot issues: 1
  • Bot pull requests: 12
Past Year
  • Issues: 1
  • Pull requests: 2
  • Average time to close issues: about 2 months
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 1
  • Bot pull requests: 2
Top Authors
Issue Authors
Pull Request Authors
  • dependabot[bot] (12)
  • AryanVBW (4)
Top Labels
Issue Labels
Pull Request Labels
dependencies (12) python (2)