AI Python Projects: Text to Speech, Unlimited Free AI Images, FaceFusion, DeepLiveCam

Saujan Man Pradhan, WordPress Designer and Graphic Designer

Get unlimited access to every class

Taught by industry leaders & working professionals

Topics include illustration, design, photography, and more

Lessons in This Class

- 1.
  
  Introduction AI Projects
  
  1:35
- 2.
  
  TTS Kokoro
  
  3:41
- 3.
  
  TTS Edge
  
  5:13
- 4.
  
  Clone your Voice
  
  8:27
- 5.
  
  TTS Tortoise
  
  8:18
- 6.
  
  Clone any voice with RVC
  
  8:58
- 7.
  
  Clone Your Voice Using OpenVoice Text to Speech
  
  16:22
- 8.
  
  Fooocus Local Win PC
  
  8:17
- 9.
  
  Fooocus on Google Colab
  
  2:00
- 10.
  
  Facefusion
  
  5:12
- 11.
  
  DeepLiveCam
  
  9:22
- 12.
  
  IRoopDeepFaceCam
  
  3:11

About This Class

Class Overview

Welcome to "AI Projects: Text to Speech, Unlimited Free AI Images, FaceFusion, DeepLiveCam" — a hands-on course where you’ll explore some of the most exciting open-source AI tools available today. This class focuses on practical, beginner-friendly ways to bring AI to your own PC, including text-to-speech, image generation, and face-swapping tools. By the end, you’ll have a collection of powerful AI applications running on your computer.

What You Will Learn

In this course, you’ll:
- Install and use popular Text-to-Speech (TTS) tools like Kokoro TTS, Edge TTS, and Tortoise TTS
- Generate unlimited AI images for free using the Fooocus AI project
- Experiment with FaceFusion and other applications for fast, easy face-swapping in photos and videos
- Run open-source AI projects locally without needing advanced coding skills
- Build confidence using AI tools for creative, professional, and personal projects

Why You Should Take This Class

AI is rapidly transforming how we work, create, and communicate — but many people don’t know they can access powerful AI tools for free. This course gives you hands-on experience with cutting-edge applications, without requiring expensive subscriptions or advanced technical knowledge.

Whether you’re a hobbyist, a content creator, or simply curious about AI, you’ll leave this class with the skills to use open-source AI tools for your own projects.

Who This Class Is For

This course is designed for anyone curious about AI, including students, creatives, photographers, videographers, and hobbyists. No prior coding or technical expertise is needed — just a computer and a willingness to explore.

Materials / Resources

You’ll need:
- A Windows computer with internet access
- Basic computer knowledge

Included in the course:
- Links to all the featured open-source AI projects
- Step-by-step guidance for installation and usage

By the end of this course, you’ll have a personal toolkit of free AI applications and the know-how to use them for your own creative or professional needs.

Meet Your Teacher

Saujan Man Pradhan

WordPress Designer and Graphic Designer

Teacher

Greetings everyone!

I am a WordPress Developer, Graphic Designer and Social Media Marketing Expert with a Master's degree (MBA) from Nepal and more than 8 years of experience in design and marketing.

I have been working as a WordPress Developer for more than 5 years now. I have worked on both back-end and front-end development, including WordPress themes and plugins. I do theme customization, design and much more. Being a Graphic Designer helps me play with color choices and communicate better with clients, as I sometimes use images to showcase the actual design before it is made.

For graphics I mostly use Adobe Photoshop to turn my concepts into reality and also use Microsoft PowerPoint to present the ideas through presentat...

Related Skills

ChatGPT AI & Innovation AI for Music & Audio AI for Productivity

Level: Intermediate

Hands-on Class Project

In this project, you’ll create your own AI Toolkit by installing and experimenting with a selection of free, open-source AI applications — from text-to-speech tools to image generation and face-swapping software. By the end, you’ll have a fully functional set of AI tools running on your own PC.

Project Steps

Follow these steps to complete the project:

Choose and Install AI Tools
- Start with one Text-to-Speech tool (Kokoro TTS, Edge TTS, or Tortoise TTS).
- Move on to installing Fooocus AI for image generation.
- Install FaceFusion or another face-swapping tool for photo/video experimentation.
Test and Experiment
- Run each tool to understand its capabilities.
- Generate sample outputs (audio files, images, or face-swapped content).
- Try different settings and document your favorite results.
Create Your AI Toolkit
- Collect your installed tools and organize them into a folder or dashboard on your PC.
- Save your best outputs (voices, images, or videos).
Upload Your Project
- Share screenshots of your installed tools.
- Include at least one sample result from each tool you tried (audio snippet, image, or before/after face swap).
- Write a short note about which tool you liked most and why.

Final Deliverable

Upload the following to the Project Gallery:
- Screenshots of your installed AI tools running on your computer
- Sample outputs (image, audio snippet, or video stills) from at least two tools you tried
- A short reflection (2–3 sentences) describing your favorite tool and how you plan to use it

This project is designed to give you hands-on experience and a tangible portfolio of AI applications you can showcase or continue experimenting with after the class.

Transcripts

1. Introduction AI Projects: Hi, everyone. I'm Saujan Pradhan, and I'm excited to introduce you to this interesting course. With the rise of AI, I have handpicked some amazing open-source AI Python projects that you can try right on your PC. We'll begin with text to speech (TTS). Check out Kokoro TTS with some sample voices. [Sample voices play: "Hi, please subscribe to our channel. Thank you very much."] Now, here is Edge TTS. [Sample plays: "All interesting and essential things you need to know."] And we have more inside, including voice cloning. Next, we will explore AI image generation for free. Check out the Fooocus AI project, where you can create unlimited AI images on your own PC. And for those into photography and videography, we'll look at FaceFusion and other applications that can easily swap faces for your projects, making the process super fast and easy. This course is curated for all computer enthusiasts. It's carefully designed so you don't need to be an expert in coding to follow along. By the end of the course, you'll have a collection of free open-source AI applications running on your computer. Let's get started.

2. TTS Kokoro: Hi, guys, here is how you can convert your text to speech using Kokoro TTS. And here are some of the sample voices. [Sample voices play: "Hi, please subscribe to our channel. Thank you very much."] We will be installing Kokoro TTS on our PC, and it is very simple to do. We will be using Anaconda to create a Python environment. If you do not have Anaconda, go to anaconda.com and click on Free Download. Provide your email or skip the registration, download the installer, and install it on your computer. After this is done, go to the Start menu and type Anaconda Prompt.
This is the one; open it. The first thing we need to do is create a Python environment. For that, type conda create -n kokoro python=3.12 (I will name the environment kokoro). This will create a Python 3.12 environment. Press Y and Enter. So the Python environment has been created. Let's activate it. Type conda activate kokoro and Enter. You can see the base name has changed to kokoro. After this, we need to install Kokoro. For that, type pip install kokoro and Enter. So this is done. Now we need to install spaces. For that, type pip install spaces and Enter. So this is also done, and now we also need to install Gradio. For that, type pip install gradio. So this is also done. Now, in this folder, we have created a custom file. This is a Python file in which we have modified the Kokoro TTS demo, and we will provide it. Anyway, copy this address. In Anaconda Prompt, type cd, press Control plus V, and Enter. So we are inside that folder. Now type python run.py and Enter. You might see some warnings, but you can ignore them. And here is Kokoro TTS. By the way, if this does not open automatically, kindly copy and paste this URL into any browser. Anyway, this is the Kokoro interface. Input your text here and click on Generate. This is using this voice. [Sample plays: "Hi, please subscribe to our channel. Thank you very much."] So this is how it sounds, and you can simply download it. Let me choose more voices as samples. You can also change the speed. I'm using the CPU, and it works all good. [Sample voices play: "Hi, please subscribe to our channel. Thank you very much."] We hope this video was useful. Thank you very much.

3. TTS Edge: Hi, guys, here is how you can install Microsoft's text to speech on your computer. Microsoft offers free text to speech. Let me quickly demonstrate. Let me open Microsoft Edge, and here is my sample text, which I want to convert to an audio format.
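For reference, the Kokoro setup typed during lesson 2 can be collected into one list. This is only a sketch of the spoken steps: the environment name `kokoro`, the script name `run.py`, and the folder path are assumptions, since the transcript gives them only approximately and the instructor's custom demo file is supplied separately.

```python
# Setup steps from the Kokoro lesson, in the order they are typed into
# Anaconda Prompt. ENV_NAME and DEMO_SCRIPT are assumptions: the transcript
# only gives their approximate spoken form.
ENV_NAME = "kokoro"
DEMO_SCRIPT = "run.py"


def kokoro_setup_commands(project_dir: str) -> list[str]:
    """Return each setup command as a plain string, one per step."""
    return [
        f"conda create -n {ENV_NAME} python=3.12",  # new Python 3.12 environment
        f"conda activate {ENV_NAME}",               # prompt prefix changes to the env name
        "pip install kokoro",
        "pip install spaces",
        "pip install gradio",
        f'cd "{project_dir}"',                      # folder that holds the demo script
        f"python {DEMO_SCRIPT}",                    # starts the Gradio interface
    ]


if __name__ == "__main__":
    # The folder below is a placeholder, not a path from the lesson.
    for number, command in enumerate(kokoro_setup_commands(r"C:\path\to\kokoro-demo"), start=1):
        print(f"{number}. {command}")
```

Printing the list gives a checklist to type line by line; the commands themselves are meant for Anaconda Prompt, not for Python.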
Let me quickly drag it to Microsoft Edge, select all with Control plus A, and click on Read Aloud Selection. [Sample plays: "All interesting and essential things you need to know."] So that's Microsoft Edge's text to speech. You can also choose a voice. There are lots of options, and you can also adjust the speed. This works great. However, it does not allow you to save the audio as MP3. You would have to record the screen. So instead of that, let's install this on your own PC, and you will be able to save it as MP3 as well. For that, we will be using this GitHub repository. Like every other project, we will be using Anaconda, but you can install Python directly and use that as well. For those who do not have Anaconda, go to anaconda.com and click on Free Download. You can give your email; I will skip registration. Download the installer and install it on your PC. After this is done, go to the Start menu, type Anaconda, and open Anaconda Prompt. First, we need to create a Python environment. For that, type conda create --name edge-tts python=3.13. This will install the latest Python version as of now. Press Enter, and Enter again. So the Python environment has been created. Let's activate it: conda activate edge-tts. You will see the base name has changed to edge-tts. Now we need to install Edge TTS. For that, type pip install edge-tts and Enter. So we have successfully installed Edge TTS. Now, the first thing we need to do is list the voices that we can use. If you go to the GitHub page and scroll down, there is a command that lists all the voices. Simply type edge-tts --list-voices and Enter. Now, here are all the voices you can use for your project. Let's use this one. You can use the exact voice you want. Let me copy this command, without the dollar sign, and paste it into Notepad. Here it is. Now, in this part, you can add your own text. For now, I will only copy this part. Add your text here. This part is for the voice you want to use. You can choose any voice you want.
Let's choose one of the voices and replace it here. Now let's also choose the location where we want to save the file. Let me save it to the desktop. In fact, I will create a new folder. Copy this address. And here, after --write-media, goes the location. Make sure to use inverted commas. Likewise, this is for the subtitles in SRT format. I will use the same location. Make sure to use the opening and closing inverted commas. So this is done; copy this, paste it here, and press Enter. This might take some time depending on your text length and the voice used. So this is done, and here is the output. We forgot to change the file name. This is the name the file will be saved as. You can use any name you want. Anyway, let's open this. [Output plays: "All interesting and essential things you need to know."] And now you have your text as audio in MP3. But please make sure you are connected to the Internet. Hope this video is useful. Thank you very much.

4. Clone your Voice: Hi, guys. Here is how you can clone your voice using just a few seconds of audio. [Sample plays: "Please subscribe to our channel."] In order to clone your voice, we will be using F5-TTS. This is a Python application, and here are all the details on how you can install it. Like other projects, we will be using Anaconda to install this. So go to anaconda.com and click on Free Download. Scroll down and click on Download Miniconda Installer. In here, you can download the distribution installer. Click here for Windows and simply download and install it. The file size is nearly a GB, so we have already downloaded and installed it. After installing Anaconda, go to the Start menu and type Anaconda. You will find Anaconda Prompt. Simply open it. So here it is. Now let me go back to the GitHub page. Scroll down a bit. Here is how you can install it. Let's first create a Python 3.11 environment. Let me copy this and paste it here. This will create a Python environment named f5-tts with Python 3.11. Now press Enter, then Y and Enter.
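As a reference for the Edge TTS command assembled in Notepad during lesson 3, here is a minimal sketch that builds the same command line from its parts. It assumes the `edge-tts` command-line tool and its `--voice`, `--text`, `--write-media`, and `--write-subtitles` options; the voice name, text, and file names are placeholders, not values taken from the lesson.

```python
import subprocess


def build_edge_tts_command(text, voice, media_path, subtitles_path=None):
    """Assemble the edge-tts arguments as a list, one element per argument."""
    command = [
        "edge-tts",
        "--voice", voice,             # a name taken from `edge-tts --list-voices`
        "--text", text,               # the text to read aloud
        "--write-media", media_path,  # where the MP3 is saved
    ]
    if subtitles_path is not None:
        command += ["--write-subtitles", subtitles_path]  # optional subtitle file
    return command


if __name__ == "__main__":
    # Placeholder values; replace them with your own text, voice and paths.
    command = build_edge_tts_command(
        text="All interesting and essential things you need to know.",
        voice="en-US-AriaNeural",
        media_path=r"C:\Users\me\Desktop\tts\output.mp3",
        subtitles_path=r"C:\Users\me\Desktop\tts\output.srt",
    )
    # list2cmdline adds the inverted commas Windows needs around spaces.
    print(subprocess.list2cmdline(command))
```

Keeping the arguments in a list avoids quoting mistakes when a path or the text contains spaces. The printed line can be pasted into Anaconda Prompt with the edge-tts environment active and an Internet connection available.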
So we have successfully created the Python environment f5-tts. Now we need to activate it: type conda activate f5-tts and Enter. You will see that the base name has changed to f5-tts. Now let's install F5-TTS. By the way, this is the command we used to activate the environment. If you have NVIDIA graphics, run this command. If you have an AMD GPU, run this one; for an Intel GPU, run this. And this is for Apple Silicon (M series). Now, this PC does not have a GPU, so I will skip that and run this command. Let me copy this and paste it here. This will install all the packages for F5-TTS, and it will take some time. So this is done. We also need to install FFmpeg, so type conda install "ffmpeg<8" -c conda-forge. Basically, we are installing an FFmpeg version below 8, which is the compatible version for this. Press Enter, then Y and Enter. So this is also done. Now let's also install TorchCodec. Type pip install torchcodec and Enter. So this is also done. Now go back to the GitHub page and download the files. Go to Code and click Download ZIP. The file size is very small. So it's downloaded; click Show in folder. Now open the ZIP file. I will close all this and extract it to the desktop. So here it is. Now open this folder, go to the src folder, then the f5_tts folder, and copy this address. Go back to Anaconda Prompt, type cd, and paste it with Control plus V. You can see we are inside this folder. Actually, we need the infer folder. So we need to be here; copy the address again, type cd, and paste it here. So we're now inside the infer folder. Let me clear this up. After this, we need to open the infer_gradio Python file. So type python infer_gradio.py and Enter. Please note, during the first run, it might also download some models, which might be over a GB in size, so it can take some time. Anyway, here is a URL. You can copy it, open any browser, and paste it there, or you can press Control and click on it. Here is the interface.
Now let's try to clone the voice. Let me simply upload the audio. Here is a sample audio. It is only 30 seconds long. Please use a clear recording of your voice. Let me play it and hear how it sounds. [Sample plays: "Hi, guys. So we updated our iPhone to the latest iOS 26. And what's up with these icons."] So this is my voice. You can also add the text of the audio, but we'll leave this empty, and under the text to generate, type the text you want to convert to audio. So here it is. Under the advanced settings, please experiment for better results. After this, simply click on Synthesize. Since this PC does not have a GPU, it will take some time to clone the voice, so please be a little patient. To check whether it is working or not, go to Anaconda Prompt and you will see what is happening. It only uses a small part of the audio. You can see the reference text from our voice, and this is the text we want to convert to audio. Again, please note this will take quite some time, so please be a little patient. So this is done. Now let's hear how it sounds. [Output plays: "Please subscribe to our channel. Thank you very much."] So this is how it sounds. Not exactly like my voice, maybe because of the accent. We did try with other, native voices, and it worked great. So please do give it a try. If you like this voice and these settings, you can also reuse this seed number, so you will get the same voice. To download, you can simply click here, and the audio will be downloaded. Now, if you want to reopen the app after closing everything, it is very simple. Go to the Start menu and open Anaconda Prompt. Now activate the environment: type conda activate f5-tts and Enter. Make sure the base name has changed. After that, open the F5-TTS folder, go to src, then f5_tts, then the infer folder, copy the address, type cd, a space, Control plus V, and Enter. Then we need to run python infer_gradio.py. You can press Control and click on the URL to use the application. Hope this video is useful.
Thank you very much.

5. TTS Tortoise: Greetings, everyone. In today's video, we will quickly learn how you can install Tortoise TTS on your Windows computer. We will be installing Tortoise TTS on our Windows 11 PC. This one is developed by James Betker. Anyway, to install this, we will use Anaconda. For those who do not have Anaconda, simply Google Anaconda. This is the official website. Simply download it and install it on your computer. We have already downloaded and installed Anaconda to save time, so I will close this. After downloading and installing Anaconda, also download these files. Click on Code and Download ZIP, so it will be downloaded as a ZIP file. I have downloaded this one too, and it is in my Downloads folder. So here is the file; open it and extract it to the Desktop. So here are all our files. And this is the Anaconda installer, which I have already installed. After doing this, if you go to the Start menu and type Anaconda, you will see Anaconda Prompt. Open it. Now, the first thing we need to do is create a Python environment. For that, simply type conda create -n, then any name you want (I will write tts), then python=3.9. We have tested with Python 3.9 and it works all good, so we recommend this version. Anyway, hit Enter, and Enter again. So the Python environment with version 3.9 has been created. Let's activate it: type conda activate tts. You will see the base name change to tts. I will type cls to clear all this. Let me minimize this. Open the GitHub page. You can find all the installation instructions here. So let's first install PyTorch. Open this in a new tab. Now select as per your PC: for PyTorch Build, choose Stable; for OS, choose Windows; for Package, you can use either pip or conda. I will use conda. Language is Python. For the compute platform, I will use CPU, since this PC has no graphics card. If you have NVIDIA graphics, choose CUDA 11.7 or 11.8 as per your PC. For us, we will use CPU. Now copy this, paste it here, and press Enter.
So this is done. I will clear this so it looks clean. After installing this, close this. Before running the requirements, install PySoundFile. Copy this and paste it here. All done. Now let's run the requirements file. We need to make a few edits first. Open the tortoise folder and open requirements. In here, remove numba, which appears twice, and also remove llvmlite. If required, we will install numba manually. Save this, and it is done. Now navigate to where our tortoise folder is: type cd. We will navigate here; copy and paste. So we are inside this folder. Now run pip install -r requirements.txt. So it is done. Now also run setup.py; copy this and paste it here. So it is all done. Now let's see if this works or not. We need to run this command. Basically, it will use this text, with a random voice, and the preset is fast. You can change the voice as well as the preset as per your requirements. To explain better, go inside the tortoise folder and open the api.py file. I will use Notepad and search for fast. You can use ultra fast, as you can read here. Use fast for decent quality, use standard for good quality, and use high quality for the absolute best. Ultra fast takes less time compared to fast, and high quality takes more time. For now, let's use fast. So let's copy this and paste it here. The audio will say "I am going to speak this" text. You can change it if you want; for now, it will use random voices and the fast preset. Please note, this will take quite a long time. So this has been done. Let's check the results. Open the tortoise folder and go to results. So here are the audio files. Let me play them. [Output plays: "I'm going to speak this."] These are the random voices, and you can use this audio. You can also use your own voice. For instance, here are my audio recordings. So these are my own voices.
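The test command and its presets from this lesson can be sketched as a small helper. This assumes the script is `tortoise/do_tts.py` from the Tortoise TTS repository, with `--text`, `--voice`, and `--preset` options, and that the presets are spelled `ultra_fast`, `fast`, `standard`, and `high_quality`; the transcript only gives these names in spoken form.

```python
# Presets named in the lesson, listed from quickest to slowest.
PRESETS = ("ultra_fast", "fast", "standard", "high_quality")


def build_tortoise_command(text, voice="random", preset="fast"):
    """Build the do_tts.py call; run it from inside the tortoise project folder."""
    if preset not in PRESETS:
        raise ValueError(f"unknown preset {preset!r}; choose one of {PRESETS}")
    return [
        "python", "tortoise/do_tts.py",  # assumed script path inside the repository
        "--text", text,
        "--voice", voice,    # "random", or the name of a folder under the voices directory
        "--preset", preset,  # quicker presets trade quality for speed
    ]


if __name__ == "__main__":
    print(" ".join(build_tortoise_command("I am going to speak this")))
```

Checking the preset before launching is useful here because each run is slow on a CPU, so a typo would otherwise cost a long wait.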
All you have to do is copy this folder and, in the tortoise folder, go to tortoise and open voices. Simply paste it here. You can check all the other audio files and use them too. Anyway, let me clear this. Now let's try using our own voice. We need to run a command. Here is the command. Let me paste it into Notepad. You can write any text here, and we will use our own voice. In order to do that, we set the voice equal to the directory name. So write the equals sign and the folder name. Make sure to use the exact name of the folder. Copy the exact name and paste it here. I will use the fast preset. You can also use high quality, but it will take some time. Now copy this command and paste it here. Please note this process will take a very long time. So this has been done. Now, let's check the results. And here is our audio. [Output plays: "Subscribe to our channel."] It does not exactly sound like my voice, possibly because my accent is different, but please give it a try. We hope this video was useful. Thank you very much.

6. Clone any voice with RVC: Hi, guys. Here is how I cloned my voice to Dracula and Andrew Tate using the RVC project. [Converted sample plays: "Hi, guys. So I'm using this sample photo with DeepLiveCam and the OBS Studio to make a deep fake call on WhatsApp. Let me quickly call it, and you can see the deep fake video. Check the description on how to do this. Thank you very much."] To do that, simply go to the RVC GitHub page. I will provide the link. Click on Releases. If you have NVIDIA graphics, download this version. If you're using AMD or Intel, download this version. After downloading, you'll get a compressed file. The file size is a little over 3 GB. Now simply extract it. I've already extracted the file, as it takes a few minutes to do so. Now, this version does not have the web interface to clone voices, so we will have to create that, but it is very simple. Simply duplicate the go-realtime-gui file: copy and paste it here. Here it is.
I will rename it to web. Now let's edit it: right-click and edit in Notepad. In here, we need to make a small change. Change gui_v1.py to infer-web.py and remove the DML option. Make sure to change this to infer-web.py, and save it. So this is done. Now let's open it. This may take some time to load. So here is the interface. If it does not open automatically, take this link, replace 0.0.0.0 with localhost, keep the port, and paste it into the browser. Anyway, this offers various features, but for this video, we'll only convert our voice to other artists' voices. Here, you can also choose to train a voice, which will clone our own voice. Anyway, here are the default available voices, which are limited. So go to this website, voice-models.com, and you can download as many voices as you want. For instance, let's try Dracula. Simply click here, and it will download the file. Let's check the top voices and see if you like any of them. Or you can also search. Let's look for an Andrew Tate voice. Let's use the one with a high epoch count. The file size is about 200 to 300 MB, as you can see, and you can download and use as many voices as you like. So the download has completed. Show in folder. So here are the voices. Let me quickly extract this one here. Extract here. So here it is. If I open this, you will find a .pth and a .index file. These are what we need. Let me quickly extract Dracula as well. And here are the files. Now we need to copy these to the right folders to use them. Go back to the RVC folder. This is our main folder; go to assets and then weights, and copy the .pth file there. I will cut it. So it's here. Let's copy the Tate one, too: copy the .pth file and paste it here. Now go back to the main folder again and open the logs folder. Here, we need to copy the .index file. Let me move this one to the logs folder. Here is the Tate one. And now, again, move the Dracula one here, so it is done.
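The file placement just described (each `.pth` file into assets and then weights, each `.index` file into logs) can be sketched as a small planning helper. The folder names follow the transcript; the file names in the example are hypothetical, and the function only reports where each file should go rather than moving anything.

```python
from pathlib import Path

# Destination inside the RVC main folder for each file type, per the lesson.
DESTINATIONS = {
    ".pth": Path("assets") / "weights",  # voice model weights
    ".index": Path("logs"),              # feature index files
}


def plan_voice_file_moves(files, rvc_root):
    """Return (source, destination) pairs for the files RVC needs; skip the rest."""
    moves = []
    for name in files:
        source = Path(name)
        target_dir = DESTINATIONS.get(source.suffix.lower())
        if target_dir is None:
            continue  # not a .pth or .index file, so it stays where it is
        moves.append((source, Path(rvc_root) / target_dir / source.name))
    return moves


if __name__ == "__main__":
    # Hypothetical file names, as they might appear after extracting a download.
    extracted = ["Dracula.pth", "added_Dracula.index", "readme.txt"]
    for source, destination in plan_voice_file_moves(extracted, "RVC"):
        print(f"{source} -> {destination}")
```

To actually move the files, each pair could be passed to `shutil.move` once the planned destinations look right.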
Basically, we added two voices for now, but you can add as many voices as you want. So this is done. Let me close this as well. Now you should find the new voices here. If they are missing, simply refresh the voice list and index path. And here you can see the Dracula and Tate voices. Let's try Dracula. You can transpose this; 12 raises the pitch by an octave. Here are some settings for you to play with, so please experiment. Anyway, here goes the audio file to be processed. I have my own sample audio, taken from one of my shorts. This is how it sounds. [Sample plays: "Hi, guys. So I'm using this sample photo with DeepLiveCam and the OBS Studio to make a deep fake call on WhatsApp. Let me quickly call it, and you can see the deep fake video. Check the description on how to do this. Thank you very much."] That's my sample voice from one of my shorts. Let's convert it into Dracula's voice. We need to add the audio path too. For that, right-click on the audio file, choose Copy as path, and paste it here. Leave this feature index at its default. In here, we can choose the exact index file. So for Dracula, here is the one. Here you can choose the pitch extraction settings. pm is faster but lower quality; harvest is better but extremely slow; crepe is better quality but uses a lot of GPU; and rmvpe is the best quality with little GPU use. So I will use this one. You can also tweak these settings as per your requirements. So let's try to convert it. And it is done. [Converted audio plays: "Hi, guys. So I'm using this sample photo with DeepLiveCam and the OBS Studio to make a deep fake call on WhatsApp. Let me quickly call it, and you can see the deep fake video. Check the description on how to do this. Thank you very much."] So that's the audio in Dracula's voice. You can also download it, so it's saved. Now let's try Andrew Tate's voice. I will simply choose Tate. The audio is the same; change the index to the Tate one. And let's try to convert it. Again, please play with the settings to get better results. Each voice is different. So it's done. Let's listen to it.
[Converted audio plays: "Hi, guys. So I'm using this sample photo with DeepLiveCam and the OBS Studio to make a deep fake call on WhatsApp. Let me quickly call it, and you can see the deep fake video. Check the description on how to do this. Thank you very much."] To make it more convincing, we can adjust these settings and find the best match that works. We really hope this video is useful to you. Thank you very much.

7. Clone Your Voice Using OpenVoice Text to Speech: In this video, we'll quickly learn how you can clone your voice in seconds and also convert text to speech using OpenVoice. Basically, we will be installing OpenVoice on our Windows PC. We will be using Anaconda to run this application. For that, go to anaconda.com and click on Free Download. You can give your email or skip registration. I will skip the registration, download the installer, and install it on the computer. After Anaconda is installed, if you go to the Start menu and type Anaconda, you will find Anaconda Prompt. Let me open it. Now, the first thing we need to do is create a Python environment. For that, type conda create --name openvoice python=3.10 (I will use the name openvoice). We have tested with Python 3.10; that's why we will be using this one. This will create a Python environment named openvoice with Python 3.10. Press Y and Enter. So the Python environment has been created. Now let's activate the environment. For that, type conda activate openvoice. Make sure the base name has changed to openvoice. Now again, go back to the GitHub page, and let's download all these files: go to Code and Download ZIP. I have actually already downloaded it. Anyway, Show in folder; I will delete this one, as I've already downloaded it earlier. So here is the OpenVoice ZIP file; open it. Let me extract it to the desktop. Let me close this. So here is the folder; open it. Now, we need to copy this address. Copy it, go back to Anaconda Prompt, and type cd, a space, and Control plus V. So now we are inside this folder.
Now let's install all the dependencies required to run OpenVoice. For that, type pip install -r requirements.txt and Enter. This might take some time, as it will download lots of files. You will be stuck at this screen for a few minutes, so please be a little patient. I am fast-forwarding it. So this is all done. We also need to install FFmpeg to run OpenVoice. For that, type conda install ffmpeg and Enter, then Y and Enter. So FFmpeg has also been installed. Now, to run these files, we are also required to install Jupyter Notebook. For that, type conda install -c conda-forge jupyterlab and Enter. There was a little typo with an extra T, so let me retype it. It froze. Let me try to rerun the command. And now this seems to work. Press Y and Enter. So this is done. Now, we also need to install some widgets for Jupyter Notebook. For that, type pip install notebook ipywidgets. So now everything has been installed. Now again, go back to the GitHub page. Scroll down, and you will find the Usage link. Click on it. We need to download the V1 and V2 checkpoints. Click on V1. For Demo Part 1 and Part 2, we need this checkpoint. Simply click here and download it. I have already downloaded it. The file size is about 400 MB. Likewise, also download the V2 checkpoint. The file size is about 116 MB, and I've already downloaded this one too. I will cancel this. Let me go to the Downloads folder. Let me open the V1 checkpoint and extract it here. So it's here. Likewise, let me open the V2 checkpoint and extract it here. So it's here. Now let's move these to the OpenVoice folder. Let me close this, open this folder, and paste them here. So we have the checkpoints here. Now go back to Anaconda Prompt. Let me clear this with cls. Now let's open Jupyter Notebook: type jupyter notebook and Enter. So this is Jupyter Notebook. Let me close all this. Let's open Demo Part 1. Now let's run this. Highlight this cell and click on Play.
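The OpenVoice setup steps dictated so far can be summarized as a checklist. This is a sketch of the spoken commands, not an official install script: the environment name `openvoice` is an assumed spelling, and the `pip install -r requirements.txt` step has to be run from inside the extracted OpenVoice folder.

```python
# (description, command) pairs in the order the lesson runs them.
OPENVOICE_STEPS = [
    ("Create the environment", "conda create --name openvoice python=3.10"),
    ("Activate it", "conda activate openvoice"),
    ("Install dependencies from the OpenVoice folder", "pip install -r requirements.txt"),
    ("Install FFmpeg", "conda install ffmpeg"),
    ("Install JupyterLab", "conda install -c conda-forge jupyterlab"),
    ("Install the notebook widgets", "pip install notebook ipywidgets"),
    ("Start Jupyter Notebook", "jupyter notebook"),
]


def render_checklist(steps):
    """Format the steps as a plain-text checklist, one line per step."""
    return "\n".join(f"[ ] {title}: {command}" for title, command in steps)


if __name__ == "__main__":
    print(render_checklist(OPENVOICE_STEPS))
```

The checkpoint downloads are left out because they are done in the browser and then copied into the OpenVoice folder by hand.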
This asterisk means it is working. So this is done. Now click here and run this one as well. You might see this message, but it will work anyway. I will run all of these. For now, we are using the reference MP3. Run all of these. So this is done, and now this is the text we want to convert. You can add your own text here. If I go to the OpenVoice folder and go to resources, this is the example reference voice. Let me quickly play it so that you can hear it. [Reference plays: "When I was a wanted man, the resistance gave me a lot of help."] So this is the voice we are using for now. Let's see how this works. Let's run this one. So this is done. The output name is output_en_default. In this folder, if you go to outputs, let me refresh this. So here is the default output. Let me play it. [Output plays: "Please subscribe to our channel."] This is the voice, and it is working all good. Now, in here, we can also add different styles to the voice, like friendly, cheerful, excited, sad, et cetera. For now, let's try cheerful. I will replace whispering. I will use the same text as earlier. And let's run this again. So this is done. You can also change the speed here. Let me refresh this. So here is the audio. Let me play it. [Output plays: "Please subscribe to our channel."] So you can hear the difference. Likewise, this cell will use the Chinese language. Now let's try with our own voice. Here is our sample audio. [Sample plays: "Hi, guys, here is how you can password protect your PDF file."] It is just over a minute of audio. Let me cut this and paste it inside the resources folder. And here is my audio. In here, replace the name with your audio name. It is audio.mp3. Make sure the name and extension are correct. Let me run this. So it is all done. Now let's try with our own voice. Let me change the output name to the default output name plus one, and run it. So this is done. I'm in the output folder. Let me refresh it. So here it is. Let me play it. [Output plays: "Please subscribe to our channel."] So that's the clone of my voice.
It's definitely not so great, possibly because of my accent, but you can try with yours. And I can also add emotions to it. Let's try whispering. Rename the output to whispering 1 and run this. By the way, you can add as much text as you want. Let me refresh this. So here it is. And let me play the audio. Please subscribe to our channel. So that's how it sounds. So this part is done. So this is using Demo Part 1. You can also use Demo Part 2 and Demo Part 3 likewise. Let me close Demo Part 1. Here is Demo Part 2 and Demo Part 3. Let me open Demo Part 2. For Demo Part 2, it is suggested to add the OpenAI API key. For now, I'll skip this. But if you want, you can get the API key from the OpenAI website. You might get free credit if you are using it for the first time, but eventually the API is a paid service. Anyway, I will skip this and instead do Demo Part 3. Now, in order to use Demo Part 3, we need to install some more applications. Let me go back to the Anaconda Prompt. Press Ctrl+C. This will stop the Jupyter Notebook. I will clear this as well. Again, let's open the OpenVoice GitHub page. This is the one. Go to Usage and click on Version 2. You will need to install MeloTTS to use Version 2. Copy this code and paste it into the Anaconda Prompt. Press Enter, and it will install all the required files. Demo Part 3 is a little more advanced than Demo Part 1, so please do give it a try. Again, press Enter. It will download some files, so it will take some time. So all the downloads have finished. Now we need to install some further dependencies. Without them, we had encountered some issues earlier. So we have this download NLTK file. This will be provided. This will basically import NLTK and download all the required files. Without installing this, we could not run Demo Part 3. So let's install this. I will put this inside the OpenVoice folder. Let me copy the name.
And now in the Anaconda Prompt, type python followed by the name of that .py file and press Enter. These steps are only required for Demo Part 3. If you are satisfied with Demo Part 1, you do not need to install all this. Anyway, click on Download. So the downloads have finished. You can close this. So we have installed everything. Now again, let's run Jupyter Notebook. Type jupyter notepad. Sorry, there is an error. It's actually jupyter notebook. Let me close the previous ones. Now, let's run Demo Part 3. Let's run this first. If it is running, you will notice this asterisk. So this is done. Now let's run this as well. This will use the checkpoint version 2. Earlier, we were using version 1. This is done too. So this is all done. In the reference audio, use your own audio. So this is all done. So this is using MeloTTS, the one we installed earlier. So this can use the accents of all these languages. Only use the ones you require. It can do it in multiple accents, but it will take some time. Let me run this. By the way, the output folder has changed for this one. It's the output version 2 folder. This is working. This is the output version 2 folder. You will find all the outputs here. And this will take some time. It will download the checkpoints it requires for each language, and it will take some time. Let me quickly check the output folder. So here are the outputs in various tones. Let's listen to them. Did you ever hear a folk tale about a giant turtle? This is with an Indian accent. Did you ever hear a folk tale about a giant turtle? British accent. Did you ever hear a folk tale about a giant turtle? Now, let me stop this with Ctrl+C. So this is how it works. I will stop all this. So this is how OpenVoice works. Now, if you want to reopen OpenVoice later, let me first close all of this. Simply go to the Start menu and open the Anaconda Prompt. Open this and activate the OpenVoice environment. The base name should change. After this, navigate to the OpenVoice folder.
Please do not delete any checkpoints. So copy this, then type cd, a space, and Ctrl+V. So we are inside this folder, and then run jupyter notebook. And you can run Demo Part 1, Part 2, or Part 3 as you want, and always make sure to run the cells. Anyway, we hope this video was useful to you. Thank you very much.
8. Fooocus Local Win PC: Hi, guys. In today's video, we will learn how you can create AI images for free using Fooocus. So basically, we will be installing this on our PC. There are several ways to do it, but we will be using the easiest method. Here are some sample images. Anyway, so we will install this on our Windows PC. Here are the instructions for that. Let's download the files. Click on Download. Please note you will need to download more than 10 GB of files. So the file has been downloaded. Show in folder, and this is a 7z file. You can use WinRAR to extract it. Double click on it. I will extract the files to the desktop. Let me create a new folder. I will extract all the files here. Please note this extraction will take some time. All the files are here. You can simply run this .bat file and it will download all the necessary files. However, we have had some issues, so we downloaded the files manually. Let me go back to the GitHub page. Let's download these default models. I will open this in a new tab. Right click and choose Save link as. I will save it to the desktop. These are small files, but download all of them. So it is done. So these files must go inside these folders: Fooocus, models, checkpoints. Go to Fooocus, models, checkpoints and place these files here. This is done. Now let's also download this inpaint file. This is about 1.2 GB. Open the link here. Sometimes the link may not work, so please refresh it a few times until the download starts. So the download has started. I will pause this as I've already downloaded this file. Make sure to give it a few tries if the link does not work. I will cancel this.
Let me navigate to the file which I have already downloaded. This is the file. Let me cut this. And we need to put it inside Fooocus, models, inpaint. Go to Fooocus, models, inpaint and paste it here. So this is done. Now open this run.bat file. So it will try to download this file, which is nearly 6 GB. We have had issues downloading this file. So let me copy this link and paste it here. So the file is being downloaded. I will pause this too and cancel the download since I've already downloaded the file. So this is the file, which is nearly 6 GB. Let me cut this. So here is an error again. So we recommend downloading the files manually. And this file goes to Fooocus, models, checkpoints. Let me go to the folder Fooocus, models, checkpoints and paste it here. So basically, we downloaded these files and this one for checkpoints. And we also downloaded the inpaint file. So this is all done. Let me close all of this. Now, if I press any key, the CMD window will close. Let's open the run.bat file again. This app requires NVIDIA, but this PC does not have NVIDIA graphics. So for smooth operation, it is recommended to have NVIDIA. However, if you don't have it, here's an alternative. Go back to the GitHub page and scroll down. And here is what you can do if you have an AMD graphics card. Now, copy this code. Open the Fooocus folder. Let me close this. Right click on the run.bat file, and you can edit it in Notepad or Notepad++. So replace all the code with the GitHub code and save it. And again, open this run.bat file. Now, this will use the CPU. The generation of images will take quite some time. This will also download some files, so please be a little patient. Finally, the application is running well. Click on Advanced. Here, you can change some settings such as speed, aspect ratio, image format, number of images, et cetera, and you can give any prompts you like to generate images. Please note this will take some time depending on your computer.
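The manual download steps in this lesson come down to moving each downloaded file into either the checkpoints or the inpaint folder under Fooocus, models. As a rough sketch of that step only, here is a small helper that does the move. place_model_file is a hypothetical name used for illustration, and fooocus_dir is assumed to be the folder that directly contains models; nothing here is part of Fooocus itself.

```python
import shutil
from pathlib import Path

# The two destination folders named in the video.
ALLOWED_KINDS = ("checkpoints", "inpaint")


def place_model_file(downloaded_file, fooocus_dir, kind):
    """Move a manually downloaded file into <fooocus_dir>/models/<kind>."""
    if kind not in ALLOWED_KINDS:
        raise ValueError(f"kind must be one of {ALLOWED_KINDS}, got {kind!r}")
    target_dir = Path(fooocus_dir) / "models" / kind
    # Create the folder if the extracted package does not have it yet.
    target_dir.mkdir(parents=True, exist_ok=True)
    destination = target_dir / Path(downloaded_file).name
    shutil.move(str(downloaded_file), str(destination))
    return destination
```

Calling it once per file, with kind set to "checkpoints" for the model files and "inpaint" for the inpaint file, mirrors the cut and paste shown on screen.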
If there are some issues, make sure your PC meets the minimum requirements. If the app does not work, you will have to do it with Google Colab. Now let's see how it actually works and create a demo image. Let's type Nepal images. In Advanced, let's choose a smaller size. I will go with this one. I will have two images. We'll choose the PNG format. In the negative prompt, we'll say no people. I will also improve the prompt: Nepal images with Everest and other mountains. Under Style, there are lots of options. The cat image will give you an idea of what each style looks like. Simply choose the kind of images you really need. For now, I will choose semi realistic. And I will choose SAI Cinematic. So you can choose anything as per your requirements. And now click on Generate. I'm fast forwarding. This generation will take some time depending upon your PC's capabilities. So you can see these are pretty nice and realistic images. Please try and explore the options as per your requirements. Fooocus will create some great images. Anyway, we hope this video was useful to you. Thank you very much.
9. Foocus on Google Colab: Hi, guys, here is how you can create AI images using Fooocus on Google Colab. Now for that, open the browser and simply Google Fooocus Google Colab. This is the one, open it. So you will see this. Make sure you're already logged into your Gmail. After you are signed in, simply click on this play icon and choose Run anyway. Now, this will take some time to download all the files and make the program run. So this is done. You cannot use this localhost link, but you can use this Gradio link. Click on it. And now you can use Fooocus AI. Let me quickly try it. Give any prompts you want and generate. So Fooocus has created these beautiful images. Also, you can go to Advanced. Here are a couple of options. You can choose the aspect ratio, the number of images to generate, the output format, and the negative prompt, which is what you do not want in the image.
And under Style, choose the type of images you want: realistic, 3D model, digital art. Choose accordingly. And this you can leave at the default. One most important note while using Google Colab is that you must always check the resources used. This is a free GPU provided by Google, and you can use it for a few hours, but do not use it continuously or else it will exhaust the system and you will not be able to use it for a while. So create an image, and if the GPU usage is 10 GB or more, wait until it goes back to 7 GB and then create another image. Anyway, so try creating free AI images. Thank you very much.
10. Facefusion: Hi, guys. So FaceFusion has released another version, 3.4.1. Let's try to install it on our Windows PC. We will be using Anaconda. Go to anaconda.com and click on Free Download. Now, you have to register to download Anaconda, or you can also scroll down and click on Download Miniconda Installer. And then you can download the distribution installer from here and install it on your PC. After this, go back to the GitHub page, click on Code, and choose Download ZIP. We have already downloaded it, so here is the downloaded zip. Once this is done, open the Anaconda Prompt. So here it is. Let's first create the Python environment. Type conda create --name facefusion python=3.12. This will create a Python 3.12 environment. Y and Enter. So we have successfully created the Python 3.12 environment. Now let's activate it: conda activate facefusion. You can see that the base name has changed to facefusion. So this is the FaceFusion file we have downloaded. Let me quickly extract it here. So here it is. Open the folder, copy this address with Ctrl+C, and go to the Anaconda Prompt. Let's navigate to the folder: type cd, a space, and Ctrl+V. So we are inside this folder. Now type python install.py --onnxruntime. After entering the command, you will find a few options. This needs to be set up as per your PC. The options are default, CUDA, OpenVINO, and DirectML.
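To keep the FaceFusion setup commands typed so far in one place, here is a minimal sketch that only builds them as text and does not run anything. The environment name facefusion, the --onnxruntime flag spelling, and the lowercase option values default, cuda, openvino, and directml are reconstructed from the spoken commands and should be treated as assumptions; the installer's own prompt is the authority on what your version accepts. facefusion_setup_commands is a hypothetical helper.

```python
# Option values as heard in the video; check the installer prompt for your version.
ONNXRUNTIME_CHOICES = ("default", "cuda", "openvino", "directml")


def facefusion_setup_commands(onnxruntime="default", env_name="facefusion",
                              python_version="3.12"):
    """Return the setup commands, in order, as plain strings.

    The cd step is left out because the folder path differs on every PC;
    run the last command from inside the extracted FaceFusion folder.
    """
    if onnxruntime not in ONNXRUNTIME_CHOICES:
        raise ValueError(f"unknown onnxruntime option: {onnxruntime!r}")
    return [
        f"conda create --name {env_name} python={python_version}",
        f"conda activate {env_name}",
        f"python install.py --onnxruntime {onnxruntime}",
    ]


if __name__ == "__main__":
    for command in facefusion_setup_commands("openvino"):
        print(command)
```

Printing the list gives a checklist to type into the Anaconda Prompt one line at a time.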
Let me quickly go to the FaceFusion GitHub page, scroll down, and click on Installation. So here is the whole installation guide on how you can install it. Anyway, here is how you can choose the ONNX Runtime option. CPU runs only on your processor, with no GPU acceleration. CUDA is for a PC with NVIDIA graphics. CoreML is for macOS on M1 or M2 Apple silicon. DirectML is for Windows using a DirectX 12 compatible GPU. OpenVINO is for an Intel PC with CPU, GPU, et cetera. ROCm is for AMD graphics, CPU and GPU. So use the one that applies to you. Since this PC is Intel and it does not have a GPU, I will use OpenVINO, but please make sure to choose it as per your PC. Let me copy this and paste it here. So we will run ONNX Runtime with OpenVINO. Press Enter. This will download all the required dependencies for FaceFusion, so it will take some time. So this is all done. So there are some dependency issues. But let's see if it works or not. Let's install FFmpeg too. So type conda install ffmpeg and press Enter, then Y and Enter. So everything has been done. Let's see if everything runs or not. For that, type python facefusion.py run and press Enter. This will download some additional files. Now you can copy and paste this URL into the browser, or press Ctrl and click on it. So here is FaceFusion 3.4.1. It seems to work all fine. Let's quickly try it out. Let me upload the source image. These sample images are downloaded from Pixabay. And let me upload the target image. You will first see the reference image here. And here is the face-swapped image. So here is FaceFusion 3.4.1. We really hope this video was useful to you. Thank you very much.
11. DeepLiveCam: Hi, guys. Here is how you can install DeepLiveCam by hacksider. This is how the live webcam looks, and this is the original video, and here is the video after face-swapping. Now, we will be installing this DeepLiveCam by hacksider on our Windows PC, and it can easily swap faces and even run on a live webcam. Now, in order to install this, we will be using Anaconda.
For those who do not have Anaconda, simply go to anaconda.com and click on Free Download. Provide your email address or skip registration, and then simply download and install it on your computer. Once Anaconda is installed, go to the Start menu and type Anaconda. You will then find the Anaconda Prompt, so open it. The first thing we need to do is create a Python environment. And if you go back to the GitHub page, here is the whole installation guide. It is recommended to install Python 3.10. So let's do that. Let's create a Python environment. For that, type conda create --name followed by any name you want. I will say DeepLiveCam, and then python=3.10. So this will create a DeepLiveCam environment with Python 3.10. Y and Enter. So the Python environment has been created. Now let's activate it. For that, simply type conda activate DeepLiveCam. Here is the command, conda activate DeepLiveCam, and Enter. So this has been activated, and you can see the base name has changed to DeepLiveCam. Now we need to download some files, so let's head over to the GitHub page. We will require models to run DeepLiveCam. So download these models, simply click on it. It's about 332 MB, and I've already downloaded it. Also download this inswapper model. It's about 264 MB, and I've already downloaded this one as well. Now we need to download this whole GitHub repository. So for that, go to Code and choose Download ZIP. I've already downloaded this one as well. So these are the three downloads you will require. Let me head over to my downloads folder. So these are the files we just downloaded. Now let's open this zip file. And I'll extract it to the desktop. Here is the folder, open it. Now find the models folder and open it. Now, simply copy the two models we downloaded earlier. I'll copy them, but you can cut them as well, and paste them here. Make sure to add them inside the models folder. Now go back to the DeepLiveCam folder. Copy its address. Then go back to the Anaconda Prompt and type cd, a space, and Ctrl+V.
So we are inside this folder. Now, we need to install all the required dependencies to run this application. For that, type pip install -r requirements.txt and press Enter. So basically, this will install all the required dependencies to run this application, and this might take some time. You might be stuck here for five to eight minutes, so please be a little patient. So this is all done. Now we also need to install FFmpeg. So for that, type conda install ffmpeg and press Enter. Y and Enter. So we have installed everything. Now let's see if the application works or not. For that, type python run.py and press Enter. So this is a simple application, and it is working all fine. Now, if for some reason this is not working and is giving some errors, then go back to the GitHub page and scroll down. And here you might need to install the Visual Studio 2022 runtime for Windows. If you do not have it, open this link and simply download these build tools and install them. And if you have NVIDIA graphics, then you will need to install CUDA. Please install this CUDA version. Go to this link, download it for Windows, and install it on your PC. After that, simply run these commands: pip uninstall onnxruntime, and then install this version of ONNX Runtime with GPU support. And while opening the application, instead of using python run.py, you can execute it with CUDA. So this is the command for NVIDIA graphics. We are using Anaconda, so we do not need to create this virtual environment, so we are skipping this. Now, everything is done. Let's see how this works. Now, this is a simple application. Here is an image, and here is a sample video. I downloaded them from Pexels and Pixabay. Click on Select a face. So this is the image, open it, and you will see the image here. If you want to preview for a live chat, select the webcam here. I will use the integrated camera and then click on Live. So this is how it looks, and this is a live webcam video. How smoothly it works really depends on your PC.
Mine is a little choppy. It's doing all right, but this is a live webcam, and you can see the movements too. So this is the live webcam feature. Let me close this. Now you can also use this app to swap faces in a video. So this is a sample video. Let me open this. So this is a sample video and it's not that smooth. Anyway, let's select the target, and in here, instead of image, choose video. So this face should come here. Use better images and videos. Now simply click on Start. Save the output to your location. I will choose Desktop and click on Save. This will create this temporary folder. And in here, you can see processing. This means it's working. You might also see this error: frame processor not found. For us, this app is working all good, even with this warning. But if that does not work for you, then we will create another video to get rid of this error. This worked all fine for us, and this processing can take lots of time depending upon your PC, graphics, and more. So the processing is done. You can also see the message that processing to video succeeded. So here is the output. Let me open this. So this is the processed video. And this was the original video. Now, the video itself was a bit glitchy, but this is how the app works. Let me close this, or you can click on Destroy here. So this is all done. Let me close this. If we want to reopen this app, it is very simple. Open the Anaconda Prompt. Here it is. Activate the environment: conda activate DeepLiveCam and Enter. So you will see the base name change to DeepLiveCam. Now we need to navigate to the DeepLiveCam folder. Type cd, copy the folder address, paste it here, and press Enter. Now we are inside the folder. After this, simply run python run.py. And if you are using NVIDIA, simply run it with CUDA. Now, here is the application, and you can use it as you like. We really hope this video was useful to you. Thank you very much.
12. IRoopDeepFaceCam: Hi, guys, here is how you can install IRoopDeepFaceCam on your PC.
Now, this is similar to DeepLiveCam, but it has more add-on features. It has this mouth mask feature, and you can see the difference. These are some nice features. Anyway, now to install this, we will be using Anaconda. For those who do not have Anaconda, please go to anaconda.com. Click on Free Download. I will skip the registration and simply download this distribution installer. Download it for Windows and install it on your PC. After it is done, download these IRoopDeepFaceCam files. Click on Code and choose Download ZIP. We have already downloaded the files. Here is the file. Let me quickly extract it here. So here are the files. After Anaconda is installed, go to the Start menu and type Anaconda Prompt. Here is the Anaconda Prompt. Open this. Now first create a Python environment. For that, type conda create --name IRoopDeepFaceCam python=3.10. This will create a Python 3.10 environment. Y and Enter. So the Python environment has been created. Now let's activate it. You can copy this or simply type conda activate IRoopDeepFaceCam and press Enter. You will see the base name has changed to IRoopDeepFaceCam. After this, go to the IRoopDeepFaceCam folder. This is the folder which we extracted earlier. Now copy this address. In here, type cd, then Ctrl+V, and press Enter. So we are inside this folder. Now let's install all the dependencies required for IRoopDeepFaceCam. For that, type pip install -r requirements.txt and press Enter. This will install all the dependencies for IRoopDeepFaceCam. So this will take some time. So this is done. Let me clear all of this with cls and Enter. Now let's try to run the application: python run.py. So here is an issue: 'NoneType' object has no attribute 'configure'. So to fix this, we need to install FFmpeg. Type conda install ffmpeg and press Enter. Please check our other video on installing FFmpeg if this method does not work. Y and Enter. So it is done. Let's try to run it: python run.py. So here is IRoopDeepFaceCam. Thank you very much.
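Since the error above went away once FFmpeg was installed, a quick check for FFmpeg before launching the app can save a failed start. This is a minimal sketch using only the Python standard library. ffmpeg_status is a hypothetical helper, and the suggested fix in its message simply repeats the conda command used in the video; the link between the error and a missing FFmpeg is what the video reports, not something this code verifies.

```python
import shutil


def ffmpeg_status(which=shutil.which):
    """Report whether an ffmpeg executable can be found on PATH.

    The which parameter exists only so the lookup can be swapped out
    when trying the function without FFmpeg installed.
    """
    path = which("ffmpeg")
    if path is None:
        return "FFmpeg not found on PATH; try: conda install ffmpeg"
    return f"FFmpeg found at {path}"


if __name__ == "__main__":
    # Run this inside the activated environment, before python run.py.
    print(ffmpeg_status())
```

Run it from the same Anaconda Prompt and environment you use for the app, because each environment can have its own PATH.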
