Generative AI: Getting Started | Amit Diwan | Skillshare
Search

Playback Speed


1.0x


  • 0.5x
  • 0.75x
  • 1x (Normal)
  • 1.25x
  • 1.5x
  • 1.75x
  • 2x

Generative AI: Getting Started

teacher avatar Amit Diwan, Corporate Trainer

Watch this class and thousands more

Get unlimited access to every class
Taught by industry leaders & working professionals
Topics include illustration, design, photography, and more

Watch this class and thousands more

Get unlimited access to every class
Taught by industry leaders & working professionals
Topics include illustration, design, photography, and more

Lessons in This Class

    • 1.

      Generative AI - Course Introduction

      1:33

    • 2.

      AI vs ML vs DS vs DL

      5:05

    • 3.

      Deep Learning Types

      1:43

    • 4.

      What is Generative AI

      1:58

    • 5.

      Techniques for implementing Generative AI

      2:50

    • 6.

      Generative AI - Transformers

      3:34

    • 7.

      Large Language Models (LLMs) and its use cases

      4:27

    • 8.

      Generative AI - Applications & Challenges

      2:34

    • 9.

      Generative AI - Chatbots (Model Types)

      1:33

    • 10.

      Generative AI - Features & Examples

      3:27

    • 11.

      What are Prompts

      1:54

    • 12.

      Popular AI Chatbots

      2:24

    • 13.

      ChatGPT-4o Quick Overview and Use Cases (Prompts)

      6:28

  • --
  • Beginner level
  • Intermediate level
  • Advanced level
  • All levels

Community Generated

The level is determined by a majority opinion of students who have reviewed this class. The teacher's recommendation is shown until at least 5 student responses are collected.

20

Students

1

Project

About This Class

Generative AI is a subset of deep learning. It uses AI neural networks and can process both labelled and unlabelled data using supervised, unsupervised, and semi-supervised methods.

It refers to a class of artificial intelligence models and algorithms designed to create new content. These models can generate text, images, music, and other forms of data that mimic human-created content.

Generative AI applications are built on top of large language models (LLMs) and foundation models. LLMs are deep learning models.

LLMs are a subset of deep learning. LLMs are AI models that power chatbots, such as ChatGPT, Copilot, Google Gemini, etc. LLMs refer to large, general-purpose language models that can be pre-trained and then fine-tuned for specific purposes.

What you'll learn

  • Learn Generative AI from scratch.
  • Get a quick overview about Generative
  • What is a Transformer Model
  • Learn about Large Language Models
  • Generative AI Applications
  • Generative AI Challenges
  • AI Chatbots Models with examples

Who this course is for:

  • Those who want to learn Generative AI and its models
  • Those who want to learn what is a transformer model
  • Learn about the process of the Transformer Model
  • Those who want to understand the process of generating new content with Generative AI
  • Gain a deep understanding of Generative AI

**Course Lessons**

Section A: AI Introduction

1. Artificial Intelligence vs Data Science vs Machine Learning vs Deep Learning
2. Deep Learning Types

Section B: Generative AI and its techniques

3. What is Generative AI
4. Techniques for implementing Generative AI

Section C: What are Transformer Models

5. Generative AI – Transformers

Section D: Large Language Models

6. Large Language Models (LLMs) and its use cases

Section E: More about Generative AI

7. Generative AI - Applications & Challenges
8. Generative AI - Chatbots (Model Types)
9. Generative AI - Features & Examples

Section F: Prompts and AI Chatbots

10. What are Prompts
11. Popular AI Chatbots

Section G: ChatGPT4o - Writing Prompts

12. ChatGPT4o Overview and Use Cases (Prompts)

Meet Your Teacher

Teacher Profile Image

Amit Diwan

Corporate Trainer

Teacher

Hello, I'm Amit,

I'm the founder of an edtech company and a trainer based in India. I have over 10 years of experience in creating courses for students, engineers, and professionals in varied technologies, including Python, AI, Power BI, Tableau, Java, SQL, MongoDB, etc.

We are also into B2B and sell our video and text courses to top EdTechs on today's trending technologies. Over 50k learners have enrolled in our courses across all of these edtechs, including SkillShare. I left a job offer from one of the leading product-based companies and three government jobs to follow my entrepreneurial dream.

I believe in keeping things simple, and the same is reflected in my courses. I love making concepts easier for my audience.

See full profile

Level: Beginner

Class Ratings

Expectations Met?
    Exceeded!
  • 0%
  • Yes
  • 0%
  • Somewhat
  • 0%
  • Not really
  • 0%

Why Join Skillshare?

Take award-winning Skillshare Original Classes

Each class has short lessons, hands-on projects

Your membership supports Skillshare teachers

Learn From Anywhere

Take classes on the go with the Skillshare app. Stream or download to watch on the plane, the subway, or wherever you learn best.

Transcripts

1. Generative AI - Course Introduction: In this video course, learn about generative AI and its concepts. Generative AI is a subset of deep learning. It uses AI neural networks and can process both labeled and unlabeled data using supervised, unsupervised and semi suupervised methods. It refers to a class of artificial intelligence models and algorithms designed to create new content. These models can generate text, images, music, and other forms of data. That mimic human created content. Generative AI applications are built on top of large language models. These large language models are deep learning models. With Generative AVA chat boards, such as CA GPT, Google Gemini, Microsoft Co Pilate and others. You can easily create images like logos, banners, et cetera. Scan images and search PDF documents. Also write professional e mails, blogs, and articles in seconds. These chat boards can also teach you coding. Write advertisements for you. Fix the grammar, plan your vocation, and B your everyday AI assistant. The following lessons are covered in this course, Let's start with the first lesson. 2. AI vs ML vs DS vs DL: In this lesson, we will understand the difference between artificial intelligence, data science, machine learning, and deep learning. We will also see how these are related to each other. With that, we will also understand that how generative VA is related to these terms. Let's start. As I told you that I'll be discussing this first. Why? Because our generative A is also part of this AI. AI is a superset, as you can see in this wind diagram. It's a superset. It includes your machine learning, deep learning, and data science. But the ecosystem of data science also exceeds AA. What is AI A means creating smart machines to mimic human behavior? Or we can say it refers to the simulation of human intelligence in machines that are basically programmed to think and learn like humans. You must have seen AA in a lot of domains these days because you can easily analyze large amounts of data, recognize patterns, and make decisions. It is mostly used in healthcare finance, transportation, and entertainment fields. These days. Then comes your machine learning, which is a subset of artificial intelligence that is AA. Machine learning is a subset of AA, as I told before. And it is used to build a model based on training data to make predictions. Using machine learning, you can build a model to Make predictions, let's say, to predict the winner of this world cup. It focuses on developing algorithms and statistic models that enables a computer to learn from and make predictions or decisions based on data without being explicitly programmed to do so. Its techniques includes your supervised, unsupervised, semi supervised and reinforcement learning. It is also used in various fields such as image and speech recognition NLP, that is natural language processing. Forecasting medical diagnosis and others. Now comes your data science. Data science is the subset of AA, as I told above. It is an area of statistic, scientific methods, et c to extract meaning and insights from data. So I'll give an example. Let's say you went to Instagram and you liked some car videos like MG, Kia, Honda, Tesla. What will happen? You gave your data to Instagram that I like such videos. Such Instagram reels, Instagram channels, Instagram accounts. So what will happen? The next time you open Instagram, The Instagram will automatically pitch you with such reels, such post, let's say some discounts on cars. So how these things happened? All these things happened due to data signs because It extract meaning and insights from the data. Now, let's say a car company wants to approach some people who love cars. Whenever they'll add a sponsored post or story on Instagram, they know that These number of people like car videos, so the same thing will be pitched to them. What Data science did, they connected the client with the company. In this way, both parties got benefited, the client got that discount, and the company sold their product. So that's the value of data science. We say, data is the new le because an unprocessed data is of no use. Similarly an OL is of no use if it is not processed properly. Therefore, data is processed and meaningful insights are generated. Now comes your deep learning, deep learning, you can consider as a subset of machine learning. According to the n diagram, you can see. It is a class of machine learning algorithms to solve complex problems. It focuses on using artificial neural networks with multiple layers to model and understand complex patterns in data. Deep learning algorithms are inspired by the structure and function of the human brain, specifically, it's interconnected network of neurons. Why we are discussing this? Because generative VA is a part of deep learning. 3. Deep Learning Types: In this lesson, we will understand the types of deep learning. This will also help us in understanding that how generative a is related to deep learning. We will also see an example. Let's see. Dep learning types include your discriminative as well as generative. Previously, we all discussed about this, let's say to classify between a dog or a cat from a bunch of images from some images. Okay. Discriminative deep learning is used to classify or predict. It discriminates between different kinds of data instances. Let's say you have some images and you want to classify them as a dog or a cat, so it will be able to discriminate between them, and we'll predict that which of them is the pick of a dog or a cat. But generative AI is a completely different concept. It will generate new data that is similar to data it was trained on. It generates new data instances. That means in this case, it will generate a new cat Mage. Let's say you will upload your pick and it will generate your AI of TA. Or let's say you added a text prompt. Let's say you want to know about anything related to cricket. So you'll ask the prompt, and it will generate new data or content that resembles the original data it was trained on. Okay? 4. What is Generative AI: In this lesson, we will understand what is generative VA? We will also understand its process that how it generates new content. Let's see. Now, since we discussed about generative I, I told you that it is a part of deep learning. You can see. Generative VA is a subset of deep learning. It uses AA neural networks and can process both labeled and unlabeled data. That means as before the types of machine learning, a supervised unsupervised and semi supervised methods. GI, that means generative AI is a class of AI models. That is designed to create new content. It can generate not only text, but images, music, and other forms of data. It is built on large language models. We will also discuss large language models later. These LLMs, that means large language models are deep learning models. This is the process of generative AI. I told you it creates new content based on what it learned from existing content. That means the data it was trained on. Here, training means learning from existing content. It will create a statistical model. That will be used to predict an expected response. When you type a prompt. When a prompt is typed, this generative I will use the statistical model. To generate new content in the form of text, images, music, video, task, and others. 5. Techniques for implementing Generative AI: In this lesson, we will understand some techniques for implementing generative AA. You can also consider it as the approaches or the generative AI models. You must have heard about GPT three and GP four models of pen AA. These are also based on these techniques. Let's see. Now, let's see the techniques for implementing generative A, or you can also consider it as generative AI models. The first one is GAs. Generative adversarial networks. Under this, two neural networks are trained simultaneously. The first one is a generator network, and the second one is a discriminator network. The generator creates data while the discriminator evaluates it. You can say the generator network lens to generate data samples such as images or text that resemble your training data. While the discriminator network learns to distinguish between real data samples and those generated by the generator. The second one is variational auto encoders. These are basically used for encoding and reconstructing data. It is also a type of generative model used in machine learning and deep learning. The variational auto encoders, can generate new data that's similar to the input data they have been trained on. You can use it to create new images that resemble a given data set. VAs are used in generative modeling, data compression, et cetera. Now, let us see transformer based models. Using these, we can easily handle large sequences of data, particularly in NLP task. This is the topic we'll be discussing. Because this is behind some of the most advanced language models like AI GPD three and GPD four. Two of the most powerful generative a models. These are based on the transformer architecture. The transformer architecture was coined by Indian in 2017. These models are used to generate human like text. It can also help you with coding tasks and translate from one language to another. Let's learn more about this. The GPT means generative pre trained transformer. That's why we are discussing this topic. 6. Generative AI - Transformers: In this lesson, we will understand what is a transformer. We will see what is a transformer model, its architecture, who coined it with that we will also see its process. While using transformers, you may run into an issue called halucination. We will also cover what are hallucinations and why it can happen. Let's start with the transformers concept. Here comes your transformers. It is a type of generative a model. That is a type of generative a model called transformer model. You can consider the power of generative A comes from the use of these transformers. I told you it was coined by Indian in 2017, Ashish asi. Okay. It helped in actually laying the foundation for advancements in the field of NLP and machine learning. Okay. The transformers include encoder and decoder. I'll also give an example later. The encoder will encode the input sequence. Let's say you have a text in Spanish language, and you want to convert it in English language. What will happen? The encoder will encode the input sequence and pass it to the decoder, which will learn how to decode the representations for relevant task. Let's see. Here is the process I told you encoder decoder. It is the main component of the transformer architecture. L et's say we have a text my name is amet and Spanish language. What will happen with the transformers? It will get first encoded. That means the encoder will include your self attention and feed forward mechanisms. What will happen? Every word will be related to every other word in the input sequence. This will allow the process to focus on the key words in this. Now, the next mechanism feed forward, what will happen? This will further refine the understanding of each word, and it will be passed to the decoder. Further, the decoder will will generate the Spanish text in English language. That means a text in Spanish processed into its English equent using transformers. An issue may arise while using transformers. That means haucinations. You must have heard about AA showing irrelevant results, misleading results, grammatical issues. All these come under hallucinations. Here you can see misleading results. Alucinations are words or phrases that are generated by the model. That are often nonsensical or grammatically incorrect. It can be due to various factors. Let's say that data is noisy. It is not having enough context or the model is not trained on enough data. So Illustinations, since those are misleading results, make the output text difficult to understand. 7. Large Language Models (LLMs) and its use cases: In this lesson, we will learn about LLMs. That is large language models. Whenever you discuss about generative I, then this topic will always be considered. Both LLM and generative VAs are subsets of deep learning. Let us understand what are LLMs, and we will also discuss a type or you can consider a use case of LLMs. Let's see. Okay. Now we'll be discussing about large language models. I told you that generative VA is a part of deep learning, and LLMs are also a part of deep learning. Both are related. LLMs are also a subset of deep learning. As I just said, Okay, you must have heard about CAT GPT, copilot, Google Gemin, that means bad mid journey. LLMs are AI models. You can consider that power, all these chat bots. LLMs are large language models. That means large general purpose language models. That can be pre trained and then fine tuned for specific purposes. You can pretrain LLM with a large dataset, and fine tune means to fine tune it with a particular M with a smaller dataset from that large dataset. LLMs also represent a class of AI models that is used to understand and generate human like text, or you can say it provides an engine that powers the AHd bot. You AHd bots are based on These LLMs. These LLMs will allow your chat bot to easily create naturally phrased recommendations so that the content is generated by generative AI according to your personalized recommendation. That's why LLM is considered as the backbone of AHd bots, all the AHd bots. Now let us see a scenario or a use case. The large language models are trained with petabytes of data and generate billions of parameters. To solve different task. These tasks can be sentence completion, text classification, language translation. We can see this example of Palm PLM. It is a transformer based large language model. Google just announced Palm two also. It is a pathways language model, a 540 billion parameters, that is a larger training data set with a large number of parameters. It is also a transformer model. I just told you that transformer model includes your encoders and decoders. I discussed this before. So The speciality of LLMs are that it can still obtain a greater or decent performance with little domain training data. So it can be used for few shot or even zero shot scenarios. So these two scenarios, if you'll learn more about LLM and all these models, you will be getting such terms again and again. So let me explain it quickly. If you're training a model with less data with minimal amount of data, then it would be called few shot as the name suggests. And what about zero shot? It means a model can recognize things that have not been taught in the training before. That means zero shot, nothing. LLM the performance of LLM grows when you add more data and parameters. Here we just saw f 40 billion parameters. We can learn more about Palm later. It is considered as a next generation language model. With the enhanced multilingual reasoning and coding capabilities. Okay. Google also announced Audio Palm for speech to speech translation in June 2023. 8. Generative AI - Applications & Challenges: In this lesson, we will learn, what are the applications and challenges of generative VA? We can easily generate content images, logos, banners, as well as summarizing PDF using generative VA. But we should also understand the challenges behind it. Because this is also a topic to cover since generative VA is used for some unethical purposes also. Let's see. Now, let us see some applications and challenges of Generative AI. We all know that generative AA can be used to create content, proof re date, we can write e mails. We can also create characters, three D images, games. We can create complete landscapes and scenarios. It can also be used by artists and designers easily. You can also generate logos, banners, social media post. What are the challenges I would like to discuss this more? We saw that these generative AI models are basically considered to have ethical concerns, quality control, biasness. Also the images you are generating, the texts you're generating. Some people say that it can have copyright issues or even on YouTube, they are asking that is your video generated by AA or not? So you can explicitly mention it. Also on Instagram, there is an option to add your AI label. Okay. With that, one of the challenge or issue with GNI. Here it is. On Google Gini, once it was showing some misleading results, like, people can eat rocks and they can glue pizza. So someone surged and Google AI surg revealed the following results. So these are very cary. When you'll learn about Google Gemini. Now they are showing a disclaimer that if you're creating a fitness plan or a meal plan using these Had birds, there is a disclaimer that you need to contact a registered dietitian or a fitness expert before following our answers before following what the prompt result is. These things are really important. 9. Generative AI - Chatbots (Model Types): In this lesson, we will learn about the generative via chat bot model types. You must have heard about the text to text model, text to image model, text to video, text to music model. Let's see what are these. Now, let us see the model types, so this will also cover your EI chat bots currently. Okay. Text two text, we all know, OP EI Cat GPT, Microsoft Co Pilot, and Google GMI, we'll be typing a text prompt to generate an e mail to generate an article to generate a block. With that, we can use text two images on the Dali model and Md Jury. Dali model is now part of Microsoft Copilot, so you will be getting around 15 boosts in a day within the free version of copilot so that you can work with Daly. Text to video, Open A Sa, and we now also have Kling and Luma AI Dream machine introduced today. So you can try and generate text to video now. Easily. Text two songs, you must have heard about creating songs using text prompts using a two line prompt with So AA. Okay. You can easily achieve this with text two songs model. Then comes your text to task like software agents, virtual assistance, automation. So Microsoft came up with the co pilot PCs, the copilot studio also as in virtual assistant to easier work. 10. Generative AI - Features & Examples: In this lesson, we will learn about generative via chat board features, as well as some examples. With the features, we will see some examples related to text to image to image, as well as text to video. Let's start. Now the features of AHd bots, I have shown these chat bots before. Now the features, I have just amalgamated all of them. You can easily create logo banners. You can also use it to code, fix your code, generate syntax. With that, you can also upload and scan images. This means that if you're having an image and you want the AH adb to read it, to scan it. That What does this image include? So our tutorial also includes that complete use case. You can directly ask that what does this image include? By aplouding EPEC. Okay. With that, you can easily work search and scan PDF documents. So 20 pages, ten, 20, 50 pages documents PDF documents can be scanned within seconds within less than a minute. Definitely, it will save your time. It will save at least five to seven 8 hours of your work. If you will summarize a 30 to 40 page document. You can write e mail blogs and articles. You can also set the tone, the number of words you want. If you want storytelling for your article, you can easily add it. With that, you can easily find jobs, create resumes, cover letters from that resume. I told before that we can also use it for coding. You can also write advertisements. You can generate product timelines. Fix the grammar proof, read, your content completely. Plan your vacations, generate hotel recommendations completely. You can write a meal plan, a fitness plan based on your recommendation. Let's say if someone wants a fitness plan without using gym equipment, exercise plan, your chat board can also do it for you. With that, you can also get gift ideas. In fact, in Google Gemini, you'll also get images and the link to get some gift. Let's say you want a gift for a kid aged six. You can mention I want a gift for a boy kid aged six. Then you'll get relevant results. These are some examples. I generated them using text image model. Let's say Dali, Dali is a part of copilot now. You can easily generate images by just typing one to two line prompts. So this is how I generated it. I generated this three D line with sunglasses, a robo image, a dog playing on a road, and these also. This was text image model. You can also generate them using mid Journey, encraft, Daly, and others. This is Image two Mage model. So I took this image from the Internet. So these are the images of cricketers when they were kids, and this is my image generated by an image two image model. 11. What are Prompts: In this lesson, we will learn what are prompts. With that, we will also learn what is prompt engineering, as well as who are prompt engineers. The role of prompt engineers are becoming popular. Let's see. Now, let's see what is a prompt. Prompt is basically the input that a user types. I told you text two image text to text text to video. So those were the prompts. What will happen when a user will type the input, it will go to the AI model to get a specific response. A new response will be generated. That's the purpose of generative A to generate new content in the form of text images videos. Okay. You can also consider it as a query. It describes the tasks that an AA should perform. Let's say you t I want to write an e mail to my boss for five days leave. Okay, so the response will get generated. What is prompt engineering? It means crafting specific instructions that can be understood by the AA model. And to get responses in real time. That means what you will type and the result will get generated immediately. I've just shown you the images I generated. Also you can generate text from your text prompt. Now, what do prompt engineers do? So the role of prompt engineers are becoming popular because if you know how to craft prompts properly, you can easily generate results because a lot of these prompts are limited. Copilot provided daily, but you only get 15 boosts in a day. Also for text, you have some tokens, Token, you can consider half a word or 0.75 of a word. Okay. So those are also limited for a day. Those keep on changing. 12. Popular AI Chatbots: In this lesson, we will learn about some popular AI chat bots. Some of them are widely used. The first chat bot was introduced by OPA. That is Cgb. Then came your copilot and Google Gemini. Guys, the following are some popular AI chat bots. The first one was introduced by OPA, that is Open Gb. O pene also introduced Daly for images, and they also introduced Open SRA for videos. Microsoft launched copilot. A lot of people don't know that CAD GPT supported by Microsoft. It got funded by Microsoft. That's why in Copilo, now we have Open Dal for image generation. Okay, Google Gemini was known as bad and it was obviously developed by Google. So let's see the layout quickly. These are the links wherein you can access them. Here are the layouts. The following. The first one is for Chart GPT. Okay. This is the free version. If you'll type any prompt, let's say, so you can now see you have GPT four for free. Here it is, GPT four f free with limited prompts and image generation. Then we have our copilot. These are copilot GPT. Let's say you want images, you can click here, and you can generate logos images. The website is copilot microsoft.com. The last one is Google Gemini. Okay? These are the suggested prompts, and these are the prompts I wrote. Under settings, you can select the dark theme or you can also select extensions to work on tuks. Okay guys, Guys, we saw how we can easily work around Generative VA, what is generative VA, its models. We also learned about its features and the types, the transformer model, its process, the process of generative VA, and we also worked around some great examples to understand the text text to image, as well as text to video models. Thank you for watching the video. 13. ChatGPT-4o Quick Overview and Use Cases (Prompts): So the g4o is here. O stands for Omni. It includes your access to audio vision and text in real time. Here it is O for Omni, and it accepts as input any combination of text audio and image. You can scan images as well as improvement on text in non English languages. Also, the APA is 50% cheaper. Let's see how to access it. Just go to chart. I already locked in for the 3.5 version, so it is directly giving me an option to try it now. I told you it can understand images, can browse the web and speaks more languages. I'll click Try it now. Here you can see g4o. Now, let us start whether it is connected to the Internet or not. So GPT 3.5 wasn't connected with the Internet. But to this claims, that an internet connection is there for G four. What is the temperature today in? Deli India? Here it is the current temperature is visible. That means it is connected to the Internet. Okay. From here, you can change the model also. Right now, we are in GBT four. Now let me generate a logo. Create a logo for an for an online shopping company with the text one stop shopping destination. Let's see whether it will generate a logo or not. No, it is not providing. No, let us upload an image. Here it is type. What is this image about? Let's see, will it be able to check the image, scan the image or not? Okay, it's a Laptop smart phone apple. Fine. We can also learn about this. Now let us upload another image. I'll upload my image open? Let's see. Submit. Okay. It has represented it. Now, I'll click on this shoe image. I'll type. Have you seen this before? So I'm just scanning it. Submit. It was able to guess it perfectly the name of the shoe. Okay. No solve this linear equation. I'll direct solve. It is showing the steps also. Okay. Here is the answer. You can see the steps. Now you can see when I'll click here, you have reached your phyplod limit. You can upgrade to chat GPT plus or try again. Okay. Let's write an article on a current topic. Write an article on IPL, Indian Premier League is here, and we are nearing its end. Ten to 15 matches are remaining. Let's say how much it is updated. It is searching the news. Okay. Here it is. Let me know about the current matches of IPL 2024. Of, it should be off. I edited it. Similarly, you can also edit. I saved it and edited. You have reached our limit of messages. Please try again. Region. This is how I demonstrated the 4.0 version, the following. You can rename it. G four First impressions on Laptop. Okay. So definitely, if you want the voice access also, you need to go to your mobile phones and download Chat GPD there. Download and install Chat GPD there for the GPT 40 version. So guys, we saw some first impressions of GPT four. Thank you for watching.