Build AI Apps with Open-Source Models from Hugging Face

Partner: Udemy
Affiliate Name:
Area:
Description: Open-source AI has changed the game. With many models (and their weights) freely available, anyone—from hobbyists to seasoned developers—can experiment, innovate, and build new applications.In this course, you’ll explore a variety of open-source models from the Hugging Face Hub and use the transformers library to tackle NLP, audio, image, and multimodal tasks. You’ll also learn how to wrap your work into an interactive app and deploy it on the cloud using Gradio and Hugging Face Spaces.Here’s what you’ll be doing:Build a chatbot – Start with a small language model and turn it into a conversational agent that handles multi-turn interactions and follow-up questions.Work with text – Translate between languages, summarize long documents, and compare two pieces of text for similarity—perfect for search and retrieval systems.Handle audio – Convert speech to text using Automatic Speech Recognition (ASR) and generate natural-sounding speech from text with Text-to-Speech (TTS).Classify sounds instantly – Use zero-shot audio classification to identify audio content without retraining your model.Describe images with audio – Combine object detection with TTS to create spoken descriptions of images.Segment images on demand – Use zero-shot image segmentation by simply pointing to the area you want to identify.Do more with multimodal AI – Implement visual question answering, image search, image captioning, and other cross-modal tasks.Share your creations – Deploy your AI app to Hugging Face Spaces for an easy-to-use web interface or integrate it into an API.By the end of the course, you’ll have all the essential components you need to design AI-powered pipelines and bring your own ideas to life.
Category: IT & Software > Other IT & Software > Artificial Intelligence (AI)
Partner ID:
Price: 39.99
Commission:
Source: Impact
Go to Course