Details
-
Collect and preprocess data for large language models
-
Train and fine-tune pre-trained large language models for specific tasks
-
Evaluate the performance of large language models and select appropriate metrics
-
Deploy large language models in real-world applications using APIs and Huggingface
-
Understand ethical considerations involved in working with large language models, such as avoiding bias and ensuring transparency
​
Learning Outcome:
Upon completing this course, students will undergo a transformation in their understanding of machine learning with a focus on production and LLMs. They will gain practical skills in building and deploying machine learning models in a production environment.
Additionally, students will gain a comprehensive understanding of the end-to-end machine learning pipeline, allowing them to construct and deploy robust and effective models in real-world settings using Large Language Models. Overall, students will emerge with greater confidence in their abilities to tackle practical machine learning problems and deliver results in production.
​
Curriculum
​
01 - Introduction to Natural Language Processing
-
Introduction to NLP: covers what NLP is, its history, applications and challenges.
-
NLP Techniques: covers common techniques such as tokenization, part-of-speech tagging, named entity recognition and sentiment analysis with examples of their use.
-
NLP Tools: introduces popular NLP tools such as NLTK, spaCy with examples of how to use them
​
02 - Foundational Knowledge of Transformers & ML System Design
-
Transformers Foundational Knowledge: covers fundamental concepts of transformers, including self-attention, multi-head attention, and positional encoding.
-
Introduction to Fundamental Concepts of ML System Design: covers the basics of designing machine learning systems, including data collection and preprocessing, model selection and training
03 - Retrieval Systems
-
Introduction to Information Retrieval Systems: covers the basic concepts and goals of information retrieval systems and their importance in various applications.
-
FAISS: introduces Facebook's AI Similarity Search (FAISS) library, a popular open-source library for efficient similarity search and clustering of dense vectors, and explains how it works.
​
04 - Building a search engine from bare bones and deploying it on Huggingface using server-less inference
-
Deployment on a Serverless Inference: discusses the benefits of serverless computing and how to deploy the semantic search model on a serverless platform like Hugging Face
-
Preprocessing of Hotel Data
-
Evaluation of the Model: discusses the different evaluation metrics used for semantic search models and how to measure the performance of the model
​
05 - Knowledge Graph, Generative AI, Alpaca,Llama and Dolly 2.0 and World of langchain on larger corpus of Data
-
Introduction to Knowledge Graph: covers the basics of knowledge graphs, including their architecture, data modeling, and how they are used in real-world applications like search engines and recommendation systems.
-
Generative AI: introduces the concept of generative AI and how it is used to generate new data, such as text, images, and music.
​
06 - Deploying models in Airflow, DAGs and using GPU-as-a-service to build automated systems [replicate.ai & banana.dev]
Details to follow...
​​
Workshop Requirements
-
Coming soon
Who Should Attend?
-
You are intrigued by LLMs and would like to learn more
-
You are ready to deploy your own SOTA AI models and like to see how they work
-
You want to go beyond a Jupyter Notebook and develop batch or real-time prediction
Who Should NOT Attend?
-
Folks who have not taken an Introduction to NLP course
Who is your Instructor?
Senior Research Science Manager @ Google Adjunct Professor @Stanford | @U of Minnesota
He has over 15 years of experience of leading and building ML teams and has been teaching for the past three years.
​