AI60213 - Foundation of Large Language Models

Recommended Books

These carefully selected books provide comprehensive coverage of natural language processing, deep learning fundamentals, and the latest developments in transformer architectures and large language models.

View Online Buy Book

Large Language Models: A Deep Dive

Uday Kamath, Kevin Keenan, Garrett Somers, Sarah Sorenson

2024 Springer

This book provides an authoritative exploration of the design, training, evolution, and application of LLMs. It begins with an overview of pre-trained language models and Transformer architectures, laying the groundwork for understanding prompt-based learning techniques. Next, it dives into methods for fine-tuning LLMs, integrating reinforcement learning for value alignment, and the convergence of LLMs with computer vision, robotics, and speech processing. The book strongly emphasizes practical applications, detailing real-world use cases such as conversational chatbots, retrieval-augmented generation (RAG), and code generation. These examples are carefully chosen to illustrate the diverse and impactful ways LLMs are being applied in various industries and scenarios.

Fundamental Large Language Models (LLM) Foundation Models Generative AI Prompt Engineering

View Online Buy Book

Hands-On Large Language Models

Jay Alammar, Maarten Grootendorst

2024 O'Reilly Media, Inc.

This book also helps you: Understand the architecture of Transformer language models that excel at text generation and representation, Build advanced LLM pipelines to cluster text documents and explore the topics they cover, Build semantic search engines that go beyond keyword search, using methods like dense retrieval and rerankers, Explore how generative models can be used, from prompt engineering all the way to retrieval-augmented generation and Gain a deeper understanding of how to train LLMs and optimize them for specific applications using generative model fine-tuning, contrastive fine-tuning, and in-context learning.

LLMs Mathematics Theory