Recommended Books

These carefully selected books provide comprehensive coverage of natural language processing, deep learning fundamentals, and the latest developments in transformer architectures and large language models.

Large Language Models: A Deep Dive

Uday Kamath, Kevin Keenan, Garrett Somers, Sarah Sorenson

2024 Springer

This book provides an authoritative exploration of the design, training, evolution, and application of LLMs. It begins with an overview of pre-trained language models and Transformer architectures, laying the groundwork for understanding prompt-based learning techniques. Next, it dives into methods for fine-tuning LLMs, integrating reinforcement learning for value alignment, and the convergence of LLMs with computer vision, robotics, and speech processing. The book strongly emphasizes practical applications, detailing real-world use cases such as conversational chatbots, retrieval-augmented generation (RAG), and code generation. These examples are carefully chosen to illustrate the diverse and impactful ways LLMs are being applied in various industries and scenarios.

Fundamental Large Language Models (LLM) Foundation Models Generative AI Prompt Engineering

Hands-On Large Language Models

Jay Alammar, Maarten Grootendorst

2024 O'Reilly Media, Inc.

This book also helps you: Understand the architecture of Transformer language models that excel at text generation and representation, Build advanced LLM pipelines to cluster text documents and explore the topics they cover, Build semantic search engines that go beyond keyword search, using methods like dense retrieval and rerankers, Explore how generative models can be used, from prompt engineering all the way to retrieval-augmented generation and Gain a deeper understanding of how to train LLMs and optimize them for specific applications using generative model fine-tuning, contrastive fine-tuning, and in-context learning.

LLMs Mathematics Theory

Reference Papers

Attention Is All You Need

Vaswani et al., 2017

The original transformer paper that revolutionized NLP

Download PDF

Large Language Models: A Survey

Minaee et al., 2024

A detailed survey on large language models understanding

Download PDF

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Guo et al., IJCAI, 2024

About Multi model based AI Agents

Download PDF

Online Resources

The Illustrated Transformer

Jay Alammar

Visual guide to understanding transformer architecture

Visit Website

Hugging Face Transformers

Hugging Face

State-of-the-art NLP library and model hub

Visit Website

Papers With Code - NLP

Papers With Code

Latest NLP research with implementations

Visit Website

Development Tools

Google Colab

Google

Free cloud-based Jupyter notebooks with GPU access

Visit Website

PyTorch

Meta AI

Deep learning framework for research and production

Visit Website

Weights & Biases

W&B

Experiment tracking and model monitoring

Visit Website