LLM Engineer's Handbook: Master the art of engineering large language models from concept to production

LLM Engineer's Handbook: Master the art of engineering large language models from concept to production book cover

LLM Engineer's Handbook: Master the art of engineering large language models from concept to production

Author(s): Paul Iusztin (Author), Maxime Labonne (Author)

  • Publisher: Packt Publishing
  • Publication Date: 22 Oct. 2024
  • Language: English
  • Print length: 522 pages
  • ISBN-10: 1836200072
  • ISBN-13: 9781836200079

Book Description

Step into the world of LLMs with this practical guide that takes you from the fundamentals to deploying advanced applications using LLMOps best practices

Get With Your Book: PDF Copy, AI Assistant, and Next-Gen Reader Free

Key Features

  • Build and refine LLMs step by step, covering data preparation, RAG, and fine-tuning
  • Learn essential skills for deploying and monitoring LLMs, ensuring optimal performance in production
  • Utilize preference alignment, evaluation, and inference optimization to enhance performance and adaptability of your LLM applications

Book Description

Artificial intelligence has undergone rapid advancements, and Large Language Models (LLMs) are at the forefront of this revolution. This LLM book offers insights into designing, training, and deploying LLMs in real-world scenarios by leveraging MLOps best practices. The guide walks you through building an LLM-powered twin that’s cost-effective, scalable, and modular. It moves beyond isolated Jupyter notebooks, focusing on how to build production-grade end-to-end LLM systems.

Throughout this book, you will learn data engineering, supervised fine-tuning, and deployment. The hands-on approach to building the LLM Twin use case will help you implement MLOps components in your own projects. You will also explore cutting-edge advancements in the field, including inference optimization, preference alignment, and real-time data processing, making this a vital resource for those looking to apply LLMs in their projects.

By the end of this book, you will be proficient in deploying LLMs that solve practical problems while maintaining low-latency and high-availability inference capabilities. Whether you are new to artificial intelligence or an experienced practitioner, this book delivers guidance and practical techniques that will deepen your understanding of LLMs and sharpen your ability to implement them effectively.

What you will learn

  • Implement robust data pipelines and manage LLM training cycles
  • Create your own LLM and refine it with the help of hands-on examples
  • Get started with LLMOps by diving into core MLOps principles such as orchestrators and prompt monitoring
  • Perform supervised fine-tuning and LLM evaluation
  • Deploy end-to-end LLM solutions using AWS and other tools
  • Design scalable and modularLLM systems
  • Learn about RAG applications by building a feature and inference pipeline

Who this book is for

This book is for AI engineers, NLP professionals, and LLM engineers looking to deepen their understanding of LLMs. Basic knowledge of LLMs and the Gen AI landscape, Python and AWS is recommended. Whether you are new to AI or looking to enhance your skills, this book provides comprehensive guidance on implementing LLMs in real-world scenarios

Table of Contents

  1. Understanding the LLM Twin Concept and Architecture
  2. Tooling and Installation
  3. Data Engineering
  4. RAG Feature Pipeline
  5. Supervised Fine-Tuning
  6. Fine-Tuning with Preference Alignment
  7. Evaluating LLMs
  8. Inference Optimization
  9. RAG Inference Pipeline
  10. Inference Pipeline Deployment
  11. MLOps and LLMOps

Editorial Reviews

Recensione

“Exploring large language models (LLMs) and retrieval augmented generation (RAG)? I recently got my hands on 𝐿𝐿𝑀 𝐸𝑛𝑔𝑖𝑛𝑒𝑒𝑟’𝑠 𝐻𝑎𝑛𝑑𝑏𝑜𝑜𝑘 by Paul Iusztin and Maxime Labonne, and I’ve been hooked ever since it arrived! This book has everything you need to design, deploy, and optimize LLMs for real-world applications. From understanding LLM Twin for adapting models to specific tasks, to building robust data pipelines, and fine-tuning models for state-of-the-art performance, this book is packed with practical insights. It even dives into LLMOps practices for keeping everything running smoothly at scale. If you're as passionate about LLMs as I am, this is a must-read.”

Akshit Bhalla, Product Data Scientist at Tesla

“What can be better than sitting by the fireplace and reading a book? Reading LLM Engineer's Handbook by Paul Iusztin and Maxime Labonne.

Without any doubt, this is one of the best practical books on LLMOps out there.

It covers the LLM Twin use case (which I was slightly familiar with from Paul's content) and goes really deep into designing architecture – what components are required to build an LLM Twin, what are considerations when choosing these components; tooling; RAG feature pipeline; supervised finetuning vs finetuning with reference alignment, implementing Direct Preference Optimization; LLM evaluation; and inference pipeline.

I am now overly excited to start implementing the LLM Twin myself, but with different tools such as Databricks!”

Maria Vechtomova, Databricks MVP

L'autore

Paul Iusztin is a senior ML and MLOps engineer at Metaphysic, a leading GenAI platform, serving as one of their core engineers in taking their deep learning products to production. Along with Metaphysic, with over seven years of experience, he built GenAI, Computer Vision and MLOps solutions for CoreAI, Everseen, and Continental. Paul's determined passion and mission are to build data-intensive AI/ML products that serve the world and educate others about the process. As the Founder of Decoding ML, a channel for battle-tested content on learning how to design, code, and deploy production-grade ML, Paul has significantly enriched the engineering and MLOps community. His weekly content on ML engineering and his open-source courses focusing on end-to-end ML life cycles, such as Hands-on LLMs and LLM Twin, testify to his valuable contributions.

Maxime Labonne is a Senior Staff Machine Learning Scientist at Liquid AI, serving as the head of post-training. He holds a Ph.D. in Machine Learning from the Polytechnic Institute of Paris and is recognized as a Google Developer Expert in AI/ML.

An active blogger, he has made significant contributions to the open-source community, including the LLM Course on GitHub, tools such as LLM AutoEval, and several state-of-the-art models like NeuralBeagle and Phixtral. He is the author of the best-selling book “Hands-On Graph Neural Networks Using Python,” published by Packt.

Connect with him on X and LinkedIn @maximelabonne.

View on Amazon

代发服务PDF电子书30立即求助
未经允许不得转载:电子书百科大全 » LLM Engineer's Handbook: Master the art of engineering large language models from concept to production

评论 抢沙发

评论前必须登录!

立即登录   注册