Included with free trial
$0.00$0.00
- Free trial includes 1 credit in your first month good for any title of your choice, yours to keep.
- Plus, you can enjoy unlimited listening to The Plus Catalogue—thousands of Audible Originals, podcasts, and audiobooks.
- You'll unlock exclusive member-only sales, as well as 30% off your purchases of any additional titles.
- After 30 days Audible is $14.95/month + applicable taxes. Renews automatically.
Buy
-13% $27.36$27.36
Image Unavailable
Colour:
-
-
-
- To view this video, download
LLMs in Production: Engineering AI Applications
Audible Audiobook
– Unabridged
鶹
Unlock the potential of Generative AI with this Large Language Model production-ready playbook for seamless deployment, optimization, and scaling. This hands-on guide takes you beyond theory, offering expert strategies for integrating LLMs into real-world applications using retrieval-augmented generation (RAG), vector databases, PEFT, LoRA, and scalable inference architectures. Whether you're an ML engineer, data scientist, or MLOps practitioner, you’ll gain the technical know-how to operationalize LLMs efficiently, reduce compute costs, and ensure rock-solid reliability in production.
What You’ll Learn:
- Master LLM Fundamentals – Understand tokenization, transformer architectures, and the evolution linguistics to the creation of foundation models.
- RAG & Vector Databases – Augment model capabilities with real-time retrieval and memory-optimized embeddings.
- Training vs Fine-tuning – Learn how to train your own model as well as cutting edge techniques like Distillation, RLHF, PEFT, LoRA, and QLoRA for cost-effective adaptation.
- Prompt Engineering – Discover the quickly evolving world of prompt engineering and go beyond simple prompt and pray methods and learn how to implement structured outputs, complex workflows, and LLM agents.
- Scaling & Cost Optimization – Deploy LLMs into your favorite cloud of choice, on commodity hardware, Kubernetes clusters, and edge devices.
- Securing AI Workflows – Implement guardrails for hallucination mitigation, adversarial testing, and compliance monitoring.
- MLOps for LLMs – Learn all about LLMOps, automate model lifecycle management, retraining pipelines, and continuous evaluation.
Hands-on Projects Include:
Training a custom LLM from scratch – Build and optimize an industry-specific model.
AI-Powered VSCode Extension – Use LLMs to enhance developer productivity with intelligent code completion.
Deploying on Edge Devices – Run a lightweight LLM on a Raspberry Pi or Jetson Nano for real-world AI applications.
PLEASE NOTE: When you purchase this title, the accompanying PDF will be available in your Audible Library along with the audio.
- Listening Length16 hours and 45 minutes
- Audible release dateApril 9 2025
- LanguageEnglish
- ASINB0F46KW2QR
- VersionUnabridged
- Program TypeAudiobook
Most popular
People who bought this also bought
Only from Audible
Product details
Listening Length | 16 hours and 45 minutes |
---|---|
Author | Christopher Brousseau, Matt Sharp |
Narrator | Christopher Kendrick |
Audible.ca Release Date | April 09 2025 |
Publisher | Manning Publications |
Program Type | Audiobook |
Version | Unabridged |
Language | English |
ASIN | B0F46KW2QR |
鶹 Rank |
|
Customer reviews
- 5 star4 star3 star2 star1 star5 star68%32%0%0%0%68%
- 5 star4 star3 star2 star1 star4 star68%32%0%0%0%32%
- 5 star4 star3 star2 star1 star3 star68%32%0%0%0%0%
- 5 star4 star3 star2 star1 star2 star68%32%0%0%0%0%
- 5 star4 star3 star2 star1 star1 star68%32%0%0%0%0%
Top reviews from Canada
There was a problem filtering reviews. Please reload the page.
- Reviewed in Canada on February 14, 2025The authors have dedicated a fair amount of effort to choose the topics to build a foundational level knowledge and have stretched enough to create a useful product. I liked how the authors have explained the simple concepts and expanded to design and build an AI product. The book offers a sound knowledge of LLMs, comparison of different LLMs, when to use existing LLMs, how to train and fine tune existing LLMs, scaling up to ML platform and finally deployment of the LLMs to the cloud. Due to the expansive nature of LLMs, a thorough understanding of AI product design to deployment phase it would be helpful to reduce the iterations of AI product. The covers the complete lifecycle of LLMs products.
The book can be very useful for AI Product designers/developers, MLOps and Business development teams or the newcomers to the field to understand the integration of LLMs into production.