Hands-On LLM Serving and Optimization

Available
0
StarStarStarStarStar
0Reviews
Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era. Without proper optimization, however, LLMs can be expensive and slow to serve. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexitie...
Read more
E-book
pdf
Price
54.50 £
Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era. Without proper optimization, however, LLMs can be expensive and slow to serve. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexitie...
Read more
Follow the Author

Options

  • Formats: pdf
  • ISBN: 9798341621473
  • Publication Date: 28 Apr 2026
  • Publisher: O'Reilly Media
  • Product language: English
  • Drm Setting: DRM