menu
{ "item_title" : "Multimodal Models Systems Playbook", "item_author" : [" Reid Harper "], "item_description" : "Most books explain what multimodal AI is.This playbook shows you how to actually build and deploy it.As AI systems move beyond text into images, speech, and actions, many teams struggle with fragile pipelines, hallucinations, broken RAG setups, and demos that fail in production.This book fixes that.Multimodal Models Systems Playbook is a practical, systems-first guide for engineers building real multimodal AI applications-using vision, language, and speech models together with agent workflows and retrieval pipelines.Inside, you'll learn how to: Design reliable vision → language pipelinesBuild voice and speech systems that go beyond transcriptionImplement multimodal RAG across text, images, and audioCreate agent workflows that route tasks by modalityEvaluate multimodal systems for grounding, latency, and costDeploy production-ready systems with fallbacks and observabilityEach chapter includes clear explanations, failure modes, production checklists, and hands-on mini-labs.Who this book is for: Engineers, AI builders, and teams shipping multimodal systems.Not for: academic theory or vendor-locked tutorials.If you want to move from multimodal demos to production systems, this playbook shows you how.", "item_img_path" : "https://covers2.booksamillion.com/covers/bam/9/79/824/229/9798242295773_b.jpg", "price_data" : { "retail_price" : "16.99", "online_price" : "16.99", "our_price" : "16.99", "club_price" : "16.99", "savings_pct" : "0", "savings_amt" : "0.00", "club_savings_pct" : "0", "club_savings_amt" : "0.00", "discount_pct" : "10", "store_price" : "" } }
Multimodal Models Systems Playbook|Reid Harper

Multimodal Models Systems Playbook : Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

local_shippingShip to Me
In Stock.
FREE Shipping for Club Members help

Overview

Most books explain what multimodal AI is.
This playbook shows you how to actually build and deploy it.

As AI systems move beyond text into images, speech, and actions, many teams struggle with fragile pipelines, hallucinations, broken RAG setups, and demos that fail in production.

This book fixes that.

Multimodal Models Systems Playbook is a practical, systems-first guide for engineers building real multimodal AI applications-using vision, language, and speech models together with agent workflows and retrieval pipelines.

Inside, you'll learn how to:
  • Design reliable vision → language pipelines

  • Build voice and speech systems that go beyond transcription

  • Implement multimodal RAG across text, images, and audio

  • Create agent workflows that route tasks by modality

  • Evaluate multimodal systems for grounding, latency, and cost

  • Deploy production-ready systems with fallbacks and observability

Each chapter includes clear explanations, failure modes, production checklists, and hands-on mini-labs.

Who this book is for:
Engineers, AI builders, and teams shipping multimodal systems.

Not for: academic theory or vendor-locked tutorials.

If you want to move from multimodal demos to production systems, this playbook shows you how.

This item is Non-Returnable

Details

  • ISBN-13: 9798242295773
  • ISBN-10: 9798242295773
  • Publisher: Independently Published
  • Publish Date: January 2026
  • Dimensions: 9 x 6 x 0.56 inches
  • Shipping Weight: 0.8 pounds
  • Page Count: 268

Related Categories

You May Also Like...

    1

BAM Customer Reviews