{ "item_title" : "Multimodal Models Systems Playbook", "item_author" : [" Reid Harper "], "item_description" : "Most books explain what multimodal AI is.This playbook shows you how to actually build and deploy it.As AI systems move beyond text into images, speech, and actions, many teams struggle with fragile pipelines, hallucinations, broken RAG setups, and demos that fail in production.This book fixes that.Multimodal Models Systems Playbook is a practical, systems-first guide for engineers building real multimodal AI applications-using vision, language, and speech models together with agent workflows and retrieval pipelines.Inside, you'll learn how to: Design reliable vision → language pipelinesBuild voice and speech systems that go beyond transcriptionImplement multimodal RAG across text, images, and audioCreate agent workflows that route tasks by modalityEvaluate multimodal systems for grounding, latency, and costDeploy production-ready systems with fallbacks and observabilityEach chapter includes clear explanations, failure modes, production checklists, and hands-on mini-labs.Who this book is for: Engineers, AI builders, and teams shipping multimodal systems.Not for: academic theory or vendor-locked tutorials.If you want to move from multimodal demos to production systems, this playbook shows you how.", "item_img_path" : "https://covers2.booksamillion.com/covers/bam/9/79/824/229/9798242295773_b.jpg", "price_data" : { "retail_price" : "16.99", "online_price" : "16.99", "our_price" : "16.99", "club_price" : "16.99", "savings_pct" : "0", "savings_amt" : "0.00", "club_savings_pct" : "0", "club_savings_amt" : "0.00", "discount_pct" : "10", "store_price" : "" } }

Multimodal Models Systems Playbook|Reid Harper

Multimodal Models Systems Playbook : Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

Name: Multimodal Models Systems Playbook
SKU: 9798242295773
Price: 16.99 USD
Availability: InStock

by Reid Harper

Ship to Me

In Stock.

FREE Shipping for Club Members

In-Store Pickup

Overview

Most books explain what multimodal AI is.
This playbook shows you how to actually build and deploy it.

As AI systems move beyond text into images, speech, and actions, many teams struggle with fragile pipelines, hallucinations, broken RAG setups, and demos that fail in production.

This book fixes that.

Multimodal Models Systems Playbook is a practical, systems-first guide for engineers building real multimodal AI applications-using vision, language, and speech models together with agent workflows and retrieval pipelines.

Inside, you'll learn how to:

Design reliable vision → language pipelines
Build voice and speech systems that go beyond transcription
Implement multimodal RAG across text, images, and audio
Create agent workflows that route tasks by modality
Evaluate multimodal systems for grounding, latency, and cost
Deploy production-ready systems with fallbacks and observability

Each chapter includes clear explanations, failure modes, production checklists, and hands-on mini-labs.

Who this book is for:
Engineers, AI builders, and teams shipping multimodal systems.

Not for: academic theory or vendor-locked tutorials.

If you want to move from multimodal demos to production systems, this playbook shows you how.

This item is Non-Returnable

Customers Also Bought

Details

ISBN-13: 9798242295773
ISBN-10: 9798242295773
Publisher: Independently Published
Publish Date: January 2026
Dimensions: 9 x 6 x 0.56 inches
Shipping Weight: 0.8 pounds
Page Count: 268

Related Categories

Favorites

What We Recommend

Featured

Shop by Category

Fiction

Nonfiction

Shop By Format

More Information

Favorites

Shop By Author A-G

Shop by Author G-L

Shop by Author R-Z

Shop By Series A-G

Shop By Series H-M

Shop By Series N-Z

Customers Also Liked

More in Manga

Favorites

Favorite Characters

Kids Fiction

Nonfiction

Shop by Age

Top Authors

Educational Resources

More Categories

Favorites

Popular Authors

Bestselling Series A-K

Bestselling Series L-Z

Favorites

Music

Featured

Page to Screen

Tabletop Role-playing

Fandoms

LEGO

Bestsellers

Games & Puzzles

Favorites

Best Books of 2026

#BookTok

Best Gifts for Kids

Toys & Games

For Teens & Young Adults

Pop Culture & Fandoms

Pen to Paper Shop

Faith-Based Gifts

Bargains in Fiction

Bargains in Nonfiction

Bargains in Young Adult Books

Bargains in Kids Fiction

Bargains in Kids Nonfiction

Bargains in Faith & Inspiration

Bargain Favorites

Multimodal Models Systems Playbook : Build Vision-Language-Speech Apps with Agent Workflows, RAG, Evaluation & Deployment

Overview

Customers Also Bought

Details

You May Also Like...

BAM Customer Reviews