
Large Language Models Architecture and Deployment

Build End-to-End Generative AI Applications with RAG, Vector Search, Fine-Tuning, APIs, and Cloud Infrastructure

Language: English
Format: Paperback
Book: Large Language Models Architecture and Deployment, by Nao Hajime
Libristo code: 52817235
Publisher: Independently published, June 2026
20.09 incl. VAT
External warehouse: ships in 9-15 days

Returns accepted for up to 30 days

Building modern AI applications requires far more than connecting a language model to a chatbot interface. Production-grade Large Language Model systems demand scalable infrastructure, optimized inference pipelines, reliable data engineering workflows, secure deployment architectures, observability frameworks, and carefully engineered Retrieval-Augmented Generation (RAG) systems capable of delivering accurate and context-aware responses in real-world environments.
Large Language Models Architecture and Deployment is a comprehensive, engineering-focused guide to designing, building, deploying, scaling, and maintaining production-ready Generative AI systems powered by Large Language Models. Written for software engineers, AI practitioners, platform architects, DevOps engineers, and other technical professionals, it covers the complete lifecycle of modern LLM application development, from infrastructure planning and vector search pipelines to deployment automation and enterprise-scale AI operations.
The book begins by introducing the architecture of production-grade AI systems and the engineering principles required to build scalable and modular LLM applications. Readers will explore modern AI infrastructure design patterns, distributed architectures, orchestration strategies, cloud-native deployment models, and scalable backend systems capable of supporting high-throughput inference workloads.
As the book progresses, readers will learn how to build Retrieval-Augmented Generation pipelines using vector embeddings, semantic search, chunking strategies, metadata enrichment, hybrid retrieval systems, and re-ranking architectures. The book also provides deep technical coverage of prompt engineering, context management, embedding pipelines, vector databases, API development, AI agents, memory systems, autonomous workflows, and multi-agent orchestration frameworks.
Practical deployment topics are covered extensively, including containerization, Kubernetes orchestration, GPU acceleration, quantization, inference optimization, distributed serving, load balancing, CI/CD pipelines, infrastructure automation, cloud deployment strategies, and real-time streaming architectures. Readers will also explore advanced engineering topics such as observability systems, hallucination monitoring, prompt validation, security hardening, governance frameworks, cost optimization, and enterprise AI reliability engineering.
In addition to implementation-focused workflows, the book examines the operational realities of maintaining large-scale AI platforms, including compliance requirements, adversarial attacks, scaling challenges, deployment resilience, infrastructure monitoring, and long-term maintainability of rapidly evolving Generative AI ecosystems.
By the end of this book, readers will have the technical knowledge and practical engineering expertise to design and deploy scalable, production-grade LLM applications that support enterprise workloads, AI agents, semantic retrieval systems, and modern Generative AI platforms.


Book information

Full title: Large Language Models Architecture and Deployment
Author: Nao Hajime
Language: English
Binding: Book - Paperback
Publication date: 2026
Number of pages: 194
EAN: 9798199951579
Libristo code: 52817235
Weight: 347 g
Dimensions: 178 x 254 x 10 mm