RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone

RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone

RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone

Author: Denis Rothman (Author)

Publisher finelybook 出版社:‏ ‎ Packt Publishing

Edition 版本:‏ ‎ N/A

Publication Date 出版日期:‏ ‎ 2024-09-30

Language 语言: ‎ English

Print Length 页数: ‎ 334 pages

ISBN-10: ‎ 1836200919

ISBN-13: ‎ 9781836200918

Book Description

Minimize AI hallucinations and build accurate, custom generative AI pipelines with RAG using embedded vector databases and integrated human feedback

Purchase of the print or Kindle book includes a free eBook in PDF format

Key Features

  • Implement RAG’s traceable outputs, linking each response to its source document to build reliable multimodal conversational agents
  • Deliver accurate generative AI models in pipelines integrating RAG, real-time human feedback improvements, and knowledge graphs
  • Balance cost and performance between dynamic retrieval datasets and fine-tuning static data

Book Description

RAG-Driven Generative AI provides a roadmap for building effective LLM, computer vision, and generative AI systems that balance performance and costs.

This book offers a detailed exploration of RAG and how to design, manage, and control multimodal AI pipelines. By connecting outputs to traceable source documents, RAG improves output accuracy and contextual relevance, offering a dynamic approach to managing large volumes of information. This AI book shows you how to build a RAG framework, providing practical knowledge on vector stores, chunking, indexing, and ranking. You’ll discover techniques to optimize your project’s performance and better understand your data, including using adaptive RAG and human feedback to refine retrieval accuracy, balancing RAG with fine-tuning, implementing dynamic RAG to enhance real-time decision-making, and visualizing complex data with knowledge graphs.

You’ll be exposed to a hands-on blend of frameworks like LlamaIndex and Deep Lake, vector databases such as Pinecone and Chroma, and models from Hugging Face and OpenAI. By the end of this book, you will have acquired the skills to implement intelligent solutions, keeping you competitive in fields from production to customer service across any project.

What you will learn

  • Scale RAG pipelines to handle large datasets efficiently
  • Employ techniques that minimize hallucinations and ensure accurate responses
  • Implement indexing techniques to improve AI accuracy with traceable and transparent outputs
  • Customize and scale RAG-driven generative AI systems across domains
  • Find out how to use Deep Lake and Pinecone for efficient and fast data retrieval
  • Control and build robust generative AI systems grounded in real-world data
  • Combine text and image data for richer, more informative AI responses

Who this book is for

This book is ideal for data scientists, AI engineers, machine learning engineers, and MLOps engineers. If you are a solutions architect, software developer, product manager, or project manager looking to enhance the decision-making process of building RAG applications, then you’ll find this book useful.

Table of Contents

  1. Why Retrieval Augmented Generation?
  2. RAG Embedding Vector Stores with Deep Lake and OpenAI
  3. Building Index-Based RAG with LlamaIndex, Deep Lake, and OpenAI
  4. Multimodal Modular RAG for Drone Technology
  5. Boosting RAG Performance with Expert Human Feedback
  6. Scaling RAG Bank Customer Data with Pinecone
  7. Building Scalable Knowledge-Graph-Based RAG with Wikipedia API and LlamaIndex
  8. Dynamic RAG with Chroma and Hugging Face Llama
  9. Empowering AI Models: Fine-Tuning RAG Data and Human Feedback
  10. RAG for Video Stock Production with Pinecone and OpenAI

Review

“This book stands out for its hands-on, practical approach, offering readers a clear pathway from foundational concepts to complex implementations. Its meticulous explanation of RAG concepts and real-world code implementations, make it accessible to both beginners and seasoned professionals.

A notable highlight is its unique insights into the challenges of scaling RAG systems and practical guidance on managing large datasets, optimizing query performance, and controlling costs. Additionally, the chapters on modular RAG and fine-tuning offer actionable strategies, which resonate with my own experiences in building an AI-powered Mental Health Management application utilizing conversational AI and RAG. The emphasis on human feedback is crucial; it demonstrates how expert input can refine data and enhance the reliability of AI responses, aligning AI outputs with human values.

The book’s insights into performance optimization and the integration of human feedback make it a standout resource in the field.”

Harsha Srivatsa, Founder and Head of AI Products at Stealth AI, Ex- Apple, Accenture

“This book provides an incredibly comprehensive deep dive, covering everything from multimodal data types and various RAG architectures to advanced topics like evaluation, knowledge graphs, and fine-tuning with human feedback.

What truly stands out is how seamlessly Rothman explains complex concepts, making the material both accessible and insightful for readers at all levels. Whether you’re looking to build end-to-end RAG solutions or simply enhance your understanding of cutting-edge AI systems, this book will deepen your knowledge with its thorough and practical coverage across diverse use cases.”

Surnjani Djoko, PhD, SVP, Specialized ML/AI – Lead USPBA Innovation Lab

About the Author

Denis Rothman graduated from Sorbonne University and Paris-Diderot University, and as a student, he wrote and registered a patent for one of the earliest word2vector embeddings and word piece tokenization solutions. He started a company focused on deploying AI and went on to author one of the first AI cognitive NLP chatbots, applied as a language teaching tool for Moët et Chandon (part of LVMH) and more. Denis rapidly became an expert in explainable AI, incorporating interpretable, acceptance-based explanation data and interfaces into solutions implemented for major corporate projects in the aerospace, apparel, and supply chain sectors. His core belief is that you only really know something once you have taught somebody how to do it.

相关文件下载地址

(conv) | 31 MB | 2024-10-08

打赏
未经允许不得转载:finelybook » RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone

评论 2

  1. #1

    无链接

    winsome1周前 (12-15)回复
    • 已修复

      admin1周前 (12-15)回复

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫