Retrieval Augmented Generation in Production with Haystack (Early Release)

Retrieval Augmented Generation in Production with Haystack (Early Release) КНИГИ » ПРОГРАММИНГ

Название: Retrieval Augmented Generation in Production with Haystack: Building Trustworthy, Scalable, Reliable, and Secure AI Systems (Early Release)
Автор: Skanda Vivek
Издательство: O’Reilly Media, Inc.
Год: 2024-05-07
Язык: английский
Формат: pdf, epub, mobi
Размер: 10.1 MB

In today's rapidly changing AI technology environment, software engineers often struggle to build real-world applications with large language models (LLM). The benefits of incorporating open source LLMs into existing workflows is often offset by the need to create custom components. That's where Haystack comes in. This open source framework is a collection of the most useful tools, integrations, and infrastructure building blocks to help you design and build scalable, API-driven LLM backends.

With Haystack, it's easy to build extractive or generative QA, Google-like semantic search to query large-scale textual data, or a reliable and secure ChatGPT-like experience on top of technical documentation. This guide serves as a collection of useful retrieval augmented generation (RAG) mental models and offers ML engineers, AI engineers, and backend engineers a practical blueprint for the LLM software development lifecycle.

An emerging paradigm is the leveraging of Generative AI to unlock data-centric insights for customers across various industries using large language models (LLMs) such as the OpenAI GPT models, Anthropic’s Claude models, Google Gemini, Meta’s Llama models, Mistral, etc. However, an engine alone cannot propel a vehicle. State-of-the-art LLMs like GPT-4 excel at language-based tasks due to their a priori knowledge, acquired through training on a vast representative corpus of documents (including websites, books, etc.) and tasks involving these documents.

While LLMs demonstrate exceptional out-of-the-box performance, their inherent value is limited. Enterprise use-case lie in adapting these LLMs to their custom data sources and customer workflows. One approach for this involves feeding the LLM relevant context as part of the input. However, this method presents several challenges, including latency, cost, and model forgetfulness when dealing with large context sizes.

Large Language Models like GPT-3.5 have ushered in a new era of artificial intelligence and computing. LLMs are large scale neural networks, composed of several billion parameters, and trained on natural language processing tasks. Language models aim to model the generative likelihood of word sequences, to predict the probabilities of future (or missing) tokens. The simplest language models are bigram, trigram (n-gram in general) models where the probability of the following word depends on the previous n-1 words.

Скачать Retrieval Augmented Generation in Production with Haystack (Early Release)

Скачать с Turbobit

ОТСУТСТВУЕТ ССЫЛКА/ НЕ РАБОЧАЯ ССЫЛКА ЕСТЬ РЕШЕНИЕ, ПИШЕМ СЮДА!

Автор: Ingvar16 13-05-2024, 15:52 | Напечатать |

Уважаемый посетитель, Вы зашли на сайт как незарегистрированный пользователь.

С этой публикацией часто скачивают:

Building Generative AI-Powered Apps: A Hands-on Guide for Developers Название: Building Generative AI-Powered Apps: A Hands-on Guide for Developers Автор: Aarushi Kansal Издательство: Apress Год: 2024 Страниц: 175...

CockroachDB: The Definitive Guide (Fifth Early Release) Название: CockroachDB: The Definitive Guide: Distributed Data at Scale (Fifth Early Release) Автор: Jesse Seldess, Ben Darnell, Guy Harrison...

Mastering Apache Pulsar (Third Early Release) Название: Mastering Apache Pulsar (Third Early Release) Автор: Jowanza Joseph Издательство: O’Reilly Media, Inc. Год: 2021-11-05 Страниц: 274 Язык:...

Designing Machine Learning Systems (Early Release) Название: Designing Machine Learning Systems: Iterative Processes for Deployable, Reliable, and Scalable Machine Learning (Early Release) Автор: Chip...

Mastering Apache Pulsar (Second Early Release) Название: Mastering Apache Pulsar (Second Early Release) Автор: Jowanza Joseph Издательство: O’Reilly Media, Inc. Год: 2021-05-18 Язык: английский...

React: Up & Running: Building Web Applications, Second Edition (Second Early Release) Название: React: Up & Running: Building Web Applications, Second Edition (Second Early Release) Автор: Stoyan Stefanov Издательство: O’Reilly...

Kubeflow for Machine Learning: From Lab to Production (Early Release) Название: Kubeflow for Machine Learning: From Lab to Production (Early Release) Автор: Holden Karau, Trevor Grant, Ilan Filonenko Издательство:...

Mastering Serverless Applications with Google Cloud Run: A Real-World Guide to Building Production-Ready Services (Early Release) Название: Mastering Serverless Applications with Google Cloud Run: A Real-World Guide to Building Production-Ready Services (Early Release) Автор:...

Building Secure and Reliable Systems: SRE and Security Best Practices (Early Release) Название: Building Secure and Reliable Systems: SRE and Security Best Practices (Early Release) Автор: Heather Adkins, Betsy Beyer, Paul...

Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Services Название: Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Services Автор: Brendan Burns Издательство: O'Reilly Media...

Информация

Посетители, находящиеся в группе Гости, не могут оставлять комментарии к данной публикации.