Driving AI Innovation: Making It Easy to Build Generative AI Apps with Redis

Logo
Presented by

Tyler Hutcherson, Senior Applied Machine Learning Engineer, Redis

About this talk

With so many people – like your CIO – hyped about artificial intelligence, machine learning, and generative AI products like ChatGPT, you’re likely under a lot of pressure to develop applications with those technologies for your company. Like, yesterday. But how do you turn terabytes of unstructured data – ranging from text to images, audio, and video – into useful information for the AI application? The answer is a vector database. Vectorizing converts unstructured data into numerical representations known as vector embeddings, and Redis stores, indexes, and retrieves them. Vectors make searching for similar images or text or documents – vector similarity search (VSS) – faster and easier. Moreover, an external vector database with domain-specific, proprietary, or more recent data – used with public general purpose large language models – improves AI application responses. Not to mention, keeping it from making things up.   In this tech talk, find out how to use Redis’s vector database capabilities for real-time AI apps. We show the elements of building generative AI applications – using Redis technology you already know. Watch to find out more about it. * What’s a vector database * Examples * Why LLMs need real-time response * Redis Enterprise VSS features and differentiators
Related topics:

More from this channel

Upcoming talks (0)
On-demand talks (102)
Subscribers (10880)
Redis makes apps faster, by creating a data foundation for a real-time world. It is the driving force behind Open-Source Redis, the world’s most loved in-memory database, and commercial provider of Redis Enterprise, a real time data platform. Redis Enterprise powers real-time services for over 8,000 organizations globally. It builds upon the unmatched simplicity and speed of Open-Source Redis along with an enterprise grade data platform that offers robustness of modern data models, management, automation, performance and resiliency to deploy and run modern applications at any scale from anywhere on the planet.