Today, Databricks announced the launch of new retrieval augmented generation (RAG) tools within its Data Intelligence Platform. These tools are designed to assist businesses in building, deploying, and maintaining high-quality large language model (LLM) applications tailored to various use cases.
Now available in public preview, these tools tackle significant challenges in developing production-ready RAG applications. They streamline the process of integrating relevant real-time business data from diverse sources with the appropriate models, while also enabling effective monitoring of applications for issues like toxicity that commonly affect LLMs.
Craig Wiley, Senior Director of Product for AI/ML at Databricks, emphasized the urgency of developing RAG apps: “Organizations find it challenging to deliver solutions that consistently produce accurate, high-quality responses while implementing guardrails to prevent undesirable outputs.”
Understanding RAG and Its Challenges
While LLMs are gaining popularity, many existing models rely solely on knowledge baked into their parameters at training time, which limits their ability to provide up-to-date and context-specific responses, particularly for internal business needs. Retrieval augmented generation (RAG) addresses this by pulling in specific data sources at query time to improve the accuracy and reliability of model responses. For example, a model augmented with a company's HR data can answer employees' questions about benefits and policies.
RAG involves several complex tasks, including gathering and preparing structured and unstructured data from multiple sources, model selection, prompt engineering, and ongoing monitoring. This fragmented approach often results in underperforming RAG applications.
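The core retrieval-then-prompt loop described above can be sketched in a few lines. This is a purely illustrative stand-in: the toy `embed()` function below is a character-frequency vector, not a real embedding model, and the in-memory document list stands in for a vector store such as the one Databricks provides.

```python
# Minimal, illustrative RAG loop: embed documents, retrieve the closest
# ones to a query, and assemble a grounded prompt for an LLM.
from math import sqrt

def embed(text: str) -> list[float]:
    # Toy embedding: normalized character-frequency vector.
    # Illustrative only -- a real system would call an embedding model.
    alphabet = "abcdefghijklmnopqrstuvwxyz"
    counts = [text.lower().count(ch) for ch in alphabet]
    norm = sqrt(sum(c * c for c in counts)) or 1.0
    return [c / norm for c in counts]

def cosine(a: list[float], b: list[float]) -> float:
    return sum(x * y for x, y in zip(a, b))

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    context = "\n".join(f"- {d}" for d in retrieve(query, docs))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "Employees accrue 20 vacation days per year.",
    "The 401(k) match is 50% up to 6% of salary.",
    "Expense reports are due within 30 days.",
]
prompt = build_prompt("How many vacation days do I get?", docs)
print(prompt)
```

Even this toy version shows why the pipeline is fragile: retrieval quality, prompt construction, and data freshness each have to work for the final answer to be grounded.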
How Databricks Is Leading the Way
Databricks’ new RAG tools integrate these processes, allowing teams to rapidly prototype and deploy quality RAG applications. Features such as vector search and feature serving eliminate the need to build cumbersome data pipelines, as structured and unstructured data from Delta tables syncs automatically with the LLM application. This ensures access to the latest and most relevant business information for accurate, context-aware responses.
“Unity Catalog automatically tracks the lineage between offline and online datasets, simplifying the debugging of data quality issues and enforcing access control settings for better data governance,” noted Databricks' co-founder and VP of Engineering Patrick Wendell and CTO of Neural Networks Hanlin Tang.
Furthermore, developers can use the unified AI playground and MLflow evaluation to compare models from various providers, including Azure OpenAI Service, Amazon Bedrock, and open-source options like Llama 2 and MPT. This flexibility enables teams to deploy projects with the best-performing and most cost-effective models, while retaining the option to pivot to improved solutions as they become available.
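The pattern of scoring several candidate models against a shared evaluation set and picking a winner can be sketched as below. The `models` dictionary contains hypothetical stand-ins for real provider endpoints, and exact-match scoring is a deliberate simplification of what a tool like MLflow evaluation does.

```python
# Hypothetical model-comparison harness: run each candidate model over
# the same eval cases, score the answers, and pick the best performer.
from typing import Callable

EvalCase = tuple[str, str]  # (prompt, expected answer)

def score(model: Callable[[str], str], cases: list[EvalCase]) -> float:
    # Fraction of cases where the expected answer appears in the response.
    hits = sum(1 for prompt, expected in cases if expected in model(prompt))
    return hits / len(cases)

# Stand-in "models": each is just a function from prompt to response.
models = {
    "model-a": lambda p: "Paris" if "France" in p else "unknown",
    "model-b": lambda p: "unknown",
}

cases = [("What is the capital of France?", "Paris")]
results = {name: score(fn, cases) for name, fn in models.items()}
best = max(results, key=results.get)
print(best, results)
```

Keeping the harness independent of any one provider is what makes it cheap to swap in a better or cheaper model later.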
Advanced Monitoring Capabilities
After deploying a RAG application, monitoring its performance at scale is crucial. Databricks offers a fully managed Lakehouse Monitoring capability that automatically scans application responses for toxicity, hallucinations, or any unsafe content. This proactive detection feeds into dashboards, alert systems, and data pipelines, allowing teams to take corrective actions swiftly. The feature integrates with model and dataset lineage, facilitating quick identification of errors and their causes.
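At its simplest, this kind of monitoring is a scan over logged responses that flags problems and emits alerts for dashboards or downstream pipelines. The keyword blocklist below is a hypothetical stand-in for the trained classifiers a managed service would use.

```python
# Illustrative response-monitoring sketch: scan logged model responses
# for unsafe content and collect structured alerts.
UNSAFE_TERMS = {"hate", "violence"}  # hypothetical blocklist, not a real classifier

def scan_response(response: str) -> list[str]:
    """Return the unsafe terms found in a model response."""
    lowered = response.lower()
    return [term for term in UNSAFE_TERMS if term in lowered]

def monitor(responses: list[str]) -> list[dict]:
    # Each alert records which response was flagged and why, so a
    # downstream dashboard or pipeline can trace it back to its source.
    alerts = []
    for i, resp in enumerate(responses):
        flags = scan_response(resp)
        if flags:
            alerts.append({"response_id": i, "flags": flags})
    return alerts

alerts = monitor([
    "Our refund policy allows returns within 30 days.",
    "I hate this and wish violence on it.",
])
print(alerts)
```

The structured alert records are what make the lineage integration useful: an alert can be joined back to the model version and source dataset that produced the flagged response.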
Early Adoption Success
Although the new tools have just launched, enterprises such as RV supplier Lippert and EQT Corporation are already testing their capabilities within the Databricks Data Intelligence Platform. Chris Nishnick, who leads data and AI efforts at Lippert, shared, “Databricks enhances our call center operations by integrating diverse content sources into our Vector Search, ensuring agents have the knowledge they need at their fingertips. This innovative approach significantly improves efficiency and customer support.”
Internally, Databricks is also deploying RAG applications. According to Wiley, the company's IT team is piloting a RAG Slackbot for account executives and a browser plugin for sales development representatives.
Recognizing the increasing demand for specialized LLM applications, Databricks plans to invest significantly in its suite of RAG tools. The goal is to empower customers to deploy high-quality LLM applications on a large scale, with ongoing commitment to research and future innovations in this area.