Why LLMs Can't Surpass a 1970s Technique, Yet Remain Valuable

This year, the MIT Data to AI Lab explored the use of large language models (LLMs) for anomaly detection in time series data, a task traditionally handled by other machine learning (ML) tools. Anomaly detection is crucial across industries for monitoring heavy machinery and catching potential issues before they escalate. We designed a framework that uses LLMs for this task and compared their performance against ten other methods, ranging from state-of-the-art deep learning techniques to the classic autoregressive integrated moving average (ARIMA) model from the 1970s. Surprisingly, the LLMs underperformed most of the other models; even ARIMA outclassed them on seven of the eleven datasets.
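To make the comparison concrete, here is a minimal sketch of the kind of classical baseline ARIMA provides: fit the model to a signal, then flag points whose one-step-ahead prediction errors are extreme. The (2, 1, 2) order and the three-sigma threshold are illustrative assumptions for this example, not the settings used in our study.

```python
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

def arima_anomalies(signal, order=(2, 1, 2), n_sigmas=3.0):
    """Flag points whose one-step-ahead prediction error is extreme."""
    fit = ARIMA(signal, order=order).fit()
    resid = fit.resid  # one-step-ahead forecast errors
    threshold = n_sigmas * resid.std()
    return np.where(np.abs(resid) > threshold)[0]

# Toy example: a noisy sine wave with a spike injected at index 825.
t = np.linspace(0, 20 * np.pi, 1000)
signal = np.sin(t) + 0.05 * np.random.randn(1000)
signal[825] += 3.0
print(arima_anomalies(signal))  # should flag the injected spike
```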

For those who envision LLMs as all-encompassing solutions, these results may seem discouraging, though they simply reflect the current limitations of the technology. Still, two findings stood out. First, LLMs outperformed some of the models, including certain transformer-based deep learning methods, which took us by surprise. More importantly, LLMs performed anomaly detection zero-shot, meaning they operated without prior examples or any fine-tuning. Using GPT-3.5 and Mistral in their off-the-shelf forms, we showed that LLMs can detect anomalies without a specialized model being trained for each signal, which significantly streamlines the process.
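As a rough illustration of what zero-shot detection looks like in practice, the sketch below serializes a window of readings into a prompt and asks an off-the-shelf model to point out anomalous positions. The prompt wording, the two-decimal rounding, and the use of the OpenAI chat API are assumptions made for this example, not a description of the study's exact pipeline.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def llm_anomalies(values, model="gpt-3.5-turbo"):
    series = ", ".join(f"{v:.2f}" for v in values)
    prompt = (
        "The following is a time series of sensor readings:\n"
        f"{series}\n"
        "Reply with the 0-based indices of any anomalous values "
        "as a comma-separated list, or 'none' if there are no anomalies."
    )
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic output for a monitoring task
    )
    return response.choices[0].message.content

print(llm_anomalies([0.1, 0.2, 0.1, 9.8, 0.2, 0.1]))  # expect '3'
```

Note that nothing is trained here: the same call works for any signal, which is exactly the property that makes the zero-shot result interesting.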

Current anomaly detection methods involve a two-step process of training and then deploying an ML model, which can be complex and cumbersome. Operators often lack experience with ML, which raises questions about retraining frequency, data input, and signal management; these barriers frequently stall the deployment of trained models. LLMs, by contrast, let operators control anomaly detection through simple API queries, adding or removing signals and toggling the service on or off without relying on other teams. This autonomy may facilitate broader adoption of LLMs in industrial settings.
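The following hypothetical sketch shows what that operator workflow could look like: signals are added or removed with plain function calls, the service can be toggled, and detection is one query per signal. The `AnomalyMonitor` class is invented for illustration and reuses the `llm_anomalies` function from the earlier sketch.

```python
class AnomalyMonitor:
    """Hypothetical operator-facing wrapper around an LLM detector."""

    def __init__(self, detect_fn):
        self.detect_fn = detect_fn  # e.g., llm_anomalies from above
        self.signals = {}           # signal name -> latest readings
        self.enabled = True

    def add_signal(self, name, readings):
        self.signals[name] = readings

    def remove_signal(self, name):
        self.signals.pop(name, None)

    def run(self):
        if not self.enabled:
            return {}
        # One API query per signal; no training or redeployment step.
        return {name: self.detect_fn(readings)
                for name, readings in self.signals.items()}

monitor = AnomalyMonitor(llm_anomalies)
monitor.add_signal("turbine_vibration", [0.1, 0.2, 0.1, 9.8, 0.2])
print(monitor.run())
```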

While LLMs have sparked a reevaluation of anomaly detection, they still lag behind state-of-the-art deep learning models, and even ARIMA, in performance. This gap may stem from our decision not to fine-tune the LLMs or to build a foundation model designed explicitly for time series. To improve anomaly detection accuracy while preserving the advantages inherent to LLMs, we must tread carefully.

This means we should avoid:

1. Fine-tuning existing LLMs for specific signals, as this would compromise their zero-shot capabilities.

2. Developing a foundation LLM for time series with a fine-tuning layer for each new type of machinery, as this would return us to the complexity of training a model for every signal.

For LLMs to hold their own in anomaly detection or other ML tasks, they must either enable a new way of performing the task or open up possibilities that would otherwise be out of reach. The AI community also needs to establish safeguards so that efforts to improve LLM performance do not erode these foundational benefits.

In classical ML, it took nearly two decades to establish robust practices such as train/test/validate splits. Even with those methods, it remains hard to guarantee that a model's test performance will carry over to deployment, because of issues like label leakage and data biases. To avoid sliding back into similarly convoluted practices, we must define clear parameters for enhancing LLM capabilities in anomaly detection.
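As one concrete example of such a practice, evaluation splits for time series have to respect time order: a random shuffle leaks future values into training. This minimal sketch, with illustrative 60/20/20 proportions, shows a leakage-safe chronological split.

```python
def chronological_split(values, train=0.6, val=0.2):
    """Split a series in time order so no future data leaks into training."""
    n = len(values)
    i, j = int(n * train), int(n * (train + val))
    return values[:i], values[i:j], values[j:]

train_set, val_set, test_set = chronological_split(list(range(10)))
print(train_set, val_set, test_set)  # [0..5] [6, 7] [8, 9]
```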

Kalyan Veeramachaneni is the director of the MIT Data to AI Lab and co-founder of DataCebo.

Sarah Alnegheimish is a researcher at the MIT Data to AI Lab.

