RAG sucks
Think about it for a second: sending back text chunks from documents, based on a query that sometimes doesn't even make sense to a search engine. Now add a hundred documents, thousands, millions. It will suck even more.
Search is a hard problem to solve, and when you add the non-deterministic behavior of current models, well, you're in for some pretty bad retrieval.
There are ways to make your RAG suck less. You could try hybrid search, bigger chunks, semantic chunking, or you could ask your LLM to handle query expansion and filtering to improve relevance. But guess what? Your RAG will still suck.
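To be concrete about what "hybrid search" even means here: you run a keyword search and a vector search in parallel, then fuse the two ranked lists. A common way to fuse them is reciprocal rank fusion (RRF). Here's a minimal sketch; the doc IDs, the two ranked lists, and the `k=60` constant are all illustrative, not taken from any particular system.

```python
# Minimal sketch of hybrid search result fusion via reciprocal rank
# fusion (RRF). The rankings below are made up for illustration.

def rrf(rankings, k=60):
    """Fuse several ranked lists of doc IDs into one ordering.

    Each doc gets 1 / (k + rank + 1) from every list it appears in;
    docs ranked high in multiple lists float to the top.
    """
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical results: one list from keyword (BM25-style) search,
# one from vector similarity search.
keyword_hits = ["doc3", "doc1", "doc7"]
vector_hits = ["doc1", "doc5", "doc3"]

fused = rrf([keyword_hits, vector_hits])
print(fused[0])  # → doc1, since it ranks high in both lists
```

The fused list still just feeds chunks to your LLM, of course, which is exactly the part that stays broken.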
And there is no way around that. Retrieving chunks and hoping they contain enough content for your LLM to extract the useful information is just that: hope.
Sometimes it is easier to identify a master document (or a few) and just copy them into your prompt.
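In code, the master-document approach is almost embarrassingly simple: no index, no embeddings, just concatenate a few curated documents into the prompt up to a size budget. This is a sketch; the document names, the budget, and the prompt layout are assumptions, not a recipe.

```python
# Minimal sketch of the "master document" approach: skip chunk
# retrieval entirely and paste hand-picked documents into the prompt.
# Doc names and the character budget are illustrative.

def build_prompt(question, master_docs, budget_chars=12000):
    """Concatenate curated (name, text) docs into a prompt, up to a budget."""
    context, used = [], 0
    for name, text in master_docs:
        if used + len(text) > budget_chars:
            break  # stop before blowing the context window
        context.append(f"## {name}\n{text}")
        used += len(text)
    return "\n\n".join(context) + f"\n\nQuestion: {question}"

docs = [
    ("onboarding-guide", "How to set up the dev environment..."),
    ("style-guide", "Naming conventions and review rules..."),
]
prompt = build_prompt("How do I set up my environment?", docs)
```

The curation is the real work: someone has to decide which documents are the masters and keep them up to date.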
Don’t do RAG, and if you have to… well, do it on heavily curated documents.