#Bm25

7 posts

Two Retrieval Methods Are Better Than One: Evidence from 500 Clinical Queries

DEV Community · Igor Eduardo · 19 days ago
#python #rag #ai #bm25 #dense #hybrid

When I set out to evaluate retrieval configurations for Portuguese clinical text, I expected one...


Codebase-scale retrieval using AST-derived graphs + BM25 — reducing LLM context from 100K to 5K tokens [D]

Reddit r/MachineLearning · u/Altruistic_Night_327 · about 1 month ago
#bm25 #graph #retrieval #embedding #file #article

Wanted to share an approach I've been using for retrieval-augmented generation over large codebases and get feedback from people thinking about similar problems.…

RAG Without Embeddings: When BM25 Beats Your $0.20-per-1K Vector Index

DEV Community · Gabriel Anhaia · about 1 month ago
#when #rag #bm25 #dense #corpus #retrieval

BEIR still shows BM25 holding its own on exact-match domains. Here is the decision rule, the Postgres setup, and when hybrid actually pays.
