Menu

Post image 1
Post image 2
Post image 3
Post image 4
Post image 5
1 / 5
0

Hybrid Search and Re-Ranking in Production RAG | Towards Data Science

Towards Data Science·Priyansh Bhardwaj·20 days ago
#b6LZG5Ld
Reading 0:00
15s threshold

, we got a complaint from one of our platform engineers of the infrastructure team that our internal knowledge assistant is confidently giving the wrong answers when asked about the retry policy for our message-queue consumers. It looks like a simple question, and the assistant should have given a well-documented answer, but I was wrong. The system returned a three-paragraph response about exponential backoff with jitter. All of it was accurate but none of it was what she asked for. The actual document she wanted described a custom override we had put in place for one specific service after a production incident that hit us six months earlier. The document used the phrase “dead-letter queue threshold” repeatedly. Our embedding model had decided that “exponential backoff” and “dead-letter queue threshold” were semantically close enough. I checked the retrieval logs and found out that the document she needed was sitting at position eleven in the results just outside the top ten that were passed to the model.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More