#Embeddings

22 posts

Pinterest cut AI costs 90% by gutting a frontier model's vision layer

At 620 million monthly users, calling a frontier model for every image recommendation isn't a strategy — it's a bill. Pinterest CTO Matt Madrigal solved it by gutting Qwen3-VL's vision layer and rebuilding it with proprietary embeddings, cutting costs 90%…

Vector Embeddings Basics: A Technical Introduction for AI

DEV Community: nlp·FreeDevKit·3 days ago

Explore vector embeddings, their function in AI, types like Word2Vec and BERT, applications in semantic search and NLP, and implementation considerations for de

How a Custom Multimodal Transformer Beat a Fine-Tuned LLM for Attribute

DEV Community·gentic news·about 1 month ago

LeBonCoin's ML team built a custom late-fusion transformer that uses pre-computed visual embeddings and character n-gram text vectors to predict ad at

Vector DB and ANN vs PHE conflict, is there a practical workaround? [D]

Reddit r/MachineLearning·u/XPERT_GAMING·about 1 month ago

Hey everyone, I have been digging into vector databases, ANN search, and privacy preserving techniques (specifically PHE), and I have hit a design roadblock that I would love some input on.…

Build a RAG System in Python (Without Overcomplicating It)

DEV Community·Ayesha·about 1 month ago
#llm #python #rag #tutorial #model #system

A few months ago, I built a chatbot that sounded very smart… Until it started confidently giving...

Semantic Caching for LLMs: FastAPI, Redis, and Embeddings - PyImageSearch

PyImageSearch·Vikram Singh·about 1 month ago

Build a semantic cache for LLMs using FastAPI, Redis, and cosine similarity to cut latency and cost with exact-match and semantic cache hits.
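
The two kinds of hit named in that summary (exact-match and semantic) can be sketched in a few lines. This is a minimal illustration and not the article's implementation: it leaves out FastAPI and Redis and uses an in-memory store instead, and `toy_embed`, `SemanticCache`, and the 0.8 threshold are hypothetical names and values chosen for the example. A real system would swap `toy_embed` for an actual embedding model.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """In-memory cache: exact lookup first, then nearest stored prompt."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed cutoff; tune per workload
        self.exact = {}             # normalized prompt -> response
        self.entries = []           # list of (embedding, response)

    def put(self, prompt, response):
        self.exact[prompt.strip().lower()] = response
        self.entries.append((toy_embed(prompt), response))

    def get(self, prompt):
        # 1) Exact-match hit on the normalized prompt string.
        key = prompt.strip().lower()
        if key in self.exact:
            return "exact", self.exact[key]
        # 2) Semantic hit: most similar stored prompt above the threshold.
        query = toy_embed(prompt)
        best_score, best_response = 0.0, None
        for embedding, response in self.entries:
            score = cosine_similarity(query, embedding)
            if score > best_score:
                best_score, best_response = score, response
        if best_response is not None and best_score >= self.threshold:
            return "semantic", best_response
        # 3) Miss: the caller would query the LLM and then call put().
        return "miss", None
```

With one entry stored via `put("what is a vector embedding", ...)`, the same prompt in different casing returns an `"exact"` hit, a close rewording such as `"what is a vector embedding exactly"` returns a `"semantic"` hit, and an unrelated prompt returns `"miss"`.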
