Updated: Independent Analysis

Horse Racing Results Data: CSV Downloads, Feeds & APIs

Machine-readable racing data — CSV exports, public APIs, commercial feeds and scraping considerations.

Computer screen displaying lines of racing data code and a CSV spreadsheet with horse racing results columns

Best Horse Racing Betting Sites – Bet on Horse Racing in 2026

Loading...

Not everyone who wants horse racing results is content to read them on a website. A growing community of developers, data scientists, model builders, and quantitative punters needs raw data for builders and analysts — machine-readable formats that can be ingested into spreadsheets, databases, and predictive models without the friction of manual entry or screen scraping.

BHA chief executive Brant Dunshea has acknowledged an ever-growing desire for data among those consuming and betting on racing. That desire extends well beyond the casual punter checking results on their phone. It encompasses hobbyist programmers building form-analysis tools, academic researchers studying equine performance, machine-learning practitioners training models on historical race outcomes, and professional syndicates whose entire approach to the sport is built on data infrastructure.

This guide surveys the landscape of machine-readable racing data in Britain: what is available for free, what requires a commercial licence, and what rules govern the use and redistribution of the data once you have it.

Free and Public Racing Data Sources

Genuinely free, structured racing data in the UK is rarer than you might expect. The sport’s commercial model treats results data as an asset, and much of it sits behind paywalls or licensing agreements. That said, several sources offer usable data without charge.

The BHA website publishes race results, fixture lists, and some statistical reports in formats that can be manually extracted or lightly scraped. The BHA’s quarterly and annual racing reports contain aggregate data — field sizes, runners per race, prize money totals, off-time punctuality — that is freely accessible in web format. With 1,458 fixtures scheduled for 2026, the volume of result data generated each year is substantial, and the BHA’s published statistics provide a useful top-level overview for researchers who need industry-wide metrics rather than horse-by-horse granularity.

The Gambling Commission publishes industry statistics including gross gaming yield, betting turnover, and market-share data in downloadable Excel format. While not racing-specific in every case, the horseracing data within these reports — including the GGY breakdown by sport — is machine-readable and free to download.

Several community-driven datasets exist on platforms like GitHub and Kaggle. These are typically compiled by individuals who have aggregated data from public sources over time and made them available for research purposes. The quality and completeness vary, and the datasets may lag behind current results by months or years, but they offer a practical starting point for anyone building a model or learning data analysis techniques using racing data. The provenance of the data should be checked carefully — not all community datasets are transparently sourced.

Exchange market data is another partial free source. The major betting exchange publishes historical market data for research purposes through its developer programme, though the level of access and the format have changed over time. Exchange data is not the same as official results — it covers odds movements and matched bets rather than finishing orders — but combined with results data from another source, it provides a rich picture of market behaviour around each race.

Commercial APIs: The Racing API and Others

For structured, reliable, and comprehensive racing data, commercial APIs are the standard route. These services provide real-time and historical results in JSON, XML, or CSV format via authenticated API endpoints, and they cover everything from racecards and declarations through to finishing orders, SPs, form figures, and sectional times.

The Racing API is one of the more prominent UK-focused providers. It offers endpoints for results, racecards, horse profiles, trainer and jockey statistics, and course data, with coverage across British and Irish racing. Pricing typically scales with usage — the number of API calls per month — making it accessible to hobbyist developers at a lower tier while supporting professional users at higher volumes.

The leading betting exchange API provides access to exchange market data rather than official results. It allows developers to pull live and historical odds, matched volumes, and market metadata for every race traded on the exchange. For quantitative analysts and model builders, an exchange API is a core tool: it enables automated analysis of market movements and, combined with results data from another provider, supports sophisticated analytical strategies. The scale of the market is considerable — remote horserace betting generated £766.7 million in gross gaming yield in the 2024-25 financial year, and a meaningful portion of that flows through the exchange.

Sporting Life and Racing Post both offer data products, though their APIs are generally geared towards commercial partners (media outlets, data resellers, affiliate sites) rather than individual developers. Access requires a licensing agreement and is priced accordingly. The data quality from these providers is among the highest available, drawing on decades of editorial curation and proprietary rating systems.

Other providers in the market include Timeform’s data servicesPA Media’s racing feed (which supplies data to most major news outlets and licensed operators), and specialist providers focused on particular data types such as sectional times or breeding records. The market is fragmented, and there is no single API that provides every possible data point — most serious data users end up combining feeds from multiple providers to build a complete picture.

Licensing, Scraping Rules and Data Integrity

The legal and ethical landscape around racing data is more complex than many newcomers realise, and getting it wrong can have consequences.

Scraping — extracting data from websites by automated means — sits in a grey area. Most major racing websites explicitly prohibit scraping in their terms of service. Whether those terms are legally enforceable in every case is debatable, but the practical reality is that aggressive scraping can result in IP bans, legal warnings, and reputational damage. If you need structured data, the API route is both more reliable and more defensible than scraping.

Database rights under UK and EU law protect the investment in compiling and maintaining a database, even if the individual data points are factual and not subject to copyright. Racing Post’s database of results, for instance, is protected by the substantial investment the company has made in curating it. Reproducing that database in whole or in significant part without a licence would likely infringe database rights, regardless of whether the underlying facts (finishing positions, SPs) are in the public domain.

Data integrity is a separate concern. When you pull data from any source — free or commercial — you are trusting the provider’s accuracy. Errors in results data do occur: occasional mis-recorded SPs, transposed finishing positions, or missing non-runner information. Cross-referencing between multiple sources is the best defence, and any dataset used for modelling or betting decisions should be validated against at least one alternative source before being trusted.

For anyone building a serious data pipeline for racing analysis, the combination of a commercial API for structured daily data, the BHA’s published statistics for industry-level context, and exchange market data for price behaviour provides a robust foundation. Add a validated historical dataset and a clear licensing arrangement, and you have the raw data infrastructure that the growing community of racing’s data analysts depends on.

One final note: the landscape is evolving. As the sport invests in data products and acknowledges the commercial value of its performance data, access terms and pricing models are changing. What is freely available today may be behind a paywall tomorrow, and what requires an enterprise licence now may become available at a more accessible tier as providers compete for the developer market. Keeping abreast of changes from the BHA and the major data providers is part of the ongoing maintenance that any data-driven racing project requires.