DeepRace: GRU-Based Sequence Modeling Framework for Marathon Performance Prediction
Abstract
In recent years, data-driven approaches have gained prominence in sports analytics, offering new opportunities for optimizing athletic performance. This study presents DeepRace, a computational framework based on deep learning models including Recurrent Neural Networks (RNN), Long Short-Term Memory networks (LSTM), and Gated Recurrent Units (GRU) for marathon performance prediction. Using a dataset of 23,763 training activities from 549 athletes who participated in the 2017 Boston Marathon, we evaluated the effectiveness of sequence-based models in forecasting marathon finish times. The framework systematically preprocesses and analyzes sequential workout data, employing a multimodel selection algorithm to identify the most accurate predictive model. Experimental results demonstrate that the GRU model achieved superior performance, with a Mean Absolute Percentage Error (MAPE) of 5.2 percent and a Root Mean Squared Error (RMSE) of 0.11 hours, outperforming both RNN (MAPE 5.6 %) and LSTM (MAPE 6.5 percent) models. Notably, all deep learning models significantly surpassed the baseline linear regression model (MAPE 10%), confirming the advantage of deep learning for time-series performance prediction. These findings underscore the potential of GRU-based sequence modeling for enhancing marathon training strategies, offering actionable insights for athletes and coaches aiming to improve race outcomes through data-informed decisions.DOI:
https://doi.org/10.31449/inf.v49i28.8818Downloads
Published
How to Cite
Issue
Section
License
Authors retain copyright in their work. By submitting to and publishing with Informatica, authors grant the publisher (Slovene Society Informatika) the non-exclusive right to publish, reproduce, and distribute the article and to identify itself as the original publisher.
All articles are published under the Creative Commons Attribution license CC BY 3.0. Under this license, others may share and adapt the work for any purpose, provided appropriate credit is given and changes (if any) are indicated.
Authors may deposit and share the submitted version, accepted manuscript, and published version, provided the original publication in Informatica is properly cited.







