Visualizing the IHSA State Cross-Country Meet

By Max Candocia

|

April 07, 2018


Last Updated 04/07/2018

Introduction

In Illinois, high schoolers who run cross-country compete in 3-mile races throughout the fall, usually competing at local parks, or possibly farmland if they live in a more rural area. At the end of the season, which is early November, athletes from each division—1A, 2A, and 3A—compete in Peoria's Detweiller Park, a fairly flat and open course. The division corresponds to the size of the competing school, where 1A is the smallest, and 3A is the largest.

To get a better idea of what the race looks like, below is the map of the course from the IHSA website, followed by some footage I took on my old digital camera back in 2006 of that year's race:

Detwieler Park Cross-Country Course

First 12 seconds of the race, followed by 2:20-3:00 of the race. The winning time was 14:07 by Evan Jager, current North American record holder of the 3000m steeple chase.

If you look at the map, you can see that a very wide start line converges into a very small bottleneck after about 0.3 miles. After that, the course is relatively narrow for the remainder.

Observing Trends

The results of past races are available on the IHSA website for both girls and boys.

In recent years, viewing splits from chip timing has become more common, and River City Race Management has made the splits of the races available for the years 2015, 2016, and 2017. The results, however, are not all formatted the same, and it appears that individual results for the 2017 2A Boys race is missing. Judging by the filenames of the other results from that year, all of the results had been misplaced originally, but the boys 2A race was the only one that didn't get fixed.

There is quite a bit of information that can be gathered from here, but I will look at a few different https://maxcandocia.com/media/articles/ihsa_xcs to get an idea of what competition looks like and what the characteristics of the faster runners are in each division for each sex.

Average Time

The average race paces of each division seems to follow a common trend: the higher the division, the faster the average pace and the more narrow the distribution. Even the girls 3A division is narrower than the boys 1A division, despite the slower average speed.

The girls' races seem to be wider, as well, although this is partly because the difficulty in increasing pace becomes when one is running faster. For example, improving from a 7-minute/mile pace to a 6:30/mile pace is much easier than improving from a 6:30/mile to 6:00/mile pace.

plot of chunk pace_density

Differences in Split Times

When racing, you ideally want to maintain as even of a pace as possible. Confounding factors that can break this strategy include changes in elevation, changes in terrain, and obstructions. Oftentimes runners (especially younger ones) will start out very fast and then slow down partway into the race. For this race, it is somewhat unavoidable, as the beginning starts on a downhill, followed by a bottleneck that is somewhat unforgiving if you are caught in a crowd moving slower than you'd like.

Looking at how percent changes in speed over the three miles' splits correlate to place can give us an idea of what strategy the best runners are using.

plot of chunk split_deviations

It looks like the ones who are performing worse across all divisions and sexes are slowing down considerably after the first mile, around 10-15%, while the top performers tend to only slow down around 5-10%. We can look at the top 25 of each race (those are the places that earn a medal and the title "all-state"), we can see if this trend continues for the fastest runners.

plot of chunk top_split_deviations

The most noticeable trend I can see is that the faster runners (dark purple) have more of a tendency to finish their third mile noticeably faster than their second in the boys 3A division. That is quite impressive, considering they are already running sub 5-minute miles back-to-back. When I was looking at the results, I noticed that the top runners were neck-to-neck going into the third mile.

Year in School

In high school, it was fun to see/speculate how different ages performedacross the different sexes and divisions. There was a notion that it was mostly juniors and seniors (3rd and 4th year) placing for boys, while age didn't matter at all for girls.

Looking at the below bar graph, it seems like that assertion is correct, to some degree. If I had to explain it, I would say that boys are still hitting puberty by the time they enter high school, and their muscles develop much more significantly than girls'. There seems to be an odd ridge for the girls 3A division, although I think that is a time-lag effect from a particularly strong class of girls in 2015 as sophomores and in 2017 as seniors.

plot of chunk year_in_schoo

Years at State

One last thing I wanted to look at was the distribution of the top 25 runners in each race based on how many years they ran a state meet in the past 2 years. Since I did not collect data for 2014, I am only looking at 2017 data. For the 3A division, it seems like having been to the state race before is a prerequisite for doing well.

plot of chunk state_experience

I can't make too many generalizations about the 2A division, since the boys' data is missing, but the 1A data seems almost downright counterintuitive with the more experienced runners not performing as well. This is a byproduct of statistical censoring, though. If we view all 200+ racers at once for each boxplot, then it is clear that more experience at the state meet is advantageous.

plot of chunk state_experience_2

Links


Tags: 

Recommended Articles

Analyzing the Politics of Reddit - Part 1

Reddit's wide variety of political Subreddits can be compared and grouped together in several ways. Using user post and comment history, comment text, and moderator information, subreddits can be clustered into groups based on different ideas.

Wordclouds of America

Wordclouds of how Americans describe America