Do it again wrote:
Show the numbers from 2008 onwards for sub 2:09, sub 2:08, sub 2:07 etc and see how that compares.
This can be done quickly from the Mac or Linux command line:
download marathon times and convert to tsv:
curl -s "http://www.alltime-athletics.com/mmaraok.htm" | grep -E "[0-9]:[0-9]{2}:[0-9]{2}" | perl -pe 's/<[^>]*>//g' | perl -pe 's/ {2,}/\t/g' > marathon.tsv
print sub-2:10 count by year:
grep "2:0[0-9]:" marathon.tsv | perl -pe 's/^.*\.(\d\d\d\d).*$/$1/' | sort | uniq -c
print sub-2:09 count by year:
grep "2:0[0-8]:" marathon.tsv | perl -pe 's/^.*\.(\d\d\d\d).*$/$1/' | sort | uniq -c
print sub-2:08 count by year:
grep "2:0[0-7]:" marathon.tsv | perl -pe 's/^.*\.(\d\d\d\d).*$/$1/' | sort | uniq -c
...etc
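Rather than repeating the command for each threshold, the same idea can be wrapped in a small function. This is only a sketch: count_sub is a made-up helper name, it assumes marathon.tsv is tab-separated with the time as h:mm:ss and the race date as dd.mm.yyyy, and it counts with awk instead of sort | uniq -c. The optional second argument is a starting year, so passing 2008 gives the "2008 onwards" numbers asked for.

```shell
# count_sub MINUTES [FROM_YEAR] < marathon.tsv
# Prints "year<TAB>count" for marks faster than 2:MINUTES:00.
# Hypothetical helper; assumes h:mm:ss times and dd.mm.yyyy race dates.
count_sub() {
    awk -F '\t' -v limit="$1" -v from="${2:-0}" '
        {
            mins = -1; year = ""
            for (i = 1; i <= NF; i++) {
                # first field that looks like 2:mm:ss is the finishing time
                if (mins < 0 && $i ~ /^2:[0-9][0-9]:[0-9][0-9]/)
                    mins = substr($i, 3, 2) + 0
                # last field that looks like dd.mm.yyyy is the race date
                # (a two-digit-year date of birth does not match)
                if ($i ~ /^[0-9][0-9]\.[0-9][0-9]\.[0-9][0-9][0-9][0-9]/)
                    year = substr($i, 7, 4)
            }
            if (mins >= 0 && mins < limit && year != "" && year + 0 >= from)
                count[year]++
        }
        END { for (y in count) printf "%s\t%d\n", y, count[y] }
    ' | sort
}

# Only runs if the file from the download step is present.
if [ -f marathon.tsv ]; then
    for m in 10 9 8 7 6 5; do
        printf 'sub-2:%02d by year, 2008 onwards:\n' "$m"
        count_sub "$m" 2008 < marathon.tsv
    done
fi
```

Matching the minutes numerically also avoids a slower time such as 2:12:05 being picked up because it happens to contain "2:05".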
print sub-2:10 by year excluding November and December:
grep "2:0[0-9]:" marathon.tsv | grep -Ev "\.1[12]\.[0-9]{4}" | perl -pe 's/^.*\.(\d\d\d\d).*$/$1/' | sort | uniq -c
print sub-2:10 count by athlete:
grep "2:0[0-9]:" marathon.tsv | perl -pe 's/ /_/g' | awk '{print $3}' | sort | uniq -c | sort -n
print sub-2:10 count by location:
grep "2:0[0-9]:" marathon.tsv | perl -pe 's/^.*\t([^\t]+)\t\d\d\.\d\d\.\d\d\d\d.*$/$1/' | sort | uniq -c | sort -n
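The athlete and location counts differ only in which column gets tallied, so one helper can cover both. A rough sketch, with assumptions: tally_by is a made-up name, and the column numbers are guesses based on the awk '{print $3}' used for athletes (after any leading whitespace is dropped: 1 rank, 2 time, 3 athlete, 4 nationality, 5 date of birth, 6 place, 7 city, 8 date). Check them against head marathon.tsv before trusting the output.

```shell
# tally_by COLUMN [MINUTES] < marathon.tsv
# Prints "count<TAB>value" for the chosen tab-separated column, restricted
# to marks faster than 2:MINUTES:00 (default 10), smallest count first.
# Hypothetical helper; the column layout is an assumption, not verified.
tally_by() {
    awk -F '\t' -v col="$1" -v limit="${2:-10}" '
        {
            sub(/^[ \t]+/, "")   # drop leading whitespace so the rank is column 1
            for (i = 1; i <= NF; i++)
                if ($i ~ /^2:[0-9][0-9]:[0-9][0-9]/) {
                    if (substr($i, 3, 2) + 0 < limit) count[$col]++
                    break
                }
        }
        END { for (k in count) printf "%d\t%s\n", count[k], k }
    ' | sort -n
}

# Only runs if the file from the download step is present.
if [ -f marathon.tsv ]; then
    tally_by 3 < marathon.tsv   # by athlete (assumed column)
    tally_by 7 < marathon.tsv   # by city (assumed column)
fi
```

Because the fields are split on tabs, names and cities containing spaces stay intact without the underscore substitution.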