Monday, November 4, 2013

Py Win Rankings - Week 19 and year end wrap-up

The regular season is over and unfortunately for us fans, it ended with a week that meant precisely nothing to the playoff picture.  As a result, we ended up with a weekend slate of games populated by backups.  Too bad, especially since a narrow Edmonton win dragged them closer to their py expectations and made their final numbers a little bit less interesting.

Calgary finishes the season on top, Winnipeg finishes the season at the bottom, and Hamilton and Montreal wind up in a tie despite a 2 game difference in the official standings.

Here are the final numbers:

Luckiest Team: Calgary (+2.1 wins)
Unluckiest Team: Edmonton (-3 wins)

Biggest Jump: Hamilton (+0.8 projected wins)
Biggest Drop: Calgary (-0.5 projected wins)




2013 Recap

Now that the season is complete, we can go back and see how things changed since I posted my first article back in week 10.

As I noted in my introductory article, the primary value of the pythagorean expectation formula is as an indicator of future results.  While I used it here as a sort of "mathematical" power ranking, that's really not its purpose.

#1 Calgary
Started: 7-2, 1.2 wins over expectation
Finished: 14-4, 2.1 wins over expectation
All time rank: #32 of 207

The Stamps ignored the odds and finished the second half of the season the same way as the first - 7-2 and roughly 1 win over expectation.  Calgary's finishing total of 2.1 wins over expectation is the second highest since 1990, tied with Winnipeg in 2001 (lost in the Grey Cup), and Baltimore in 1995 (won the Grey Cup).


#2 Saskatchewan
Started: 8-1, 1.4 wins over expectation
Finished: 11-7, 0.7 wins below expectation
All time rank: #37 of 207

The Riders started strong but regressed towards expectations over the second half of the season.  With the league's best scoring defense and second best offense, this Rider team finishes as the best Rider team since 1990, according to Py win percentage.

#3 Toronto
Started: 5-4, exactly on expectation
Finished: 11-7, 1 win over expectation
All time rank: #75 of 207

Toronto was pretty consistent all year.  They got a bit luckier in the second half of the season after playing right along expectations in the first half.

#4 BC
Started: 6-3, 1.3 wins over expectation
Finished: 11-7, 1.1 wins over expectation
All time rank: #76 of 207

Like Toronto, BC was fairly consistent for most of the year.  The #3 and #4 teams jumped back and forth all year, finishing with nearly identical seasons.  Toronto scored 3 more points than BC, and BC allowed 2 fewer points than Toronto.  They end up back to back in the all time rankings, a mere 0.018 Py wins apart.

#5 Montreal
Started: 4-5, 0.3 wins over expectation
Finished: 8-10, 0.7 wins below expectation
All time rank: #116 of 207


2013 was a rough year for Alouette fans, but they can take some solace in the fact that the math says they are just the tiniest bit better than Hamilton, despite the 2 game difference in records.

#6 Hamilton (tie)
Started: 4-5, 0.1 wins below expectation
Finished: 10-8, 1.3 wins over expectation
All time rank: #119 of 207

The two game difference between Montreal and Hamilton is why stats like this were invented.  Like BC and Toronto, these teams had virtually identical seasons, separated by 6 points offensively, and 3 points defensively, and yet Hamilton finishes 2 games clear of Montreal in the standings.  Expect a close one in Guelph this weekend.  (Side note - is there anyone out there who'd have guessed that Montreal finishes with the better offense, and Hamilton with the better defense?)

#7 Edmonton
Started: 1-8, 2.4 wins below expectation
Finished: 4-14, 3 wins below expectation
All time rank: #160 of 207

From a math standpoint, Edmonton was the most interesting team in the league this year.  Their close losses early in the season inspired me to start collecting these stats, and unfortunately for Eskimo fans, their luck did not improve in the second half of the season.  A meaningless week 19 win brings their win differential up slightly, but at -3.0 wins it is still good for a tie for second-unluckiest all time.

#8 Winnipeg
Started: 1-8, 1.4 wins below expectation
Finished: 3-15, 1.3 wins below expectation
All time rank: #200 of 207

The Bombers were the worst team in the league this year, and it wasn't particularly close.  Their defense allowed 66 points more than 7th place Edmonton, and in a year where half the league scored more than 500 points, Winnipeg wasn't even close to cracking 400.  According to the numbers, only 7 teams since 1990 have been worse, and two of those teams don't even exist anymore (the 1995 Ottawa Rough Riders, and the 1994 Shreveport Pirates).


Playoff Predictions

My research into how Py Wins and Big Wins can be used to project playoff stats is incomplete at this point, but what I do have so far indicates that the answer is probably "not very well".

Again going back to 1990, the best team according to Py Wins has only gone on to make the Grey Cup 56.5% of the time, and only won it 43.5% of the time.  That fares poorly compared to simply using wins as a projector, where the team with the most wins (outright or tied) has made the Grey Cup 65% of the time, and won it 56.5% of the time.

I intend to do some more research in the offseason, but my theory at this point is that home field advantage, coupled with the bye that the division winner gets, is a significant enough advantage that it more than offsets any difference in team quality, especially since the two teams playing in the West or East Final games should typically be fairly close in quality.

With that in mind, here are my mathematically unsound, empirically irrelevant predictions:

West Semi - BC @ SSK
BC has been poor on the road (3-6) and the Riders are above average (6-3) at home.   They also beat BC twice fairly handily.  I like Saskatchewan to advance here.

East Semi - MTL @ HAM
Hamilton was good at home (6-3), and Montreal was just OK on the road (4-5), but the math suggests they are very evenly matched, and it took a wacky special teams play for Hamilton to pull off the last one.  I think Montreal will put this one away earlier and avoid the late game shenanigans.

West Final - SSK @ CGY
Calgary is just too good at home (8-1), and despite the two teams playing each other close this season, it just seems like Calgary has had Saskatchewan's number since that early loss at Mosaic.  It pains me to write this, but I see Calgary winning this one.

East Final - MTL @ TOR
Remember what I said about top seeds and playoff success?  I don't see this game defying the odds.  Toronto is a better team on both sides of the ball, and I like them to win and set up a rematch of the 2012 Grey Cup.

Grey Cup - CGY vs TOR
I really hope I'm wrong about this matchup, and I get to see the Riders play in the Grey Cup at home.  But there is no room for hope in predictions, only speculation and BS.  Toronto pulled off a crazy upset at McMahon earlier in the year, but this should basically be a home game for Calgary, and the Stamps have been the best team all year.  I like the Stamps to win it all, adding another mark in the "won grey cup" column for both the "Most wins" and "Most Py Wins" statistics.

Tuesday, October 29, 2013

Py Win Rankings - Week 18

One last week of numbers before the end of the season, and it's looking to me like this year is going to stand out on both ends of the spectrum.  It looks like a virtual certainty that Edmonton will finish as the second unluckiest team of all time, and now it's looking like Calgary will be the luckiest 14 or 15 win team in history as well.  Other teams have finished further above their Py Expectation, but only Baltimore in 1995 has finished with 15 wins and been more than 2 wins above expectation.  It's far from empirical, but it's worth noting that Baltimore won the Grey Cup that year.

The rankings themselves haven't changed at all, and there aren't even any interesting projection changes.  That's of course because as the season goes on, each game affects the totals by a smaller percentage than previous games, so things are mostly stable by now.  Based on the gaps between teams at this point, I don't anticipate any changes next week either, other than perhaps BC moving up a spot if they win big and Toronto loses.

Next week after we have the final numbers, I'll take a look at each team and how historically similar teams have fared in the playoffs and future seasons.

Luckiest Team: Calgary (+2.3 wins)
Unluckiest Team: Edmonton (-3.4 wins)

Biggest Jump: Toronto and BC (+0.3 projected wins)
Biggest Drop: Saskatchewan (-0.3 projected wins)

Monday, October 21, 2013

Py Win Rankings - Week 17

2 weeks left in the season (playoff time, for the fantasy football fans).

A bit of shuffling in the ranks this week, as 4 teams switch places.  Toronto and Montreal move up, BC and Hamilton move down.  My broken record repeats as Edmonton continues to be historically unlucky.

Luckiest Team: Calgary (+1.9 wins)
Unluckiest Team: Edmonton (-3.1 wins)

Biggest Jump: Montreal (+0.8 projected wins)
Biggest Drop: Hamilton (-0.7 projected wins)

 * In hindsight, my decision to call column 10 "Projected" was a poor one.  It was never a true projection, it's merely the teams' Py winning percentage extrapolated over 18 games.  It looks quite silly now that Calgary has more real wins than "projected" wins.  I'll find a better name next year, or better yet, work on a proper projection.

Tuesday, October 15, 2013

Py Win Rankings Week 16

Nothing to see here folks, no change at all.

Calgary stays on top, Winnipeg on the bottom.  Even the projections for the top 2 teams held steady (and to be clear, those aren't a prediction for how many wins I expect a team to finish with, they are just the result of the pythagorean formula taken over 18 games).

One thing to note here: barring some kind of miraculous turnaround, Edmonton is closing in on one of the unluckiest seasons in the past 20+ years.  Their current total of -3.0 wins vs expectation would finish in a tie for second place with Hamilton in 2008, only behind Winnipeg's -4.5 in 2010.   Eskimo fans take heart - each of those teams followed up their historically unlucky seasons with big turnarounds the next year - 9 wins and a home playoff game for the Tiger-Cats, and 10 wins and a Grey Cup appearance for the Bombers.

Luckiest Team: Calgary (+1.7 wins)
Unluckiest Team: Edmonton (-3 wins)

Biggest Jump: Winnipeg (+0.5 projected wins)
Biggest Drop: BC (-0.5 projected wins)


Thursday, October 10, 2013

Py Rankings Week 15

Little late on this one, sorry to anyone who was looking for this post earlier in the week.

After many weeks of hanging around despite losses, the Riders win this week and still relinquish their hold on top spot, dropping to #2 and leaving Calgary alone at the top, while Montreal and Edmonton swap places near the bottom.

Nothing overly surprising this week; the rankings exactly match the CFL standings.

Luckiest Team: Calgary (+1.5 wins)
Unluckiest Team: Edmonton (-2.6 wins)

Biggest Jump: Montreal (+0.9 projected wins)
Biggest Drop: Edmonton (-0.6 projected wins)

Monday, September 30, 2013

Py Rankings Week 14

Calgary finally jumps into first place - sort of.  Four straight losses aren't quite enough to knock Saskatchewan from the top of the heap, but they do fall into a tie for first with Calgary.  After spending most of the season as both the highest scoring offense and the strongest defense, the Riders drop into a tie for second in scoring, while remaining the top scoring defense.

A strong showing by the Argos wasn't enough to get them any math love, as a dominant BC win bumps the Lions up to 3rd place.

Edmonton continues to underperform, the math gods still don't like Montreal, and then there's Winnipeg.

Luckiest Team: Tie - Calgary and Toronto (+1.5 wins)
Unluckiest Team: Edmonton (-2.6 wins)

Biggest Jump: BC (+1.0 projected wins)
Biggest Drop: Hamilton (-0.7 projected wins)

Tuesday, September 24, 2013

Does winning close games help in the playoffs?

A while back, I added the "Big Win" stat to the Py Win Rankings.  The idea behind this stat is that it counters the notion that "good teams win close games", and that winning close games prepares a team for success in the playoffs.

The idea for the stat, and the logic behind it, comes from Jim Glass of advancednflstats.com.  In his article, he analyzes playoff records for the best playoff teams, and groups them by their record in close games and blowouts.  I'm following his example with the CFL data.

Test Data


Like Mr. Glass, I will filter the data to the top tier playoff teams, those which made the playoffs with at least 10 wins.  More wins means of course that they will have more "big wins" or "close wins" respectively.  It will filter out some Grey Cup champions which didn't finish in the top of the league, but the goal here is simply to determine if big wins or close wins are more closely related to playoff success, not to analyze each Grey Cup winner in detail.

Filtering to 10+ win teams since 1990 leaves us with 93 teams.

I'm using 9 points as the cutoff between a blowout and a close win: games decided by 9 or more points count as blowouts.  The original NFL article used 10 points, but Mr. Glass later amended it to 9 points, which represents the cutoff between a 1 and 2 possession game.
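
To make the split concrete, here is a small Python sketch of how one team's results could be divided into close games and blowouts with that 9-point cutoff.  The function name and the sample margins are hypothetical (not real game data), and tied games are simply left out of both records in this sketch.

```python
def split_record(margins, cutoff=9):
    """Return ((close W, close L), (big W, big L)) from per-game point margins.

    Each margin is points for minus points against.  Games decided by
    `cutoff` points or more count as big; tied games (margin 0) are skipped.
    """
    close = [m for m in margins if 0 < abs(m) < cutoff]
    big = [m for m in margins if abs(m) >= cutoff]
    close_record = (sum(m > 0 for m in close), sum(m < 0 for m in close))
    big_record = (sum(m > 0 for m in big), sum(m < 0 for m in big))
    return close_record, big_record


# Hypothetical margins for an eight-game stretch.
sample = [3, -7, 21, 10, -14, 1, -2, 9]
close_record, big_record = split_record(sample)
print(close_record)  # (2, 2)
print(big_record)    # (3, 1)
```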


Results

Most close wins:

  • Of the 93 teams, 8 won 8 or more close games.  Their record in the playoffs was 5-7.
  • The 15 teams with the best records in close games combined to be 91-14 (87%) in those close games.  In the post-season, they had a record of 17-11 (60%), 6 Grey Cup appearances, and 4 winners.
Fewest close wins:
  • 20 of the 93 teams had losing records in close games.  The 15 with the worst record in close games combined for a record of 34-66 (34%).  In the post season, they had a record of 19-11 (63%), 9 Grey Cup appearances, and also 4 winners.
It doesn't appear that a team's record in close games matters very much at all, as both the best and worst teams in close games have very similar records come playoff time.

Grey Cup Winners:

Looking at things from the perspective of the 20 champions (the 2012 Argos, 2000 Lions and 2001 Stamps didn't make the 10 win cut), their win-loss records shake out as follows:
  • The playoffs: 47-0 (of course)
  • The regular season: 252-106-2 (70%)
  • Close games during the regular season: 81-52 (61%)
  • Big wins/losses during the regular season: 171-54 (76%)
A 61% win rate in close games does show some ability to win close games, though not much better than a coin flip.  They were slightly better than the rest of the 10 win teams, however, as the average for all 93 teams was 60%.

Playoff results by win cohort:

Here's how winning close games matches up with winning playoff games, grouped by W-L record.  For the purposes of this exercise, I'm considering a tie to be a loss (it's not "clutch" to tie, right?).  Only 2 teams finished with ties.  (Value in brackets is winning percentage in close games).

15-3 (7 teams)
Top 3 (83%): 6-1, 2 GC winners
Low 4 (68%): 7-2, 2 GC winners

14-4 (4 teams)
Top 2 (86%): 1-2
Low 2 (69%): 1-2

13-5 (17 teams)
Top 8 (82%): 8-6, 2 GC winners
Low 9 (51%): 10-5, 4 GC winners

12-6 (20 teams)
Top 10 (72%): 9-1, 3 GC winners
Low 10 (49%): 15-8, 2 GC winners


11-7 (24 teams)
Top 12 (69%): 11-10, 2 GC winners
Low 12 (46%): 11-10, 2 GC winners

10-8 (21 teams)
Top 10 (61%): 10-9, 1 GC winner
Low 11 (40%): 9-11

The two groups win at nearly the same rate.
  • The "higher halves" have a much better record in close games, but a 45-29 record in the playoffs (61%).
  • The "lower halves" are much worse in close games, but have a similar record at 53-38 (58%), and more Grey Cup winners (10 vs 9).
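
The totals in those two bullets come from adding up the playoff records in the cohort lists above.  A quick Python sketch of that arithmetic, using the records exactly as listed (the variable and function names are just for illustration):

```python
# Playoff (wins, losses) for each W-L cohort, copied from the lists above,
# in order: 15-3, 14-4, 13-5, 12-6, 11-7, 10-8.
top_halves = [(6, 1), (1, 2), (8, 6), (9, 1), (11, 10), (10, 9)]
low_halves = [(7, 2), (1, 2), (10, 5), (15, 8), (11, 10), (9, 11)]


def combined_record(records):
    """Sum a list of (wins, losses) pairs and return wins, losses, win pct."""
    wins = sum(won for won, _ in records)
    losses = sum(lost for _, lost in records)
    return wins, losses, wins / (wins + losses)


for label, records in (("Higher halves", top_halves), ("Lower halves", low_halves)):
    wins, losses, pct = combined_record(records)
    print(f"{label}: {wins}-{losses} ({pct:.0%})")
```

Run as written, this prints 45-29 (61%) for the higher halves and 53-38 (58%) for the lower halves.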

The data isn't as cut and dried in the CFL as it is in the NFL (where the lower half of each group clearly has a better winning percentage), but the numbers are extremely close.  Close enough to suggest that a team's record in close games may not have much to do with playoff success, either positively or negatively.

Big wins and big losses:

Perhaps then, big wins and losses are a better indicator of playoff success than close wins?
  • 5 teams had 11 or more big wins in a season.  Their playoff record was 6-3 (67%), with 2 Grey Cup wins.
  • 15 teams had 10 or more big wins.  Their record in the playoffs was 22-8 (73%) with 7 Grey Cup wins and 3 Grey Cup losses, meaning 10 of those 15 teams made the Grey Cup.

Conclusion

It seems clear that close wins do not equal playoff success.  Of those 8 teams with 8 or more close wins, only the '95 Stallions had a successful playoff run - they went 3-0 and won the Grey Cup.  The remaining 7 only appeared in 1 Grey Cup, with no wins.

However, while "big" wins do appear to correlate better with Grey Cup wins than close wins, they don't appear to correlate any better than straight up wins and losses.  I think I will explore this in better detail in a later post, but I believe this comes down to the CFL having less scheduling variance than the NFL.

For now, I will continue to include the "big win" stat on my rankings table, as I think it is interesting, but I suspect that more analysis will show that it simply lines up quite closely with overall win-loss records, and doesn't give us much useful information.  In fact, it may be more beneficial to include "close wins" instead, as an indicator of teams which may do poorly in the playoffs, vs using big wins to indicate those which will do well.

Monday, September 23, 2013

Py Rankings Week 13

The Riders keep losing, the Argos keep winning, and the rankings stay the same for the second consecutive week.

Saskatchewan, despite a 3rd straight loss, clings to a small lead in the stats, continuing to be the highest scoring offense and stingiest defense.

Luckiest Team: Tie - Calgary and BC (+1.5 wins)
Unluckiest Team: Edmonton (-2.2 wins)

Biggest Jump: Edmonton (+0.4 projected wins)
Biggest Drop: Calgary (-0.4 projected wins)


Thursday, September 19, 2013

Py Rankings Week 12

The Riders lose another, but Py Expectation still thinks they are the best team in the league, 0.4 wins ahead of Calgary.  Edmonton moves up 2 spots to sixth (take heart, Eskimo fans, the numbers suggest you could be looking at a 7-8 win season by the time we're done).

Luckiest Team: Calgary (+1.9 wins)
Unluckiest Team: Edmonton (-2.5 wins)


Friday, September 13, 2013

Pythagorean Stats since 1990

Pythagorean (py) wins (what are py wins?), and Big Wins are interesting stats, but they don't tell us much on their own.  In order to put perspective on them, it's necessary to look at historical data, and see if they have useful, or even any, connection with past seasons.

Py Wins As a Prediction Model

The idea behind the pythagorean expectation formula is that points for and against provide a better indication of team quality than actual wins and losses, and that over time, teams which significantly over or underperform their expectation tend to regress or improve back to expectations.  NFL and MLB statisticians use historical data to provide perspective on what kind of regression or improvement a team is likely to show in the next season, or even half season.  I now have data dating back to the 1990 season, which I can use to gather the same data (in the future I will look at pre-1990 seasons, but I expect that as you go back in time, the changes to the game will start to hurt the accuracy of our current data):

   
Over the past 182 seasons (that's 1 season per team since 1990, including Ottawa twice and the failed American teams), you can see how many teams finished above or below expectation, and how they did in the following season.  Seasons where the team was not in the league the following year have been removed from the table and chart.  The 2012 and 2013 seasons are also not yet included, as they have no follow up season to analyse.

As you can see, the majority of seasons fall into a range quite near to expectation: 41 of 182 fall between -0.5 and 0.5, and 100 between -1 and 1.  That's pretty good; 55% of teams finish within 1 win of expectation, and only 28 times in 31 years has a team missed expectations by more than +/- 2 wins.

In the ranges where we have more data, the chart follows the line you would expect; teams which miss expectations tend to turn it around the following year, while teams which surpass them end up with a few less wins the next year.

There are of course some outliers in the data at the outer edges where we have poor sample sizes.  In 1997, Montreal finished a full 4.5 wins above expectation, winning 13 games despite a -23 point differential.  Defying the expectations, they won another 12 games in 1998, finishing another 2.7 wins up on expectations.  On the other end of the spectrum, we have the 2010 Blue Bombers, finishing 4.5 wins below expectation.  They had extraordinarily bad luck that year, winning only 4 games despite a point differential better (-21) than those '97 Als.  The next year, Winnipeg won 10 games and made it to the Grey Cup.
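
To show how a single point on that kind of chart is built, here is a short Python sketch that computes the next-season change in wins for the two outlier seasons just described.  Only those two seasons are included because they are the ones with all three numbers quoted in this post; the class and field names are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class TeamSeason:
    team: str
    year: int
    wins: int
    diff: float          # wins above (+) or below (-) Py expectation
    next_year_wins: int

    @property
    def change(self):
        """Change in wins from this season to the next."""
        return self.next_year_wins - self.wins


# Figures as quoted in the paragraph above.
seasons = [
    TeamSeason("Montreal", 1997, wins=13, diff=4.5, next_year_wins=12),
    TeamSeason("Winnipeg", 2010, wins=4, diff=-4.5, next_year_wins=10),
]

for s in seasons:
    print(f"{s.year} {s.team}: {s.diff:+.1f} vs expectation, {s.change:+d} wins the next year")
```

With a full data set, grouping seasons by `diff` and averaging `change` within each group would presumably give the trend line; with only two extreme seasons, all it shows is one team that barely regressed and one that bounced back hard.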

Neither one of these examples gives us a good idea what to expect when a team is so far above or below expectation, simply because it's so uncommon.  Were the Bombers lucky to turn it around?  Were the Als lucky to avoid regression?  I think the latter is likely the case based on the ranges where we do have more data, but no one can say for sure.

All in all, I'm comfortable with saying now that as with other sports, Pythagorean Expectation is a good way to predict future performance in the CFL.

Monday, September 9, 2013

Advanced CFL Stats - Week 11

The week is over, so it's time for more stats.

This week the Riders got a little less lucky, the Bombers got a win in the new stadium, and the Eskimos just can't buy a break.


I streamlined the chart a bit this week and it's presented in a slightly different format, as my stats are now in a database instead of a spreadsheet, so I can store more and do cooler things, like:

Big Win Percentage.

Big Win Percentage is a simple stat, created by Jim Glass. It's based on the premise that football by nature is a game that can be heavily influenced by luck. A bad call, a fumble recovery, a gust of wind; these are all things which can turn a close game into a win or a loss. According to Brian Burke (the guru of NFL stats), the outcome of more than 40% of NFL games is determined by random chance. This makes judging a team by its record a difficult proposition (especially in the NFL, where teams don't play every team in the league).

What Mr. Glass's formula does is try to account for that luck by giving teams credit for "Big Wins", defined as a game decided by 9 or more points. 9 points makes a good cut off because it is the border between 1 and 2 possession games.

The formula is simple - games won by 9+ points count as a "Big Win", games lost by 9+ points are considered a "Big Loss", and all the rest are considered ties. If you read the article linked above, you'll see that he's found that teams with a high number of "Big Wins" in a season tend to fare much better in the playoffs. We'll see if that holds true for the CFL (I'm compiling data back to 1990 for a post later this week), but in the meantime, I'm going to include it on the chart for this week.
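
The classification rule is easy to express in code.  The Python sketch below is an illustration rather than Mr. Glass's exact method: the function names and sample margins are made up, and counting each "tie" as half a win when turning the tally into a percentage is an assumption borrowed from the usual winning percentage convention.

```python
BIG_MARGIN = 9  # a game decided by 9+ points counts as "big"


def big_win_record(margins):
    """Tally (big wins, big losses, ties) from a list of point margins.

    Each margin is points for minus points against in one game.
    """
    big_wins = sum(1 for m in margins if m >= BIG_MARGIN)
    big_losses = sum(1 for m in margins if m <= -BIG_MARGIN)
    ties = len(margins) - big_wins - big_losses  # everything decided by 8 or fewer
    return big_wins, big_losses, ties


def big_win_pct(margins):
    """Big Win Percentage, counting each tie as half a win (an assumption)."""
    big_wins, _, ties = big_win_record(margins)
    return (big_wins + 0.5 * ties) / len(margins)


# Hypothetical margins for a five-game stretch.
sample = [14, 3, -10, 9, -1]
print(big_win_record(sample))  # (2, 1, 2)
print(big_win_pct(sample))     # 0.6
```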
 

Py W = Pythagorean Wins, Projected = Py Wins over 18 games

The Riders remain the best team in the league based on Py Expectation, but they are no longer considered the luckiest team in the league; that honour now goes to Calgary.  Edmonton remains the unluckiest team so far, nearly 3 wins below expectation.  Winnipeg, despite a win over the Riders this week, still sits at the bottom, though they are still considered unlucky by the formula.

Coming soon...

As noted above, I've been collecting data, back to 1990 so far.  I plan to do a post to highlight some of the interesting points once I have a bit more information gathered.

- Mike

Friday, September 6, 2013

CFL Pythagorean Wins


I'm a big believer in statistics and analysis when it comes to sports.  As noted by some on /r/cfl previously, there is a significant lack of advanced stats for the CFL.  I'm not a statistician, nor do I have charting stats for each and every game like the NFL stats sites, so there are definite limits on what I can provide, but one stat I can calculate easily is Pythagorean Wins.

Bill James created the formula for baseball years ago, and it's been modified to better suit the NFL since then.  Obviously the CFL is not the NFL, but the season is of similar length and scoring numbers are also in the same ballpark, so I believe the stat should apply fairly well to our league.  Down the line I will look at some past seasons and see if I can determine how well (or poorly) it actually does work.

The formula itself is based on the idea that not all wins are created equal, and that point differential is actually a better indicator of future winning percentage than actual wins and losses.  When applied to NFL games, the stat is a good indicator of future performance, both for future seasons, and second halves of the same season.

For a more detailed explanation from someone much smarter than I, check out Bill Barnwell's explanation on grantland.com.
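
For readers who want to see the arithmetic, here is a minimal Python sketch of a Pythagorean expectation calculation.  The exponent of 2.37 is an assumption (it is the value commonly quoted for the NFL version) and not necessarily the one behind the tables here; the function name and the sample point totals are made up for illustration.

```python
def pythagorean_win_pct(points_for, points_against, exponent=2.37):
    """Expected winning percentage from points scored and points allowed.

    exponent=2.37 is an assumed value (commonly quoted for the NFL);
    it is not confirmed as the exponent used for these rankings.
    """
    pf = points_for ** exponent
    pa = points_against ** exponent
    return pf / (pf + pa)


# Hypothetical point totals, for illustration only.
even_team = pythagorean_win_pct(400, 400)    # scores exactly as much as it allows
strong_team = pythagorean_win_pct(500, 400)  # outscores its opponents

print(f"{even_team:.3f}")
print(f"{strong_team:.3f}")
```

A team that scores exactly as many points as it allows comes out at .500 no matter what exponent is used; the exponent only controls how quickly a point differential turns into expected wins.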

With all of that said, we are at the half way point of the CFL season, so this is a perfect time to run the numbers on the first half and see what they might tell us.

Legend
P-W%: Pythagorean Winning Percentage, P-W: Pythagorean Wins, P W-L: Pythagorean Win-Loss,
Diff: Difference between Py Wins and Actual wins, P-W-T: Pythagorean Win Total (projected over 18 games)
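
The derived columns in the legend are simple arithmetic once the Pythagorean winning percentage is known.  A rough Python sketch, assuming Diff is actual wins minus Py wins (so a positive number means a lucky team, which matches how the "wins over expectation" numbers are quoted); the function name and the sample team are hypothetical.

```python
def legend_columns(py_win_pct, wins, games_played, season_games=18):
    """Compute P-W, Diff and P-W-T from a Pythagorean winning percentage."""
    py_wins = py_win_pct * games_played       # P-W: expected wins so far
    diff = wins - py_wins                     # Diff: positive = above expectation (assumed sign)
    py_win_total = py_win_pct * season_games  # P-W-T: same percentage over 18 games
    return py_wins, diff, py_win_total


# Hypothetical team: 6-3 with a 0.600 Pythagorean winning percentage.
py_wins, diff, total = legend_columns(0.600, wins=6, games_played=9)
print(f"P-W {py_wins:.1f}, Diff {diff:+.1f}, P-W-T {total:.1f}")
```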


By the numbers, Saskatchewan and BC are the luckiest teams of the first half, while Edmonton and Winnipeg are the unluckiest.  Even though the Riders have been the luckiest team, the formula still believes that they are the best team in the league, while Edmonton has been particularly unlucky, performing almost 2.5 wins below expectation.

Teams which over or under perform the formula by a wide margin tend to fall back or climb closer to their expected win total as the season progresses, so according to Pythagoras, both Edmonton and Winnipeg fans should have some hope that their team will rebound slightly in the second half.  That said, there aren't many surprises here, other than some shuffling in the middle.  The formula believes that Toronto is slightly better than BC (but clearly isn't aware that Ricky Ray is injured), and that Montreal is slightly worse than Hamilton.