Kameron Michaels isn't quite sure what's happening below, but she's certainly a little revolted at the sight.
Kameron Michaels isn't quite sure what's happening below, but she's certainly a little revolted at the sight.

Last updated: May 31, 2019

My wife got me hooked into watching RuPaul’s Drag Race with her. The long and short of it is I didn’t need to have my arm twisted to watch or enjoy the show. Indeed, drag nights at Icon in Tuscaloosa—incidentally home to the performer who gave Trinity her “Tuck” moniker—are some of my fondest memories of graduate school. A performer who routinely served as opening act for her—and sadly passed away last year—was among the sweetest/kindest people I knew in town.1 I had a receptiveness to drag as concept and creative outlet, but oddly did not know about the show until my wife told me about it.

My wife also informed me of the debates that fans have about the show, some of which can be quite heated. Several of these—at least the ones that aren’t completely subjective or involve questions that quickly devolve to total grease fires—involve concerns of measurement and ranking. Certainly, the show has evolved from Season 1 to Season 10, but how would a fan rank, say, Jinkx Monsoon’s run to the crown in Season 5 versus Bianca Del Rio in Season 6? Or Bob the Drag Queen in Season 8?

This creates a lot of conversation on the subreddit for the show and spawns numerous videos where fans can upvote or downvote a particular performer or carry on these debates. I hear it as an opportunity to scrape the wiki for the show and create a few data sets to see if I can measure these things myself. Toward that end, I’m creating a package in R (dragracer) that will contain various data sets for this. Mostly, I just wanted to call dibs on having an R package for the show named dragracer.

About the Data/Methods

The data package is a work in progress and leans heavily on scraping the wiki, especially the contestant-level information that the wiki turns into pyramidal tables. I have, so far, three data sets in the package. rpdr_contep is contestant-episode-level data about how contestants fared, if they appeared, in episodes as the season progressed. rpdr_contestants is contestant-level data that includes some background information on the contestants along with metrics of their performances through the season. rpdr_ep contains some episode-level data that would have been difficult or tedious to build into the contestant-episode-level data. These are data points like the mini-challenge winners, the runway theme (where applicable), and some information about the lip-sync.

There are some additional caveats.

  • I interpret “HIGH TEAM” codings in the wiki as cases where the contestant was on a winning team for a main challenge, but the contestant herself is safe.
  • “Eliminations” are broadly understood as cases where a participant was removed from the show. As a result, there is an outcome variable that codes whether a contestant was a winner, scored high, was safe, scored low, or was in the bottom two for the main challenge and runway and a corollary eliminated variable if the contestant was removed from the show. The overlap here is substantial, obviously. However, there are a few cases where a contestant was removed despite winning a challenge (Willam, Season 4) or scoring safe (Eureka O’Hara, Season 9) for some other reason.
  • The minichalw column in the contestant-episode data is mostly uninformative. I should delete it or better sync the episode-level data with it.
  • Finales are sui generis and users should not read too much into them. They also evolve substantially over the show’s 11 seasons. For example, the first three seasons had finales where the finale itself effectively had a bottom and, practically, an elimination. Seasons 4-8 had top threes (later top fours) where there was no “bottom” or elimination, just a crowning. Season 9 started the newer trend of having a top four and semifinal competition that leads to a lip-sync for the crown. Users who want to compare performers should subset out finales.
  • Related: Seasons 6 through 8 have a penultimate column. These were episodes in which RuPaul whittled the top four to a top 3. They’re unique episodes in that no one was a bottom, but everyone lip-synced. This created three winners and one bottom/elimination. I code those episodes as such but encourage users to subset those out when comparing contestant performances. However, I do not do this Season 11. Therein, there was no winner, but a clear bottom two in the episode that sent Vanessa Vanjie Mateo home and put Brooke Lynn Hytes as the fourth semifinalist.
  • The episode-level data has a numqueens column that codes the number of participants who were active in the episode. There were several cases where RuPaul reintroduced a performer—or, in the case of Season 7, an entire damn cast—at the start of the episode. In more than a few cases, those reintroduced were eliminated the exact same episode.
  • Related to that: I code that episode in Season 7 as a case where Pearl and Trixie Mattel were joint winners.
  • The finales for Season 9 and Season 10 are cases where the runner-up(s) (e.g. Peppermint, Season 9) scored high and those who lost the semifinals (e.g. Trinity Taylor, Season 9) were bottoms for that episode. Again, these are finales so the user should subset them out when comparing queens across different seasons.

An Empirical Analysis of RuPaul’s Drag Race Performances

The data are far from perfect or complete, but they’re workable for an empirical analysis of contestant performance for the 11 main seasons of RuPaul’s Drag Race. Here, I’ll explore some of the best performers in the history of the show with some rudimentary metrics. Namely, which contestants had the highest percentage of competitive (i.e. non-special, but non-finale) episodes in which they won, or scored high/won? Which queens scored the highest on the “Dusted or Busted” scoring system? The data aren’t too hard to generate and require a few lines in tidyverse. The source code, along with version history, is available on the _source directory for my website.

Comparing the Season Winners

This will facilitate some kind of comparison, say, among the season-winners by reference to thier performance in main challenges/runways through the course of a season. I choose to focus on the percentage of wins, bottoms, etc. because the number of competitive episodes varies considerably through the 11 seasons. For example, Bebe Zahara Benet had just six episodes before the finale in which she could’ve been eliminated. Raja, Season’s 3 winner, had the most (12). Seasons since then have had 10 or 11 competitive episodes, excluding the eight competitive episodes in Season 8.

Ranking the RuPaul's Drag Race Winners from Season 1 to Season 11
season contestant % Wins % Wins/Highs % Bottom % Bottom/Low D/B Score
S01 BeBe Zahara Benet 33.33% 50% 16.67% 16.67% 3
S02 Tyra Sanchez 33.33% 55.56% 0% 11.11% 7
S03 Raja 25% 58.33% 8.33% 16.67% 7
S04 Sharon Needles 36.36% 54.55% 9.09% 18.18% 7
S05 Jinkx Monsoon 18.18% 81.82% 9.09% 9.09% 9
S06 Bianca Del Rio 30% 70% 0% 0% 10
S07 Violet Chachki 27.27% 45.45% 0% 18.18% 6
S08 Bob the Drag Queen 37.5% 37.5% 12.5% 12.5% 4
S09 Sasha Velour 18.18% 63.64% 0% 9.09% 8
S10 Aquaria 27.27% 36.36% 0% 18.18% 5
S11 Yvie Oddly 7.69% 30.77% 7.69% 15.38% 2

The table won’t reveal too much that stands at odds with what long-running fans of the show already know. Sharon Needles had the most main challenge wins of any season-winner and, in my interpretation of the data, is tied with Shea Couleé (a Season 9 semifinalist) for most main challenge wins of any performer. Fans will qualify that Sharon won her challenges outright while Shea shared two of her wins with Sasha Velour. This may matter to the allocation of prize money but won’t matter in the data as I have it.

Perhaps the strongest contestants are in the middle two seasons. Jinkx Monsoon and Bianca Del Rio effectively dominated their seasons. Jinkx stands out in particular for winning or placing high in nine of her 11 competitive episodes. A “safe” in the season-opener and bottom-two performance in the last competitive episode of the season before the finale sandwiched a run of excellence in which she won or scored high. This is particularly impressive because, if you want to treat the show as something other than reality TV in which winners are determined by RuPaul’s discretion, her season might have been the most competitive. Roxxxy Andrews, Alaska, Detox, and Alyssa Edwards are legendary names as drag performers.

Likewise, Bianca Del Rio has the distinction of the first and, to date, only queen to never land in the bottom two or even get a “low” rating for a given episode. She is not alone as a winner to have never lip-synced for her life, but she is unique for not even ranking low. That effectively explains the one-point differential favoring Bianca Del Rio over Jinkx Monsoon in the “Dusted or Busted” scoring system.

Bob the Drag Queen and Aquaria stand out for scoring eye-openingly low in the “Dusted or Busted” scoring system. Both are, from what I’ve gathered, well-respected among fans of the show, and I think their low scores have some intuitive explanations. In Bob’s case, she had just eight competitive episodes in what amounts to a conspicuously abbreviated season. That she has 12% of her appearances in the bottom means she was in the bottom just once. Fans will remember that episode well. She lip-synced against Derrick Barry to the tune of one of my all-time favorite jams: Sylvester’s “You Make Me Feel (Mighty Real).” That contest was not close.

In Aquaria’s case, her season might have been the most competitive top to bottom in the show’s history. Blair St. Clair (9th place) and Mayhem Miller (10th place) could probably crack top five each in most other seasons. Vanessa Vanjie Mateo, 14th place in Season 10, may even make a long run in Season 11 (ed, May 31/2019: she did).

The conclusion of Season 11 and addition of Yvie Oddly to this list of winners means the current champion has surpassed BeBe Zahara Benet for lowest “Dusted or Busted” score to ever win a season. She has the fewest wins (1, and a shared one at that) and the fewest number of wins/highs (3) of any eventual champion. I can qualify that the episode prior to the reunion was one in which Yvie would’ve won if Ru had announced a winner. She had the best runway look (in a walk) and Ru named her first as a semifinalist. Alas, that episode is coded as a safe. Had Ru announced her as a winner that episode, her “Dusted or Busted” score improves to 4.

Who Were the Lowest-Ranked Performers to Land in the Top Four?

The data can also be used to identify the lowest-scoring performers to get a top four finish in the show’s 10 seasons. There are several metrics available for ranking these performers, but I’ll go in descending order, by “Dusted or Busted” score.

The Lowest Ranked Top Four Performers from Season 1 to Season 11
season rank contestant % Wins % Wins/Highs % Bottom % Bottom/Low D/B Score
S06 4 Darienne Lake 10% 20% 30% 60% -6
S02 3 Jujubee 0% 22.22% 33.33% 44.44% -5
S01 4 Shannel 0% 16.67% 33.33% 50% -4
S05 4 Detox 9.09% 18.18% 27.27% 36.36% -4
S10 2 Kameron Michaels 9.09% 27.27% 27.27% 36.36% -3
S01 3 Rebecca Glasscock 16.67% 33.33% 33.33% 50% -2
S02 4 Tatianna 11.11% 22.22% 22.22% 33.33% -2
S08 4 Chi Chi DeVayne 12.5% 25% 25% 37.5% -2
S04 4 Latrice Royale 18.18% 36.36% 27.27% 36.36% -1
S03 3 Alexis Mateo 25% 41.67% 25% 41.67% 0

A couple of names stand out. Darienne Lake has the lowest “Dusted or Busted” score of any contestant with a top four finish. In fact, she finished bottom or low in six of her 10 competitive episodes. The distance between her and the next lowest scoring top four performer by “Dusted or Busted” score (Courtney Act) is seven points.

Several names will be familiar and unsurprising entries on this list, thinking especially of Shannel, Jujubee, and Tatianna. Jujubee emerged as the queen of the read and the lip-sync assassin of Season 2. She was in the bottom two three times and sent home Sahara Davenport, Pandora Boxx, and Tatianna. No matter, bottom two performances are still penalized heavily in this scoring system. Tatianna, likewise, is in the bottom 10 in this metric. From my outside reading of how fans interpret the show, Tatianna was perhaps raw in Season 2—apparently only 20 years old at the time of filming. She was excellent in Season 2 of All-Stars, though. It is interesting to see two of Season 2’s top four in this list, though.

Kameron Michaels is the only No. 2 finisher to appear in this top ten. Fans of the show will note she mostly played it safe but, as the competition whittled down, otherwise “safe” performances put her in the bottom two. The interesting anecdote to emerge from that, however, is she appears to have the most consecutive “shantays” in lip-sync battles. In four-straight episodes, she shantayed in lip-syncs against Eureka O’Hara (a double shantay), Monét X Change, Miz Cracker, and Asia O’Hara. Do note that lip-syncs are selection effects. Kameron’s final lip-sync in that run was also a semifinal. Further, RuPaul is generally unforgiving of multiple, consecutive appearances in the bottom two (see: Alyssa Edwards [Season 5], Akashia [Season 1]). However, that four-episode run for Kameron Michaels is an all-time best and likely won’t be matched, given the nature of the show and the peculiar string of those lip-syncs.

Detox’s entry into this list is one of those head-scratchers that makes sense upon further review. Detox can match high-concept looks with great comedy, but that and her top three performance in All-Stars 2 belie that her performance in Season 5 included only one win and four appearances in the “low” or “bottom” categories.

Which Contestants Were “Robbed?” Their Season

I added this table after Season 11 ended because the selection of the champion is arguably a bit controversial. Not only does Yvie have the lowest “Dusted or Busted” score of any champion in a given season, but the queen she beat in the finale, Brooke Lynn Hytes, had a “Dusted or Busted” score a full three points higher. In fact, a latent variable analysis that I provide below will contend Brooke Lynne Hytes had the eighth best season of any performer in the show’s history.

This would not be the first time a queen was “robbed” from winning her particular season (at least, if that’s how you want to spin it). Long-time fans of the show probably know Ru started the first season with the idea of crowning BeBe Zahara Benet at the end, forgetting along the way that it meant Nina Flowers had a better season and finished second. The most prominent case of this might have been Shea Couleé, who entered the semifinals as the top performer in that season and was bulldozed in the semifinal by arguably the best individual lip sync in the show’s history. Sasha Velour deserved to advance on the merits of the lip sync but it did mean Shea Couleé, who had arguably the third best run in the show’s history to that point, did not snatch the crown.

There are four such cases I show below where an individual performer had a higher “Dusted or Busted” score than the eventual winner that season. All four have unique situations. Nina finished second despite having the overall better record than Bebe Zahara Benet, but Ongina was woefully underplaced in her season. She too had a better “Dusted or Busted” score than the eventual champion, but finished in fifth place. Shea Couleé had arguably the best season run in the show’s history but was eliminated in the semifinals. More recently, Brooke Lynn Hytes can now boast the biggest point differential in show history over an eventual champion. Her three-point cushion over the eventual season champion may not be topped.

Queens Whose D/B Scores Were Greater Than the Season Champion
Season Contestant Rank Number of Wins Number of Wins for Champion D/B Score Champions's D/B Score
S01 Nina Flowers 2 1 2 4 3
S01 Ongina 5 2 2 4 3
S09 Shea Couleé 3 4 2 9 8
S11 Brooke Lynn Hytes 2 3 1 5 2

One other metric for classifying “robbed” queens is to locate those queens who 1) finished outside the top four in a season but 2) had a higher “Dusted or Busted” score than the lowest-ranked performer to land in the top four. There are over 43 such instances in the data, but much of that is in Season 6 because Darienne Lake had such a low score by the end of the season that even contestants like Gia Gunn and Laganja Estranja had comfortably outperformed her. The table below subsets that list to just the cases of performers who had the highest such difference in a given season and arranges them by largest difference.

Select Queens Outside Top Four Who Finished With Higher ''Dusted or Busted'' Scores Than Lowest Ranked Top Four Performer
Season Contestant Rank D/B Score Lowest D/B Score in Top 4 Lowest Ranked Top 4 Queen Difference
S06 BenDeLaCreme 5 3 -6 Darienne Lake 9
S01 Ongina 5 4 -4 Shannel 8
S02 Jessica Wild 6 2 -5 Jujubee 7
S04 Willam 7 3 -1 Latrice Royale 4
S05 Ivy Winters 7 0 -4 Detox 4
S05 Lineysha Sparx 9 0 -4 Detox 4
S07 Katya 5 4 0 Pearl 4
S10 Miz Cracker 5 1 -3 Kameron Michaels 4
S08 Thorgy Thor 6 1 -2 Chi Chi DeVayne 3
S08 Acid Betty 8 1 -2 Chi Chi DeVayne 3
S09 Valentina 7 2 1 Peppermint 1
S09 Eureka O'Hara 11 2 1 Peppermint 1
S11 Plastique Tiara 8 2 1 Silky Nutmeg Ganache 1

There should be some care in interpreting this table. I don’t agree with all of it (e.g. I’m on the record as being a big Kameron Michaels fan). Also consider that Eureka O’Hara and Lineysha Sparx had fewer episodes on which they could’ve accrued “Dusted or Busted” score deductions and these metrics are largely driven by the number of contests in which a queen appears. Yet, this table offers substantive support to the contention that Darienne Lake advancing over BenDeLaCreme in Season 6 was almost unforgivable. Further, Ongina was woefully underplaced in Season 1. Also, Acid Betty should’ve been on more episodes.

A Simple Item Response Model for Ranking All Contestants

Finally, I decided to run a simple graded response model on the data for all contestants. The graded response model includes the number of performances in which the contestant won, won or scored high, or was in the bottom.2 I also include the contestant’s final rank for the season and the “Dusted or Busted” score. Conceptually, these polytomous measures of a contestant’s performances are judges of a contestant’s ability based on discrete scale. This is obviously rudimentary, and someone more versed in IRT modeling will balk at treating the “Dusted or Busted” score in this way, but the model ran and I’m not submitting it for peer review and I don’t have time to invest in a more comprehensive model. Your critiques can sashay away.

The top 25 participants on this metric are listed below, with accompanying standard errors around the latent estimate.

plot of chunk top-rupauls-drag-race-performers

Fans of the show are free to read into this what they want. I’m not too invested in what this demonstrates, but a few things are worth mentioning as quick hitters.

She went home.
She went home.

The model wants to downweight Bob the Drag Queen for being on an abbreviated season. She’s not the lowest-ranked winner on this list; in fact: Bebe Zahara Benet is not even on it. I don’t think fans of the show hold Bob in low regard. Her snatch game, always an important metric, is among the best. However, metrics for evaluating contestants focus on raw counts (e.g. wins, highs, etc). Toward that end, being on an abbreviated season hurts Bob’s rankings.

BenDeLaCreme and Katya are the only contestants without top four finishes on this list. This is a case where a model might tell you something that’s intuitively true, but probably not implied by what you thought from the inputs. In other words: these two are great. Katya could’ve won All-Stars 2 and the only reason BenDeLaCreme didn’t win All-Stars 3 is because she chose not to win it.

Conclusion: As 🔥 #Takes

I don’t have much to conclude, but I do have a few #takes that I’ll offer as heaters here.

  • The scorecard is pretty clear Tyra Sanchez deserved to win Season 2, but I bet RuPaul wishes she could go back and do that one again.
  • Kameron Michaels should’ve won Season 10. Hey, I didn’t say these takes were good, just that they were 🔥.
  • Sasha Velour needs a street named after her in Urbana. Hell, next-door Champaign had a street named after REO Speedwagon when I was living in that city. Why not Sasha Velour for Urbana?
  • Acid Betty needed more episodes on Season 8. Her three high performances in the first three episodes push her up a lot of rankings for someone who otherwise landed No. 8 in a season.
  • I would’ve put Aquaria as a bottom instead of Monét in episode 10 of Season 10. That would’ve put Aquaria, the future winner of the season, in front of the grim reaper of lip-syncs (Kameron Michaels). And, yeah, there’s no way RuPaul would’ve sent Aquaria home instead of Kameron Michaels, but the thought experiment is fun.
  • I feel most scoring metrics for RuPaul’s Drag Race should include a category that awards 10 points to a contestant if she is able to check the box of “is Alyssa Edwards.” I think that’s appropriate.
  • I am compelled to support Nina West in Season 11 because she is from Columbus. I will disregard any critiques of her as wrong as a result.

Addendum: Season 11 🔥 #Takes

Btw, nailed it. This was from February.

  • Jaymes Mansfield might appreciate this reference, but Silky for me was the first performer in the show’s history to have what pro wrestling fans call “X-Pac Heat.” There have been other villains in the show’s history you want to see vanquished or defeated. Phi’Phi O’Hara (Season 4), Roxxxy Andrews (Season 5), and The Vixen (Season 10) come to mind here as performers who took on a villain role but were still enjoyable to watch the extent to which a viewer wanted to see some measure of comeuppance. However, like the canonical case of X-Pac, I didn’t want to see Silky get her just deserts. I just wanted her off my television so I could enjoy the other performers.
  • I think Chris says it best here. Nina was great on Season 11 because her character on the show augmented other performers around her (prominently Brooke in the L.A.D.P. challenge). Silky just sucked up and rerouted the energy in the room in a way that doesn’t make for good television and isn’t compelling to watch.
  • Not the first to make this observation, but this would be the first winner who lip-synced after Snatch Game. It was going to be the case no matter if Ru had gone with the Brooke edit or the Yvie edit.
  • I’m hoping Yvie’s win offers some pause for future performers who think the path through the finale is to make compound reveals. Yvie’s final two lip syncs were light on reveals (“Free Britney” in the first and the backward head reveal in the second). Contrast that with the compound reveals that Silky had in the lip sync that made no sense and were poorly timed. One tells a story through the song. The other is mindless theatrics. One works better than the other.
  • That said, Brooke’s final lip sync reveal was hilarious. I love a good fourth wall break.
  • Both of Silky’s wins were cases where I thought it was clear Nina had the better main challenge performance and runway look. Nina should’ve received those two wins instead. That would put Nina West at four wins on the season and would have made it almost impossible to deny her the crown. My apologies to Shea Couleé.
  • Also, political scientist speaking, Silky’s explanation for why she registers as a Republican was the dumbest thing I’ve heard an educated person say on that show. The rationale behind what she said was basically nails on a chalkboard as I heard it.
  • I’m still a bit apoplectic that none of the judges understood or got Nina’s Ohio flag dress.
  • Ra’Jah O’Hara has the distinction of being tied with Carmen Carrera for the worst ever D/B score in the show’s history (-8).
  • I thought this season took a while to get going. With 15 queens in the mix, the space seemed kind of crowded and it took about four or five episodes to figure out who some of these queens were and identify with them and their craft.
  1. The University of Alabama’s Center for Ethics and Social Responsibility produced a video about her experience that’s worth watching. 

  2. I had also included the number of performances in the bottom or with a low mark. This factor didn’t load well and the model improved for its absence.