I’m not putting any serious spoilers or cheats below but there’s a fold if you would rather remain ignorant of some details or if you are just heartily sick of people talking about the daily word game that is clearly an alien plot or a CIA scheme to make us think about abstract objects.
A friend discussing Wordle (the game where you use clues to guess a 5 letter word https://powerlanguage.co.uk/wordle/ ) stated that I had probably made a big spreadsheet already. I hadn’t…but I’d thought about it and that was sufficient motivation to look at some letter frequencies.
I thought I’d have to approximate the underlying word list but it is actually quite easy to get hold of. You can probably Google it now but it’s not a great help to have it. There are 2315 words in the list from aback to zonal (or, as my spreadsheet insisted from aback to FALSE).
Not every letter in the alphabet is equally likely to appear. This table shows the top ten letters by frequency.
Letter | Total | Percent |
e | 1233 | 11% |
a | 979 | 8% |
r | 899 | 8% |
o | 754 | 7% |
t | 729 | 6% |
l | 719 | 6% |
i | 671 | 6% |
s | 669 | 6% |
n | 575 | 5% |
c | 477 | 4% |
That’s fairly typical of English. E is the most common but it is a power-law distribution with a lot of low-frequency letters. That means that even though E is the most common, it is much less common than NOT-E.
E’s dominance isn’t consistent in every position. If we just look at the first letter of words, E is only the 14th most common letter. In position 1, the letter S is a lot more common.
Letter | Position 1 |
s | 366 |
c | 198 |
b | 173 |
t | 149 |
p | 142 |
a | 141 |
f | 136 |
g | 115 |
d | 111 |
m | 107 |
In positions 2 and 3, the letter A is more common and E’s strength really relies on positions 4 and 5. E is first when it is last!
So the perfect first guess would be SAAEE…except astute users of English have spotted that SAAEE isn’t a word. A nice feature of the puzzle is that you can only guess words in the word list. Without that rule, the game would be more of a logic puzzle.
So what are the most Wordly words then if SAAEE isn’t a valid English word? To make a guess at that, for each letter in each position in each word, I tabulated the frequency and then added those frequencies together to get an overall score. Here are the top 15
Word | Total Score |
slate | 1437 |
sauce | 1411 |
slice | 1409 |
shale | 1403 |
saute | 1398 |
share | 1393 |
sooty | 1392 |
shine | 1382 |
suite | 1381 |
crane | 1378 |
saint | 1371 |
soapy | 1366 |
shone | 1360 |
shire | 1352 |
saucy | 1351 |
The bottom 15 are a lot more fun.
omega | 396 |
abyss | 393 |
offal | 393 |
ennui | 391 |
endow | 383 |
hydro | 377 |
lymph | 361 |
jumbo | 349 |
igloo | 348 |
ethic | 347 |
unzip | 345 |
umbra | 344 |
affix | 340 |
ethos | 326 |
inbox | 318 |
nymph | 310 |
And a stats post wouldn’t be complete with a pointless graph. So here is a graph of all those scores with a few example words on the horizontal/category axis.

35 responses to “Some Wordle Stats”
Where did you get the list of 2,315 words?
LikeLike
I’ll rot13 in case people don’t want to know
(V qvqa’g qvfpbire guvf, vg’f orra qvfphffrq bayvar) Vs lbh ybbx ng gur jro cntr fbhepr pbqr gurer’f n yvax gb n wninfpevcg svyr gung unf gur pbqr sbe gur tnzr. Gur jbeq yvfg vf va gur tnzr nf n inevnoyr pnyyrq “Gn”.
LikeLiked by 1 person
Wordle is such a simple yet elegant game and kudos to Josh Wardle for putting everything in the clear, which has made it trivial for other people to replace the dictionaries with their own & spawn their own versions.
If you want a direct link to the script containing the list of words (which you can view with any text reader) it’s here (ROT13’d):
uggcf://cbjreynathntr.pb.hx/jbeqyr/znva.r65pr0n5.wf
The solution list has ~2300 words, but Wordle accepts a longer list of ~12,000 as guesses. Interestingly, although it is hosted in a uk domain, Wordle uses American English (Wardle originally created the game for his partner who AFAIK is American). That has caused some grief amongst non-American English spellers. Another factoid: the list of words allowable as guesses contain words in both American & UK English. So I can see why it caused confusion.
Interestingly, the coloured grids used to share results in a non-spoilerly way, weren’t in the earlier public version. It was created by manually typing emoji by a Twitter user from New Zealand Elizabeth S. but was such a success, Wardle incorporated it, and that helped it go viral.
The New Zealand connection seems to have helped in another way too. Turns out New Zealand Twitter is more interconnected than ‘rest of the world’ Twitter, so something that takes off gets propagated more quickly (Twitter was where I first saw mentions of Wordle). https://www.rnz.co.nz/national/programmes/the-weekend/audio/2018826546/josh-wardle-the-power-of-wordle
LikeLike
//but Wordle accepts a longer list of ~12,000 as guesses//
Ah! That makes sense.
LikeLike
Oooooh, Wordle geekery!
So, I think this is good as an analysis of letter-frequency, but my personal experience with the game has led me to some kind of counterintuitive conclusions. And a big one is: nailing down a few super-common letters actually _isn’t_ the most helpful thing early on in the game.
For example, it doesn’t _hurt_ to know that the fifth letter is ‘E’, but that doesn’t actually give me a ton to go on for my next guesses. Finding an “unusual”, slightly less-common letter, or a common letter but in a surprising position, can be a much stronger hint. (This is extra true in Hard Mode, where getting too much right early on kind of locks you in, and limits how many new letters you can guess/eliminate per guess.)
As a very handy example, today my first guess was JUICY, and it was *super* helpful! I got the word in three moves.
(That said, my second guess was STARE, which will rank very highly in your score, and that was ALSO super-helpful…)
LikeLike
(started with JUICY because I wanted to either confirm or eliminate U, I, and Y, which are specifically the *trickier* vowels, and also harder ones to work into guesses once I’ve got more information and am trying to figure out more specific things)
LikeLiked by 1 person
nice
LikeLike
Inspired by that I’ve tried ADIEU as the first guess. It seems to work, but subjectively it’s not an improvement on STALE (revised from STARE, so avoid overlap with MINOR as a second guess.)
LikeLike
I’m very naughty. I’ve come up with 4 words which encompass 19 letters, so I always start with those. Most of the time I can get the right word after the first 3 guesses, a couple of times I’ve figured it out after the first 2 guesses, and a few times it’s taken 5 tries, and a couple of times 6 tries.
I’m very non-competitive and don’t feel as though I have anything to prove, so it’s just a couple minutes of amusement for me each day.
LikeLiked by 4 people
I absolutely love it as “a few minutes of amusement each day,”
except there are *two* Hebrew clones, so now it’s up to about 15 minutes of amusement each day,
but also the Hebrew ones aren’t nearly as good because Hebrew doesn’t have nearly as many patterns as English, nor a tenth as many idiosyncrasies and odd exceptions.
LikeLiked by 1 person
Yes, the proprietor has done a good job of selecting words with odd, less-intuitive structures.
The ones with a letter that repeats (e.g. “SQUEE”) are the hardest, and usually take me 5 or 6 tries, because if you get the letter right at some point, there’s no indication that there’s a second version of it lurking in there.
LikeLiked by 1 person
A friend of mine who’s really involved in music has been playing Byrdle.
There’s also a 4-letter version called Sweardle. 😀
LikeLike
Oh nice!
There’s also the adversarial one, Absurdle,
where the “correct” word isn’t pre-ordained,
and instead, every time you make a guess, the game gives you responses in such a way as to leave as many options as possible still valid. It’s kind of brilliant, because it shows you just how you can be “funneled” into word patterns that are so common you still need a bunch of guesses to figure the word out!
LikeLike
If I had a modicum of spare time this week, I have everything I need now to make a Typordle game – exactly like wordle but all the words are slightly misspelled. This would be universally loved by everybody obviously and not remotely frustrating.
LikeLiked by 2 people
Yes indeed. This doesn’t tell you what the best first guess word should be. I think that’s more tactical. Having said that, I did a first guess “exile” once to be weird and got the “x” and nothing else and that was a nightmare.[the answer was “proxy”]
LikeLiked by 1 person
My goals in Wordle are to get the word. If I get it quickly, all the better. But I am not obsessed with getting it in as few moves as possible, just to get it if I can.
LikeLiked by 2 people
My question is: Is it possible to gave 5 yellow boxes? I.e: Are there words in the list that are anagrams of another were every letter is shifted (for example no palindroms, bc the middle letter stays the same).
LikeLike
Yes, STEAM and MEATS are one example of that.
LikeLike
Having read that S is commonest as the first letter, I shifted my initial guess from RATES to STARE.
LikeLike
Thanks, I have something to aim for now 🙂
LikeLike
I believe 4 letter words with a fifth plural s aren’t on the list.
LikeLike
I’ve seen that claim before, but I tried
PILLS
MEATS
TOMBS
WEEKS
now and they where all accepted as guesses.
LikeLike
There are two word lists – one for the target word and a bigger more generous one that validates guesses.
LikeLike
To me, the game seems too simple to be compelling. I played it eight times, won the later seven and went, not for me, back to the crossword.
LikeLike
That’s true. And, in a sense, Wordle is distilling the communal crossword experience down to a single entry – it’s the same puzzle that everyone plays just once in the same 24hr period.
But the key here isn’t anything to do with the puzzle – it’s the brilliance of the social media sharing; you aren’t sharing your specific guesses or even the answer, meaning that you aren’t spoiling anything and yet you still get to show-off (which is what social media is for, after all.)
I don’t find it particularly interesting either (I think Jotto is far more interesting, where you are simply told how many correct letters you have and nothing more), but I can understand why the social sharing took off.
LikeLiked by 2 people
There’s another version at wordlegame.org.
This one is harder, because firstly it uses a larger dictionary of allowed words, and secondly allows any entry in the dictionary to be the answer. This includes having non-words such as sokol as the answer. Another time I lost because I had 4 letters correct, but I ran out of guesses because there were too many valid alternatives for the 4th letter.
For the powerlanguage version, my results are 0/0/4/2/10/0. I’ve read that a group of physicists have taken the answer and dictionary lists and generated a strategy tree that guarantees completion in 5 guesses.
LikeLike
Y’all should try Evil Wordle.
https://swag.github.io/evil-wordle/
The person that developed it was looking for an algorithm that would maximize the number of guesses by a player.
As I understand it, you start with all of the 5-letter words available. You cannot guess the word in the first row because the algorithm doesn’t know it yet. Based on the letters you select, the algorithm identifies the group of words with the most possible permutations/letters. And you guess again using the narrowed range of letters.
If you get down to a bunch of words that use the same letters
cakes
takes
makes
wakes
lakes
then you will have to eliminate all of the options before getting the “correct” word.
I feel pretty good if I can get in 7 guesses. I did it in 5 guesses once.
It’s a bit like Mastermind, but only if the other player can change the answer based on your guesses until you have eliminated all the possibilities.
Regards,
Dann
So many books, so little time. – Frank Zappa
LikeLiked by 2 people
STRAW
CLOUD
PINKY
BOGEY
BOBBY
LikeLike
Initially, I was trying with the supposedly optimal first guesses, but have now switched to using a different first guess every day. I enjoy it more that way because it’s about the journey. I’ve also switched to playing in hard mode (where you have to use all revealed clues in subsequent guesses).
LikeLiked by 2 people
I think “new first guess everyday” may be a really good metastrategy. It gets you thinking about five letter words.
LikeLiked by 1 person
Thinking back, I just got bored with the same first guess every day. This is meant to be fun right?
LikeLiked by 2 people
Yes. I tried a letter frequency strategy today (using these tables to guide guesses) and it was just less fun to do it that way.
LikeLiked by 1 person
I have a handful I like to use, all of which give me three or four vowels to start. ADIEU and AUDIO are good ones, and READY and OPIUM make a good opening move pair.
LikeLiked by 1 person
One more word about Wordle and the puppy gets it! (Oh, wait, that won’t sound like a threat to the author of the Debarkle. Better pick another animal. Manatee? Eh, I’ll get back to you.)
LikeLiked by 4 people
Some people (including well known writers) stick to one word for their first guess.
LikeLike