Submitted by OfficialWireGrind t3_11sbphb in dataisbeautiful
Comments
AKADriver t1_jcdjxtp wrote
There's a Chinese car called the Chery QQ.
quarterthrowback t1_jcdl3f2 wrote
That must be the other 1
Lite3000 t1_jceaq8y wrote
There is a Chinese social media app called QQ that had 653 million active users as of 2019.
firewaterstone t1_jcdvfvv wrote
Awe, don't cry.
shruggedbeware t1_jceg5i1 wrote
BigDumer t1_jcd2uc9 wrote
Does each instance of the video game VVVVVV count as 5 double letters?
OfficialWireGrind OP t1_jcd7tzi wrote
Yes.
OfficialWireGrind OP t1_jccva32 wrote
The bar chart counts the occurrences of double letters in all of English Wikipedia's article text.
Data Source: English Wikipedia's April 1st, 2022 article data dump
Tools: Python, Matplotlib
sckurvee t1_jcdzwyx wrote
Wikipedia's got more llamas than accuracy.
Gr1ff1n90 t1_jce980l wrote
Omg! š This made me snort out loud reading it
solarmelange t1_jcd1o0v wrote
They need to go case sensitive, eliminating double capitals. That is clearly where ii are coming from.
DameKumquat t1_jcd4qen wrote
And skiing.
OfficialWireGrind OP t1_jcdrsql wrote
And Hawaii and Pompeii. Roman numerals do make up a lot of the ii's. Roughly, about 75% of them. A lot of the xx's too.
imlookingatarhino t1_jcdqd5v wrote
I'm gonna put so many Q's in articles now. Gotta* work on those numbers
Only-Engineering6586 t1_jce8i7t wrote
Seems right, Iāve seen a couple double ddās in my dday
Clambulance1 t1_jcdrtfi wrote
A majority of the jj must come from articles about Korean things.
[deleted] t1_jcd41gh wrote
[removed]
PrompteRaith t1_jcdvsht wrote
I would expect XX to be much higher (genetics, the band, etc)
[deleted] t1_jce987x wrote
[deleted]
[deleted] t1_jce9y2x wrote
[deleted]
[deleted] t1_jceg2ll wrote
[removed]
[deleted] t1_jcem2y3 wrote
[removed]
[deleted] t1_jcenb15 wrote
[removed]
[deleted] t1_jcf8jh6 wrote
[removed]
T-Dex_the_T-Rex t1_jcffbm2 wrote
Interestingly, in terms of words with consecutive double letters, there is only 1 word with 3 consecutive double letters and only 1 word with 4 consecutive double letters. These words are Bookkeeper and Subbookkeeper respectively.
CaptainBentham t1_jcfix2a wrote
How many of those zzās are from the ZZ Top article
[deleted] t1_jcjo8yn wrote
[removed]
ArvinaDystopia t1_jcjsaqy wrote
That's a lot of aardvarks.
[deleted] t1_jcdvc43 wrote
[deleted]
popeter45 t1_jcdpu6g wrote
So how many of the LL are just Welsh words/places?
HiveMindEmulator t1_jce130i wrote
WeLL, I'LL bet it wiLL be a reaLLy smaLL portion of aLL of them.
glidespokes t1_jcejo3e wrote
Probably not all of them because spanish exists.
[deleted] t1_jcepbx7 wrote
There's also lots of double Ls in English; such as pull, allow, traveller...
phred_666 t1_jcfdk62 wrote
ā¦Smell, tell, tall, call, ballā¦
glidespokes t1_jces3pn wrote
Same in german. There (and English too I believe?) it simply means the preceding vowel is pronounced short.
[deleted] t1_jcf02vn wrote
As with so many other 'rules' of English, there's lots of exceptions
Waxoplax t1_jce8pmx wrote
What does the k in the number stand for?
[deleted] t1_jce96xt wrote
[deleted]
Waxoplax t1_jcf6hkz wrote
Thats what I though, but how can there be 43 million words with ll when thereās roughly 200k words in the english vocabulary š¤š¤Æ
dhkendall t1_jcfanit wrote
Each occurrence. Iām sure that, for example, the word āshellā appears more than once in Wikipedia, but it only appears once in the list of words in the English vocabulary.
Waxoplax t1_jcfuvws wrote
Ohh, each occurence.. gotcha. Missed that, I though it was just the number of individual words and I was hella confused
Loosestool421 t1_jcdr5y7 wrote
Double L is the Latin influence.
quarterthrowback t1_jccxrmz wrote
The city Raqqa must acount for 8,999 of those 9k...