Friday, May 11, 2012

Frequencies of first letters in Turkish

Relative frequencies of first letters of words in Turkish.



- Words in Turkish do not start with the letter ğ as can be seen.
- The letter j is found only in loanwords from other languages. It is used in a small number of Turkish words.
- Frequencies are computed from Hurriyet and Zaman newspapers using columnist articles between 2001 and 2011.
- The graph contains all letters with frequencies exceeding 0.001%.
- Note that letters w, x, and q are not part of the Turkish alphabet. They are used to write foreign words.

Data:

Beginning of word letters in Turkish text.
Letter Frequency     Letter  Frequency
b      11.920%       p       1.633%
d       8.927%       ş       1.329%
k       7.682%       f       1.223%
a       7.089%       ü       1.106%
y       6.547%       u       0.990%
s       6.228%       c       0.955%
i       5.821%       r       0.937%
g       5.533%       z       0.848%
t       4.660%       l       0.653%
o       4.227%       ı       0.511%
h       3.860%       j       0.063%
e       3.668%       w       0.053%
m       3.500%       â       0.030%
v       3.496%       x       0.015%
n       2.485%       q       0.002%
ç       2.278%       ğ       0.001%
ö       1.730%      


1 comment: