Friday, April 27, 2012

Letter Frequencies in Turkish




- Frequencies are computed from Hurriyet and Zaman newspapers using columnist articles between 2001 and 2011.
- The graph contains all letters with frequencies exceeding 0.001%.
- Note that w, x, and q are not part of the Turkish alphabet.

Data
Letter  Frequency      Letter   Frequency
a         11.742%      z           1.511%
e          9.373%      g           1.254%
i          8.714%      h           1.134%
n          7.344%      ç           1.035%
r          6.978%      v           1.015%
l          6.372%      ğ           0.998%
ı          4.734%      c           0.974%
k          4.599%      p           0.880%
d          4.287%      ö           0.797%
m          3.759%      f           0.518%
t          3.562%      j           0.068%
y          3.426%      â           0.062%
s          3.170%      w           0.019%
u          3.032%      î           0.014%
o          2.602%      x           0.008%
b          2.554%      û           0.004%
ü          1.886%      q           0.001%
ş          1.571%         

No comments:

Post a Comment