Letters Frequency Relative Fri Lower Bound Upper Bound Q 5 Z X J 0.000212 2.63984E-05 0.000398 6 0.000254 5.08478E-05 0.000457 17 0.000719 0.000377282 0.001061 134 0.005669 0.004711853 0.006626 143 0.00605 0.005061402 0.007039 257 0.010873 0.009550911 0.012195 323 0.013665 0.012184948 0.015145 379 0.016034 0.014432707 0.017635 416 0.0176 0.015923667 0.019276 442 0.018699 0.017127129 0.020271 513 0.021703 0.020041148 0.023365 586 0.024792 0.022809723 0.026774 603 0.025511 0.023500926 0.027521 609 0.025765 0.023745207 0.027785 703 0.029742 0.027576348 0.031908 935 0.039557 0.037072109 0.042042 1097 0.04641 0.043728076 0.049092 1362 0.057622 0.054651246 0.060593 1381 0.058425 0.055434893 0.061415 1407 0.059525 0.05650864 0.062541 0.07474 1503 0.063587 0.060476159 0.066698 1689 0.071456 0.068172171 1810 0.076575 0.073184965 0.079965 2235 0.094555 0.090824791 0.098285 2237 0.09464 0.09090829 0.098372 2845 0.120362 0.11621383 0.12451 23637 K V B P -OUWE C M Y F U L D R I > IZS N H A T O E SUM
The data file includes the text of three books of the Bible (Joshua, Jonah and Philippians) using the ESV translation. While these are all great books, our only interest for this project is how often each letter is used.
4) The previous analysis could be useful if our goal was to decipher an encrypted message, where each letter is scrambled (for example, each “a” might become a “g”, while each “b” might become an “o” and so forth).
a) Assume that the letter “z” in encrypted message has a relative frequency of 0.06 (it accounts for 6% of the total number of letters). Which letter’s Confidence Intervals from question 3 contain 0.06 and thus are the most likely candidates to be the letter which was encrypted as “z”?
b) Further assume that “y” in the encrypted message has a relative frequency of 0.04 (4%). Which letter’s question CIs contain 0.04?
c) If “x” in the encrypted message has a relative frequency of 0.02 (2%), which letter’s question 3 CIs contain 0.02?
Trending now
This is a popular solution!
Step by step
Solved in 4 steps