The distance of uniqueness (in cryptology) is the number of ciphertext characters at which the conditional informational entropy of the key (and, consequently, the plaintext ) is zero, and the key itself is determined uniquely.
Achieving the distance of uniqueness does not mean that the key (or plaintext) can be found in practice, since the definition does not take into account the practical computability of the key, but only postulates that it can be found, for example, by exhaustive search .
Content
Definition
We define the key reliability function through the conditional informational entropy of the key and ciphertext characters cryptanalyst intercepts:
This number of intercepted characters at which and is called the distance of uniqueness.
Approximate Formula
The derivation of the uniqueness distance formula is possible for some “good” cryptosystem, for which the informational entropy of the ciphertext has certain properties of “linearity”:
-
- Where - the total number of characters of the ciphertext message - the ciphertext alphabet, for simplicity taken equal, including for plain text, and for the encryption key
- the last expression is the "linearization" of the expression
Then from the expressions for joint informational entropy:
-
- {\ displaystyle n \ approx {{H \ left (Z \ right) -f_ {n}} \ over {\ log LH \ left (X \ right) / N}} = {{H \ left (Z \ right) -f_ {n}} \ over {\ log L \ cdot \ left ({1 - {{H \ left (X \ right)} \ over {N \ log L}}} \ right)}}}
Then according to the definition of the distance of uniqueness as :
Expression called source redundancy . If the source redundancy is zero, that is, it is impossible to determine from the plain text whether it is correct or not (it does not have checksums or signatures), then the distance of uniqueness becomes equal to infinity, and the cryptosystem is absolutely reliable.
Example
For the Russian language, redundancy is 3.5 bits per character. If a mono-alphabetic cipher is used , then the number of possible keys in it is equal , and the entropy of the key (with an equally probable choice) bit .
Then the distance of uniqueness for the Russian text encrypted with a simple replacement cipher is:
That is, if a cryptanalyst intercepts more than 35 characters of the ciphertext, this will most likely allow (for example, exhaustive search) to restore the original plaintext. If fewer characters are intercepted, the text recovery will be ambiguous (there may be several different plaintext options).
Literature
- Gabidulin E.M. , Kshevetsky A.S. , Kolybelnikov A.I. Distance of uniqueness // Information security : a training manual - M .: MIPT , 2011. - 225 p. - ISBN 978-5-7417-0377-9
- G.V. Basalova The distance of uniqueness // Fundamentals of cryptography (Russian)
Links
- Bruce Schneier : How to Recognize Plaintext (Crypto-Gram Newsletter December 15, 1998 )
- Unicity Distance computed for common ciphers