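- Split each document into words
- Count word frequencies to get one frequency vector per document
- Compute the dot product of the two vectors and normalize it by dividing by the product of their lengths
- Take the arccosine of the result to get the angle between the documents
  - 0 degrees: same relative word frequencies (e.g. identical documents)
  - 90 degrees: no common words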
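# Python 2: map punctuation to spaces and uppercase to lowercase
translation_table = string.maketrans(string.punctuation + string.uppercase,
                                     " " * len(string.punctuation) + string.lowercase)

# Python 3: string.maketrans and string.uppercase/lowercase are gone; use these instead
translation_table = str.maketrans(string.punctuation + string.ascii_uppercase,
                                  " " * len(string.punctuation) + string.ascii_lowercase)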