Term Frequency-Inverse Document Frequency (TF-IDF) is a commonly-used
NLP preprocessing technique. The idea is to collapse each document
in a corpus (dataset) into a vector using 1-2 steps.
Term Frequencies: Count the number of times each word in the vocabulary
appears in the