N-grams are a method to help machines understand a word and its context in order to better understand the meaning of the word.
What You'll Learn
> Validate the effectiveness of TF-IDF in improving model accuracy.
> Introduce the concept of N-grams as an extension to the bag-of-words model to allow for word order.
> Discuss the trade-offs involved of N-grams and how Text Analytics suffers from the “Curse of Dimensionality”.
> Illustrate how quickly Text Analytics can strain the limits of your computer hardware.
Text Analytics tutorial slides can be accessed here
Download R here
SMS Spam Collection Dataset used in this tutorial can be accessed here
Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.
© Copyright – Data Science Dojo