N-grams






     

    Course Description

    N-grams are a method to help machines understand a word and its context in order to better understand the meaning of the word. 

    What You'll Learn

      >  Validate the effectiveness of TF-IDF in improving model accuracy.

      >  Introduce the concept of N-grams as an extension to the bag-of-words model to allow for word order.

      >  Discuss the trade-offs involved of N-grams and how Text Analytics suffers from the “Curse of Dimensionality”.

      >  Illustrate how quickly Text Analytics can strain the limits of your computer hardware.

    Watch this short Introduction to N-grams for a general understanding of the method popular in NLP and text analytics.



     

    Text Analytics tutorial slides can be accessed here

    Download R here

    SMS Spam Collection Dataset used in this tutorial can be accessed here



     

    Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.