Course Description

    This series, Introduction to Text Analytics with R, helps you get started with text analytics and understand its fundamentals.

    What You'll Learn

      >  The importance of splitting data into training and test datasets

      >  Stratified sampling of imbalanced data using the caret package

      >  Representing text data for the purposes of machine learning

      >  Introduction to tokenization, stop words, and stemming

      >  The bag-of-words model

      >  Considerations for data pre-processing
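
    To give a feel for the tokenization, stop-word, and bag-of-words topics listed above, here is a minimal sketch in base R. The three example messages and the tiny stop-word list are made up for illustration; they are not taken from the SMS Spam Collection, and the course itself may rely on dedicated text-analytics packages rather than this hand-rolled approach. Stemming is left out because it would need an additional package.

```r
# Toy corpus (hypothetical messages, for illustration only)
docs <- c("Free prize waiting for you",
          "Are you free for lunch",
          "Claim the prize now")

# Tiny illustrative stop-word list (an assumption, not a standard list)
stop_words <- c("for", "you", "are", "the")

# Tokenization: lower-case each message, then split on any run of non-letters
tokens <- strsplit(tolower(docs), "[^a-z]+")

# Stop-word removal, also dropping empty strings the split may leave behind
tokens <- lapply(tokens, function(x) x[nzchar(x) & !(x %in% stop_words)])

# Vocabulary: the sorted set of unique terms across all messages
vocab <- sort(unique(unlist(tokens)))

# Bag-of-words: one row per message, one column per term, cells are counts
bow <- t(vapply(
  tokens,
  function(x) tabulate(match(x, vocab), nbins = length(vocab)),
  integer(length(vocab))
))
colnames(bow) <- vocab

print(bow)
```

    Each row of the resulting matrix is the numeric representation of one message that a machine learning model could consume. Word order is discarded, which is the defining trade-off of the bag-of-words model.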


    Text Analytics tutorial slides can be accessed here

    Download R here

    SMS Spam Collection Dataset used in this tutorial can be accessed here


    Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.