Fundamentals of Text Analytics

    Course Description

    This series, Introduction to Text Analytics with R, helps you get started with text analytics and understand its fundamentals.

    What You'll Learn

      >  The importance of splitting data into training and test datasets

      >  Stratified sampling of imbalanced data using the caret package

      >  Representing text data for the purposes of machine learning

      >  Introduction to tokenization, stop words, and stemming

      >  The bag-of-words model

      >  Considerations for data pre-processing
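
    To give a feel for several of the topics above, here is a minimal sketch in base R of tokenization, stop-word removal, and a bag-of-words matrix. The three messages and the tiny stop-word list are made up for illustration and are not taken from the SMS Spam Collection dataset. The course itself may rely on dedicated packages for these steps, and stemming is left out here.

```r
# Hypothetical example messages (not from the course dataset).
messages <- c("Free prize now", "Are you free now", "Call me now")

# Tokenization: lower-case each message and split on runs of non-letters.
tokens <- strsplit(tolower(messages), "[^a-z]+")

# Stop-word removal with a tiny, illustrative stop-word list.
stop_words <- c("are", "you", "me")
tokens <- lapply(tokens, function(doc) doc[nzchar(doc) & !(doc %in% stop_words)])

# Vocabulary: every distinct remaining token, sorted alphabetically.
vocab <- sort(unique(unlist(tokens)))

# Bag-of-words: one row per message, one column per vocabulary term,
# each cell holding how often the term occurs in that message.
bow <- t(vapply(
  tokens,
  function(doc) tabulate(match(doc, vocab), nbins = length(vocab)),
  integer(length(vocab))
))
colnames(bow) <- vocab

print(bow)
```

    Each row of the resulting matrix is one message reduced to term counts. Word order is discarded, which is the defining simplification of the bag-of-words model.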


    Text Analytics tutorial slides can be accessed here

    Download R here

    SMS Spam Collection Dataset used in this tutorial can be accessed here


    Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.