Course Description

    In this conclusion to Introduction to Text Analytics with R series, we will discuss about accuracy of the model and why overfitting the model is not a good idea.

    What You'll Learn

      >  Optimizing our model for the best generalization on new/unseen data.

      >  Discussion of the sensitivity/specificity trade-off of our optimized model.

      >  Potential next steps regarding feature engineering and algorithm selection for additional gains in effectiveness.

      >  For those that are interested, a collection of resources for further study to broaden and deepen their text analytics skills


    Text Analytics tutorial slides can be accessed here

    Download R here

    SMS Spam Collection Dataset used in this tutorial can be accessed here


    Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.