In this conclusion to Introduction to Text Analytics with R series, we will discuss about accuracy of the model and why overfitting the model is not a good idea.
What You'll Learn
> Optimizing our model for the best generalization on new/unseen data.
> Discussion of the sensitivity/specificity trade-off of our optimized model.
> Potential next steps regarding feature engineering and algorithm selection for additional gains in effectiveness.
> For those that are interested, a collection of resources for further study to broaden and deepen their text analytics skills
Text Analytics tutorial slides can be accessed here
Download R here
SMS Spam Collection Dataset used in this tutorial can be accessed here
Data Science Dojo Instructor - Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.
© Copyright – Data Science Dojo