Skip to main content

Site blog

Spooky Author Identification | Exploratory Data Analysis in R

Spooky Author Identification | Exploratory Data Analysis in R

This blog is based on some exploratory data analysis performed on the corpora provided for the “Spooky Author Identification” challenge at Kaggle. The Spooky Challenge A Halloween-based challenge [1] with the following goal: predict who was writing a sentence of a possible spooky ...

Data Science and Law: What One Lawyer Learned From a 50-Hour Data Science Bootcamp

Data Science and Law: What One Lawyer Learned From a 50-Hour Data Science Bootcamp

Is there a relation between data science and law? Here's what a lawyer learned from a 50-hour data science bootcamp at Data Science Dojo.With an increased focus on the growing role of data science and data analytics in the future of law, I decided that it was high time to learn what all the fuss i...

Ethics in Research: Conducting A/B Testing on Customers

Ethics in Research: Conducting A/B Testing on Customers

Ethics in A/B testing is essential. A/B testing might not be as simple and harmless as it looks. Learn how to take care of ethical concerns in A/B tests.The Ethical Way to A/B Test We have come a long way since the days of horrific human experiments during World Wars, the Stanford prison experime...

Building Data Visualization Tools

Building Data Visualization Tools

Data visualization tools are used to gain meaningful insights from data. Learn how to build visualization tools with examples.The content of this blog is based on examples/notes/experiments related to the material presented in the “Building Data Visualization Tools” module of the “Mastering ...

Math for Machine Learning: Top Math Resources for Data Scientists

Math for Machine Learning: Top Math Resources for Data Scientists

At some point, every aspiring data scientist has to get familiar with mathematics for machine learning. To be blunt, the more serious you are about to learn data science, the more math you’ll need to learn for machine learning. If you have a strong math background, this is likely to lit...

Power BI and R: Intro to Visualizations

Power BI and R: Intro to Visualizations

Power BI and R can be used together to achieve analyses that are difficult or impossible to achieve. Microsoft’s Power BI is a powerful technology for quickly creating rich visualizations. Power BI has many practical uses for the modern data professional including executive dashboards, ...

Math for Machine Learning: Math for Aspiring Data Scientists

Math for Machine Learning: Math for Aspiring Data Scientists

If you're an aspiring data scientist, you will need to know some math. Learn why math is important in machine learning. At the end of each of our bootcamps we ask our students to provide us with feedback on their experience. In particular, we ask for honest assessments and opinions on h...

R Language Programming for Excel Users

R Language Programming for Excel Users

R Language is a vital skill for scientists, as evidenced by R's rapid rise in popularity. Not surprisingly, we teach the R language used in programming in our Bootcamp. However, per our mission of “data science for everyone,” most of our students do not have extensive programming ...

Natural Language Processing with R Programming Books

Natural Language Processing with R Programming Books

Natural Language Processing is a key Data Science skill. Learn how to can expand your R programming knowledge with Text Analytics. It is my firm conviction that Natural Language Processing/Text Analytics is a must-have skill for any practicing Data Scientist. From analyzing customer ...

Feature Engineering and Data Wrangling in R

Feature Engineering and Data Wrangling in R

Feature engineering and data wrangling are key skills for a data scientist. Learn how to accelerate your R coding to deliver more, and better, features. Earlier this month I had the privilege of traveling to Amsterdam to teach a great group of folks data science. As is so often the case, ...