Data Science Heroes Blog
  • Blog
  • Twitter
  • Datos en R (Spanish)

big data

A collection of 5 posts

R

Anomaly Detection in R

Introduction Inspired by this Netflix post, I decided to write a post based on this topic using R. There are several nice packages to achieve this goal, the one we´re going to

  • Pablo Casas
    Pablo Casas
4 min read
R

Text Mining Analysis: some theory and practice in R

Introduction Big Data help us to analyze unstructred data (aka "text" ), with many techniques, in this post it is presented one: Cosine Similarity. There are also other analysts work, who scraped

  • Pablo Casas
    Pablo Casas
3 min read
R

{Long Vs. Wide} Data Frames

Introduction This is an excellent resource to understand 2 types of data frame format: Long and Wide. Just take a look at figure 1 inside the article Long format: ggplot2 needs in certain

  • Pablo Casas
    Pablo Casas
1 min read
R

Introduction to automatic machine learning

Introduction "I want to develop a model that automatically learns over time", a really challenging objective. We'll develop in this post a procedure that loads data, build a model, make predictions

  • Pablo Casas
    Pablo Casas
5 min read
R

Dynamic analysis on outliers

Treating outliers Introduction Outliers are the extreme values that a variable has, depending on the model or requirement, it could be necessary to treat them, either transforming or deleting. Variable “Income”

  • Pablo Casas
    Pablo Casas
2 min read
Data Science Heroes Blog © 2025
Latest Posts Facebook Twitter Ghost