Get notified on Telegram when an R script finishes.
Scenario: we have an id-value table and a matrix/tibble that contains the ids, and we need the labels. It may be useful when predicting the Key (or Ids) of in a
This tutorial introduces the Deep Learning classification task with Keras, with a focus on one-hot encoding, layer shapes, training, and model evaluation.
Analyzing how sample size affects the accuracy of a classification model.
tl;dr: This post is related to How to self-publish a book: A handy list of resources. It's centered around Bookdown and some non-standard customizations I found useful to create the Data Science
tl;dr: A list of useful resources for self-publishing a book on Amazon using Bookdown.
Exploratory data analysis (EDA) is the very first step in a data project. We will create a code template to do it with a single function.
This tutorial covers the initial setup needed to start developing machine learning models in the amazing R language.
About Machine Learning We all know that machine learning is about handling data, but it can also be seen as: The art of finding order in data by browsing its inner information. Some
Well, what you hate is the way that math was taught to you. That soup of equations, abstractions, and solutions to problems that we don’t know. It's hard to enjoy the things
Hi there! tl;dr: The Data Science Live Book is now available on Amazon! Kindle & Paperback versions! 🚀 👉 See at Amazon 📗! Link to the black & white version, also available in full color. It
funModeling quick-start This package contains a set of functions related to exploratory data analysis, data preparation, and model performance. It is used by people coming from business, research, and teaching (professors and students)
tl;dr: Convert numerical variables into categorical ones, as shown in the next image. ⏳ Reading time ~ 6 min. Let's start! The package funModeling (from version > 1.6.6) introduces two functions
Well, after some time and 300+ commits, this is the biggest release of the (open source) Data Science Live Book since its first publication more than a year ago :) tl;dr: Hi there!
Playing with dimensions Hi there! This post is an experiment combining the result of t-SNE with two well-known clustering techniques: k-means and hierarchical. This will be the practical section, in R. But
Hi there! I decided to rewrite almost all of the model validation section since it didn't reflect real-case scenarios. Hopefully, in the two new chapters you will gain deeper knowledge of methodological aspects
This update contains a new chapter -scoring- which is related to model performance and model deployment, used when predicting a binary outcome. Link to the scoring chapter. Important: To use the following updates, please
Introduction Time series have maximum and minimum points as general patterns. Sometimes the noise present in them makes it hard to spot the general behavior. In this post, we will smooth time series -reducing noise-
Introduction Inspired by this Netflix post, I decided to write a post based on this topic using R. There are several nice packages to achieve this goal, the one we're going to
Introduction Big Data helps us analyze unstructured data (aka "text") with many techniques; this post presents one: Cosine Similarity. There is also work by other analysts, who scraped
These systems are used in cross-selling industries, and they measure correlated items as well as their user rate. This last point wasn't included in the apriori algorithm (or association rules), used in market basket
Introduction This is an excellent resource to understand two data frame formats: Long and Wide. Just take a look at figure 1 inside the article. Long format: ggplot2 needs in certain
Introduction "I want to develop a model that automatically learns over time", a really challenging objective. In this post we'll develop a procedure that loads data, builds a model, makes predictions
Introduction In clustering, you let the data be grouped according to similarity. A cluster model is a group of segments -clusters- containing cases (such as clients, patients, cars, etc.). Once a cluster
Living longer, living better? It's equally important to measure how long we live and the quality of that life. Analyzing data from [eurostat](http://ec.europa.eu/eurostat/publications/recently-published?p_auth=ZKofrOKp&p_p_