## Introduction to Machine Learning for non-developers

About Machine Learning We all know that machine learning is about handling data, but it can also be seen as: The art of finding order in »

Well, what you hate is the way that math was taught to you. That soup of equations, abstractions, and solutions to problems that we don’t »

Hi there! tl;dr: The Data Science Live Book is now available at Amazon! Kindle & Paperback versions! 🚀 👉 See at Amazon 📗! Link to the black & »

funModeling quick-start This package contains a set of functions related to exploratory data analysis, data preparation, and model performance. It is used by people coming from »

Amazon Redshift is one of the hottest databases for Data Warehousing right now. It's one of the most cost-effective solutions available, and allows for integration with »

tl;dr: Convert numerical variables into categorical ones, as shown in the next image. ⏳ Reading time ~ 6 min. Let's start! The package funModeling (from version »

This package lets you analyze the variables of a dataset, to evaluate how the data is shaped. Consider this the first step when you have your »

Well, after some time and 300+ commits, this is the biggest release of the Data Science Live Book (open source), after the first publication more than »

Playing with dimensions Hi there! This post is an experiment combining the result of t-SNE with two well-known clustering techniques: k-means and hierarchical. This will »

I have a New Year's surprise for you! This Shiny app is meant to be a system for basic reporting in the style of most Business Intelligence »

A year ago I wrote about a way to authenticate Shiny with Auth0, using Apache: http://blog.datascienceheroes.com/adding-authentication-to-shiny-open-source-edition/ This method works but has some »

Hi there! I decided to almost rewrite the model validation section since it didn't reflect real-case scenarios. Hopefully in the two new chapters you will »

This update contains a new chapter -scoring- which is related to model performance and model deployment, used when predicting a binary outcome. Link to the scoring »

Hi! Well, finally there is the first release of this project: An open source book which will hopefully contain some useful resources for those who want »

Introduction Time series have maximum and minimum points as general patterns. Sometimes the noise present in them makes it hard to spot general behavior. In this post, »

Amazon's columnar database, Redshift, is a great companion for a lot of Data Science tasks. It allows for fast processing of very big datasets, with a »

POST UPDATE 09/24/2016 Good news! funModeling documentation evolved into an open source book! Please follow the link below Jump to the book... This release »

Introduction Inspired by this Netflix post, I decided to write a post based on this topic using R. There are several nice packages to achieve this »

Introduction Big Data helps us analyze unstructured data (aka "text") with many techniques. This post presents one: Cosine Similarity. There are also »

Shiny Server is a great solution for BI/analytics reporting. It leverages the power of the R language to create interactive reports/dashboards. Maybe you »