ARIMA models are not perfect.

ARIMA models are usually a simple and effective way to forecast a time series, provided certain assumptions hold. When those assumptions fail, however, ARIMA models cannot respond properly. Let me illustrate this with a hands-on example in R, using the tseries and forecast packages. We immediately check that this series follows a […]

Partial Least Squares [R]

Little explanation is needed here. There are probably implementations out there already, but I got bored and decided to write my own as well. It may contain errors.

Random Acts of Pizza

I stumbled upon this Kaggle competition and decided to give it a try. The original data is in JSON format and can be found on the competition website. It offers a vast number of variables, so it is really difficult to select just a few of them. My approach was to perform sentiment analysis […]

Messing with the IGN ratings dataset

I saw this Reddit link via @TextMining_r and couldn’t resist doing some basic experimentation related to the console/platform wars. Which platform was the best in its generation? Most argue it is not about the system itself but the games, so here is a magnificent ggplot2 graph showing the mean game score for every platform IGN […]

Mean residual life

Suppose we have a survival analysis study on our hands and, given a variable that defines the lifetime of something, we are interested in the expected lifetime of an individual given that it has already lived a certain amount of time. To answer that, we turn to what […]
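
For reference, the quantity described above is usually written as follows. This is only a sketch of the standard textbook definition, not necessarily the notation used in the full post: the symbols T (the lifetime variable), t (the time already lived), S (the survival function of T) and m (the mean residual life) are assumptions, since the excerpt does not show them.

```latex
% Assumed notation: T >= 0 is the lifetime, S(t) = P(T > t) its survival function.
% The mean residual life at time t is the expected remaining lifetime
% of an individual that has already survived up to t.
\[
  m(t) \;=\; \mathbb{E}\left[\, T - t \mid T > t \,\right]
       \;=\; \frac{1}{S(t)} \int_{t}^{\infty} S(u)\, du ,
  \qquad S(t) > 0 .
\]
```

Under this notation, m(0) is simply the mean lifetime E[T], and the expected total lifetime of an individual that has survived to t is t + m(t).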