A tutorial on statistical-learning for scientific data processing