Senior Data Scientist @ Nederlandse Energie Maatschappij

by Rodrigo Agundez - 01 May 2017
Tags: #data science #Spark #Python #Pandas #ssh #git #S3 #PyMC3 #SparkML

In this project I was responsible for adding a model to an existing Spark pipeline. The model assigns customer conversion probabilities to different price-offering strategies. This type of model is not available in Spark, so I built a custom implementation that integrates seamlessly with the existing SparkML pipeline.