Mentors’ Guidelines For Datathon Case On Economic Time Series Prediction

Posted in GD2018 Mentors, Mentors

In this article, the mentors give some preliminary guidelines, advice and suggestions to the participants for the case. Every mentor should write their name and chat name at the beginning of their text, so that there are no mix-ups with the other mentors. According to the rules, it is essential to follow the CRISP-DM methodology (http://www.sv-europe.com/crisp-dm-methodology/). The DSS […]

Website Guidelines

Posted in Guidelines

1. Overview of the Data.Platform website 1.1. Register: Register on our Datathon website. The link is https://www.datasciencesociety.net/datathon/register/ and registration is completed through e-mail confirmation. After that, you should fill in your profile, including an avatar (not necessarily your actual photo), your name, your data science interests, etc. Besides going to our public landing […]

Datathon industry experts guidelines

Posted in Guidelines

Dear colleague, we highly appreciate your participation in Datathon 2018 as an industry expert, and we are sure that your contribution will make for another great Datathon. Please see the mentor instructions for Datathon 2018: https://www.datasciencesociety.net/datathon/ 1. Log in to the Data Chat: Before starting to use the Data Chat, read the instructions […]

Using Machine Learning to explain and predict the life expectancy of different countries

Posted in Datathon cases

The project builds a model on data provided by the World Health Organization (WHO) to estimate life expectancy, in years, for different countries. The data covers the period from 2000 to 2015 and originates from here: https://www.kaggle.com/kumarajarshi/life-expectancy-who/data The resulting models were tested to see whether they maintain their accuracy when predicting life expectancy for data they were not trained on. Seven algorithms were used:

Linear Regression
Ridge Regression
Lasso Regression
ElasticNet Regression
Linear Regression with Polynomial features
Decision Tree Regression
Random Forest Regression
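
The idea of checking accuracy on data a model was not trained on can be sketched as follows. This is only a minimal illustration, not the project's code: it uses synthetic data (not the WHO dataset), a single feature, and a hand-written ordinary least squares fit; the helper names fit_simple_ols and r_squared are hypothetical.

```python
import random


def fit_simple_ols(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(xs, ys, slope, intercept):
    """Coefficient of determination of the fitted line on the given points."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


rng = random.Random(0)

# Synthetic data (an assumption for illustration): y = 2x + 50 plus Gaussian noise.
xs = [rng.uniform(0, 10) for _ in range(200)]
ys = [2.0 * x + 50.0 + rng.gauss(0, 1.0) for x in xs]

# Hold out the last 50 points; the model never sees them during fitting.
split = 150
train_x, test_x = xs[:split], xs[split:]
train_y, test_y = ys[:split], ys[split:]

slope, intercept = fit_simple_ols(train_x, train_y)
train_r2 = r_squared(train_x, train_y, slope, intercept)
test_r2 = r_squared(test_x, test_y, slope, intercept)

print(f"slope={slope:.3f} intercept={intercept:.3f}")
print(f"R^2 on training data: {train_r2:.3f}")
print(f"R^2 on held-out data: {test_r2:.3f}")
```

A large gap between the two scores would suggest the model does not generalize. The same train-then-score-on-held-out-data pattern applies to each of the listed algorithms.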

Stochastic Processes and Applications

Posted in Datathon cases

This notebook is a basic introduction to stochastic processes. It is meant for the general reader who is not very math savvy, such as the participants in the Math Concepts for Developers course at SoftUni.
It starts with a basic definition, followed by examples of the most popular types of processes: Random Walk, Brownian Motion (Wiener Process), Poisson Process and Markov chains. Their basic characteristics are stated, along with examples of possible applications. All the examples come with simulations in Python, and some are visualized.
The following packages have been used:

numpy
matplotlib.pyplot
random
scipy.stats
itertools
matplotlib.patches
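
As a rough illustration of the kind of simulation described above, here is a minimal sketch of two of the named processes using only random and itertools from the package list. It is not taken from the notebook; the function names random_walk and poisson_arrival_times, the seed, and the parameter values are assumptions chosen for the example.

```python
import random
from itertools import accumulate


def random_walk(n_steps, rng):
    """Simple symmetric random walk: start at 0, move +1 or -1 at each step."""
    steps = [rng.choice((-1, 1)) for _ in range(n_steps)]
    return [0] + list(accumulate(steps))


def poisson_arrival_times(rate, horizon, rng):
    """Arrival times of a Poisson process on (0, horizon).

    Gaps between consecutive arrivals are independent exponential
    random variables with mean 1 / rate.
    """
    times = []
    t = rng.expovariate(rate)
    while t < horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times


rng = random.Random(42)  # fixed seed so the run is reproducible

walk = random_walk(1000, rng)
arrivals = poisson_arrival_times(rate=2.0, horizon=100.0, rng=rng)

print("final position of the walk:", walk[-1])
print("number of arrivals:", len(arrivals))
```

With rate 2.0 and horizon 100.0, the expected number of arrivals is 200; the walk list can be passed directly to matplotlib.pyplot.plot for a quick visualization.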

Tiny smart data modelled with a not-so-tiny smart model – the Case of SAP

Posted in Team solutions

Tiny smart data modelled with a not-so-tiny smart model
Contents: Introduction, Metadata, Business Understanding, Data Understanding, Data Preparation, Modelling, Evaluation, Deployment, Conclusion
Metadata
Case: The SAP Case – Analyze Sales
Team: Chameleon
Project URL: https://github.com/Bugzey/Chameleon-SAP
Members: Stefan Panev (stephen.panev@gmail.com), Metodi Nikolov (metodi.nikolov@gmail.com), Ivan Vrategov (ivanvrategov@gmail.com), Radoslav Dimitrov (rdimitrov@indeavr.com)
Mentors: Alexander Efremov (aefremov@gmail.com), Agamemnon Baltagiannis (agamemnon.baltagiannis@sap.com)
Team Toolset: […]