Datathon Telenor Solution – Analysing and Predicting Delays in Mobile Data Connectivity

Posted 9 CommentsPosted in Prediction systems

“You know nothing, Jon Snow ……”
is what Ygritte yells to Jon.

Here our situation is also the same as we know nothing about the TELENOR case until we have seen the dataset.

At first, When we heard about the Datathon as beginners, we were very excited to take apart in it.
At finally we received our datasets and here’s our first challenge to import the dataset into the programming platforms,
As we have faced some hurdles to import the dataset as the size of the dataset is around 4GB which has taken some time and put us in the situation :

What we don’t know is what usually gets us killed………………………– Petyr Baelish

we have mentioned above line to express our feeling that we don’t know what’s in the dataset but we want to explore through that.
At last, we are ready to Analysis What do Game of Thrones and Telecoms Have in Common?

At first, when we have gone through the dataset, We have noticed that the Telenor data contains 16 exciting columns with
30091754 jolted rows

When we have gone through the first analysis, we came to know that how complicated the data is, it contains many interesting aspects which we have done through the Exploratory Data Analysis.

Here our main challenge is to predict the fails in the next four days
At First, we have done Exploratory data analysis
(i) Top 10 ravens with fails :
RAVEN_NAME
Brass raven Birdy
Brown raven Ruby
Yellow raven Rio
Blue raven Axel
Razzle Dazzle Rose raven Cleo
Cadmium Red raven Bubba
Vain And Lazy raven Polly
Fearful Carrion raven Gizmo
Blast Off Bronze raven Zazu
Loving raven Maxwell

(ii) Top 10 ravens without fails:

RAVEN_NAME
Metallic Sunburst raven Polly
Green Sheen raven Azul
Less Combative raven Zazu
Weak raven Buddy
Copper raven Tweety
Spectral Yellow raven Zazu
Mythical raven Tiki
Cyber Grape raven Faith
Mysterious And Venerable raven Bubba
Shadow Blue raven Sammy

(iii) The family with most fails :

FAMILY_NAME
Targerian

(iv) The family with least fails :

FAMILY_NAME
Baelish

(v) The family member with most fails :

MEMBER_NAME
Petyr Baelish

(vi) The family member with least fails :

MEMBER_NAME
Euron

After the EDA we need to predict the future four days of delays in mobile data connectivity. To predict the four days delays we use Time Series analysis.

In Time Series Analysis we used three algorithms ARIMA, Simple Exponential Analysis, Recurrent Neural Networks.

We fitted the model with ARIMA and predict the failures of four days and fitted the model using another algorithm Simple Exponential Analysis.

And We used Recurrent Neural Networks for Prediction of failures.

After Fitting the three models using three different algorithms we evaluated by splitting the data into train and test.

We evaluated the best fit model by using the Root Mean Square Error. By considering the RMSE values of the three models, the model with the least RMSE value is taken as the best fit model.

In this case, considering the mobile failure dataset, RNN(Recurrent Neural Network)has the least RMSE value.

So, RNN is taken as the best fit model to predict the future four days of mobile data delays.

Based on the RNN algorithm the prediction of delays for the next four days based on the dataset
are 973776,973725,973674,973623 for 5 ,6,7,8, August 2018 respectively.

Deciphering Cryptocurrency by Occam’s Data Ninjas

Posted Leave a commentPosted in Datathons Solutions, Team solutions

https://drive.google.com/open?id=1cqUOvGzJtF4qUIOmlxKawhfrjPuzWaDl Occam’s Data Ninjas Please find the attachment for the code in the above google drive link. Occam’s Data Ninjas Team Mentor: Dr. Subhabaha Pal @drsubhabahapa Team Members: Mervyn V. Akash @codemonger (mervyn.akash10@gmail.com) Tanuj Maithani @tanuj (tanuj0004@gmail.com) Tamma Ravindra Reddy @mr-reliable (ravindrareddytamma@gmail.com) Sowmya Dyagala @dsowmya (sowmya048@gmail.com) Team Motto: Simplicity at it’s best. Team Toolset: R Studio, Microsoft Excel. Business Understanding and Objective: There is […]

Datathon-HackNews-Solutions-Data Titans

Posted 4 CommentsPosted in Datathons Solutions

  Team Name : Data Titans Team Members : M.HEMANTH KUMAR, A.PAVAN SHANKAR, B.MANOHAR, V. LITHIN CHOWDARY,  E.V.S.SAI RAM PROBLEM STATEMENT : Hack the news whether it is propaganda or Non-Propaganda INTRODUCTION: Propaganda is a view which can mislead us to certain false assumptions, So here we got a chance to Identify the Propaganda in the […]

Datathon Ontotext Mentors’ Guidelines – Text Mining Classification

Posted Leave a commentPosted in GD2018 Mentors, Mentors

In this article the mentors give some preliminary guidelines, advice and suggestions to the participants for the case. Every mentor should write their name and chat name in the beginning of their texts, so that there are no mix-ups with the other menthors. By rules it is essential to follow CRISP-DM methodology (http://www.sv-europe.com/crisp-dm-methodology/). The DSS […]

Cryptocurrency Prediction by Kautilya

Posted 6 CommentsPosted in Datathons Solutions, Learn, Team solutions

Given the cryptocurrencies’ data, we aim to forecast the future cryptocurrencies’ prices so as to execute profitable trades. We show that the cryptocurrencies’ prices also exhibit desirable properties such as stationarity and mixing. Some classical time series prediction models that exploit this behavior, such as “Arima” models produce poor predictions and also lack good probabilistic interpretations. We have introduced a theoretical framework in the 1st place and for predicting and trading prices of the cryptocurrencies for future and based on that we have designed our model which is based on “Neural Network” model which can give better prediction values as compared to the other models.