Monthly Challenge – Ontotext case – Solution – Team epistemi

Posted 5 CommentsPosted in Classification systems

Week 1. Data Understanding + Feature Extraction We have been provided with a dataset consisting of 277419 records, which are extracted from DBpedia. It has 6 features and 1 label of 32 classes as summarised in below table: ID Item Name Item Type Observations 1 org Feature URL; may be useful for extraction of additional […]

Monthly Challenge – Ontotext – Case

Posted 1 CommentPosted in Cases, Learn, MC-04-2019

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/ Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/ Real Business Problem Classification of companies into industry sectors is a fundamental task for unlocking advanced business intelligence capabilities. However different data sources rarely use the same classification system if any. This is a huge obstacle for taking advantage of the available details in Open Data and very niche commercial […]

Monthly Challenge – Sofia Air – Solution – [iseveryonehigh]

Posted 8 CommentsPosted in Prediction systems

I have just begun my machine learning course from Andrew Ng at Coursera so I thought that this challenge would be a good test of my learnings. I apologise for the delay for article writing as I was not sure if I should have taken this challenge or not since the dataset seemed difficult to […]

Monthly Challenge – Sofia Air – Solution – Kung Fu Panda

Posted 12 CommentsPosted in Prediction systems

1. Business Understanding The air quality in Sofia, Bulgaria, has been a problem for some time already. The population of the city is constantly increasing and this brings more traffic on the streets. The car ownership in Sofia is among the highest in Europe with around 600 cars per 1000 citizens. Another huge issue in […]