Monthly Challenge – Ontotext case – Week 4 – Mentor’s Approach

Posted in Learn

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/

Data modeling + Investigation of the results + Evaluation

At this stage, participants will have to use machine learning algorithms for classifying the companies into industry categories. Teams will be encouraged to try different techniques and then investigate the results. During this investigation, participants should analyze […]

Monthly Challenge – Ontotext case – Week 3 – Mentor’s Approach

Posted in Prediction systems

Text Representation + Feature selection

At the end of this phase, every team should have their datasets turned into an appropriate format for further analysis and for building the classification model. This means that the text should be turned into numbers in order to be able to […]
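The excerpt above is cut off, so the mentors' actual representation is not shown here. As a rough illustration of what "turning text into numbers" can mean, the sketch below computes plain TF-IDF weights using only the Python standard library. The function name `tfidf` and the toy documents are hypothetical, and the weighting (term frequency times unsmoothed log inverse document frequency) is one common variant among several.

```python
import math
from collections import Counter

def tfidf(docs):
    """Turn tokenized documents into TF-IDF vectors.

    docs: list of token lists.
    Returns (vocabulary, rows), where each row holds one weight per vocabulary term.
    """
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(tok for doc in docs for tok in set(doc))
    idf = {tok: math.log(n_docs / df[tok]) for tok in vocab}
    rows = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty documents
        rows.append([counts[tok] / total * idf[tok] for tok in vocab])
    return vocab, rows

# Toy, made-up company descriptions (already tokenized).
docs = [
    ["bank", "loans", "bank"],
    ["software", "bank"],
    ["software", "games"],
]
vocab, matrix = tfidf(docs)
```

Each row of `matrix` is a fixed-length numeric vector that a classifier can consume; terms that occur in every document would receive a weight of zero under this variant.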

Monthly Challenge – Ontotext case – Week 2 – Mentor’s Approach

Posted in Prediction systems

Data Preparation

Textual data is highly unstructured, so to extract meaningful insights and apply mathematical algorithms, it should first be turned into a format suitable for analysis. This includes applying a series of transformations to the data, which will help you to represent the text […]
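The excerpt does not list the specific transformations, so the following is only a minimal sketch of typical text preparation steps (lowercasing, removing non-letter characters, tokenizing, dropping stop words). The function `prepare`, the tiny stop-word set, and the sample sentence are all hypothetical and chosen for illustration.

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"a", "an", "and", "in", "is", "of", "the", "to"}

def prepare(text):
    """Lowercase, replace non-letters with spaces, split on whitespace, drop stop words."""
    text = text.lower()
    text = re.sub(r"[^a-z\s]", " ", text)
    return [tok for tok in text.split() if tok not in STOP_WORDS]

# Made-up company description.
tokens = prepare("Acme Ltd. is a maker of industrial robots, founded in 1999.")
```

A real pipeline would likely use a fuller stop-word list and might add stemming or lemmatization, depending on what the later modeling steps need.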

Monthly Challenge – Ontotext case – Week 1 – Mentor’s Approach

Posted in Classification systems

Week 1. Data Understanding + Feature Extraction

At this stage, participants will have to use some visualization techniques to get familiar with the available ‘basic’ features (raw data). This phase is crucial because the observations and hypotheses made here will determine which techniques will be appropriate when building […]

Monthly Challenge – Ontotext case – Solution – Team epistemi

Posted in Classification systems (5 Comments)

Week 1. Data Understanding + Feature Extraction

We have been provided with a dataset consisting of 277419 records, which are extracted from DBpedia. It has 6 features and 1 label of 32 classes, as summarised in the table below:

ID | Item Name | Item Type | Observations
1 | org | Feature | URL; may be useful for extraction of additional […]

Monthly Challenge – Ontotext case – Solution – Door

Posted in Classification systems (4 Comments)

1. Business Understanding

Developing an automated and standardized classification model that can be used on any source to enrich the originally available data with industry sector information. Ultimately, the task can be framed as an error/anomaly detection task. At its core, it is still a classification problem and the output should not be the ultimate […]

Monthly Challenge – Ontotext – Case

Posted in Cases, Learn, MC-04-2019

Real Business Problem

Classification of companies into industry sectors is a fundamental task for unlocking advanced business intelligence capabilities. However, different data sources rarely use the same classification system, if any. This is a huge obstacle to taking advantage of the available details in Open Data and very niche commercial […]

Monthly Challenge – Learning-by-doing in Data Science

Posted in News

The theory is not enough

Academic education is indeed the best long-term investment in our professional and personal achievements. However, it is becoming crucial for universities to include practical seminars in their educational programs, with the aim of preparing students for the real problems they will be solving as professionals in a given […]