Monthly Challenge – Ontotext case – Week 4 – Mentor’s Approach

Posted in Learn

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/

Data Modeling + Investigation of the Results + Evaluation

At this stage, participants will use machine learning algorithms to classify the companies into industry categories. Teams are encouraged to try different techniques and then investigate the results. During this investigation, participants should analyze […]
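
The excerpt does not say which algorithms the mentors use. As one possible starting point, here is a minimal sketch of a multinomial Naive Bayes classifier with add-one smoothing plus a simple accuracy check, written with the standard library only. The function names, the toy tokens, and the labels "Finance" and "Technology" are hypothetical and are not taken from the challenge data.

```python
import math
from collections import Counter, defaultdict

def train_nb(samples):
    """Fit a multinomial Naive Bayes model from (tokens, label) pairs."""
    class_counts = Counter()            # number of documents per label
    word_counts = defaultdict(Counter)  # token counts per label
    vocab = set()
    for tokens, label in samples:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return {"classes": class_counts, "words": word_counts, "vocab": vocab}

def predict_nb(model, tokens):
    """Return the label with the highest log-posterior for the given tokens."""
    total_docs = sum(model["classes"].values())
    vocab_size = len(model["vocab"])
    best_label, best_score = None, -math.inf
    for label, n_docs in model["classes"].items():
        counts = model["words"][label]
        denom = sum(counts.values()) + vocab_size  # add-one smoothing
        score = math.log(n_docs / total_docs)
        for t in tokens:
            if t in model["vocab"]:  # tokens unseen in training are ignored
                score += math.log((counts[t] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def accuracy(model, samples):
    """Fraction of (tokens, label) pairs that the model labels correctly."""
    correct = sum(predict_nb(model, tokens) == label for tokens, label in samples)
    return correct / len(samples)

# Hypothetical toy data: tokenized company descriptions with made-up labels.
train = [
    (["bank", "loan", "credit"], "Finance"),
    (["software", "cloud", "data"], "Technology"),
    (["bank", "investment"], "Finance"),
    (["software", "mobile", "app"], "Technology"),
]
test = [(["bank", "credit"], "Finance"), (["cloud", "software"], "Technology")]

model = train_nb(train)
print(accuracy(model, test))
```

Accuracy alone can hide weak classes when the labels are imbalanced, so a per-class breakdown of the errors is usually a sensible next step when investigating results.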

Monthly Challenge – Ontotext case – Week 3 – Mentor’s Approach

Posted in Prediction systems

Text Representation + Feature Selection

At the end of this phase, every team should have its datasets converted into a format suitable for further analysis and for building the classification model. This means that the text should be turned into numbers in order to be able to […]
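
One common way to turn text into numbers is TF-IDF weighting. The excerpt does not say which representation the mentors chose, so the following is only a minimal sketch under that assumption, using the standard library. The function name `tfidf` and the toy documents are hypothetical, and each document is assumed to be a non-empty list of tokens.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return (vocabulary, vectors): one TF-IDF vector per tokenized document."""
    vocab = sorted({t for doc in docs for t in doc})
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(t for doc in docs for t in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([
            # Term frequency (relative count) times inverse document frequency.
            (counts[t] / len(doc)) * math.log(n_docs / df[t])
            for t in vocab
        ])
    return vocab, vectors

# Hypothetical toy corpus of already tokenized descriptions.
docs = [["bank", "loan", "bank"], ["software", "bank"]]
vocab, vectors = tfidf(docs)
print(vocab)
print(vectors)
```

A term that occurs in every document (here "bank") gets weight 0, which is the intended effect: it carries no information for telling the documents apart.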

Monthly Challenge – Ontotext case – Week 2 – Mentor’s Approach

1 Comment. Posted in Prediction systems

Data Preparation

Textual data is highly unstructured. To extract meaningful insights and apply mathematical algorithms, it must first be converted into a format suitable for analysis. This involves applying a series of transformations to the data, which will help you to represent the text […]
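
The excerpt is cut off before it lists the transformations, so the steps below (lowercasing, keeping alphabetic tokens, dropping stopwords) are an assumption about what such a series typically contains, not the mentors' pipeline. The function name `prepare`, the tiny stopword list, and the sample sentence are hypothetical.

```python
import re

# Hypothetical, deliberately tiny stopword list; real projects use a fuller one.
STOPWORDS = {"a", "an", "and", "in", "is", "of", "the", "to"}

def prepare(text):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]

# Made-up description, used only to illustrate the transformation.
print(prepare("Acme is a maker of the finest anvils."))
# → ['acme', 'maker', 'finest', 'anvils']
```

Note that the regular expression discards digits and non-ASCII letters, which may be too aggressive for company names; it is kept simple here on purpose.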

Monthly Challenge – Ontotext case – Week 1 – Mentor’s Approach

Posted in Classification systems

Week 1. Data Understanding + Feature Extraction

At this stage, participants will use visualization techniques to get familiar with the available ‘basic’ features (raw data). This phase is crucial because the observations and hypotheses made here will determine which techniques are appropriate when building […]

Monthly Challenge – Ontotext case – Solution – Team epistemi

5 Comments. Posted in Classification systems

Week 1. Data Understanding + Feature Extraction

We were provided with a dataset of 277419 records extracted from DBpedia. It has 6 features and 1 label with 32 classes, as summarised in the table below:

ID | Item Name | Item Type | Observations
1 | org | Feature | URL; may be useful for extraction of additional […]

Monthly Challenge – Ontotext – Case

1 Comment. Posted in Cases, Learn, MC-04-2019

Real Business Problem

Classification of companies into industry sectors is a fundamental task for unlocking advanced business intelligence capabilities. However, different data sources rarely use the same classification system, if they use one at all. This is a huge obstacle to taking advantage of the details available in Open Data and very niche commercial […]