Monthly Challenge – Ontotext case – Week 4 – Mentor’s Approach

Posted in Learn

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/
Data modeling + Investigation of the results + Evaluation
At this stage, participants will have to use machine learning algorithms for classifying the companies into industry categories. Teams will be encouraged to try different techniques and then investigate the results. During this investigation, participants should analyze […]
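
As a rough illustration of the modeling and evaluation step described above, here is a minimal sketch of one possible classifier: a multinomial Naive Bayes with add-one smoothing, written in plain Python. The toy documents and the labels "Finance" and "Technology" are made up for the example and are not taken from the challenge data; the mentors' own approach may use entirely different techniques.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs, labels):
    """Count classes and per-class word frequencies from tokenized documents."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        word_counts[label].update(doc)
    vocab = {tok for doc in docs for tok in doc}
    return class_counts, word_counts, vocab

def predict_nb(model, doc):
    """Return the class with the highest log-probability for a tokenized document."""
    class_counts, word_counts, vocab = model
    n_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        total = sum(word_counts[label].values())
        score = math.log(count / n_docs)  # class prior
        for tok in doc:
            if tok in vocab:  # words never seen in training are ignored
                # Add-one (Laplace) smoothing avoids zero probabilities.
                score += math.log((word_counts[label][tok] + 1) / (total + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def accuracy(model, docs, labels):
    """Fraction of documents whose predicted label matches the true label."""
    hits = sum(predict_nb(model, d) == y for d, y in zip(docs, labels))
    return hits / len(labels)

# Toy data (hypothetical, for illustration only).
train_docs = [["bank", "loans"], ["bank", "credit"],
              ["software", "cloud"], ["software", "apps"]]
train_labels = ["Finance", "Finance", "Technology", "Technology"]
model = train_nb(train_docs, train_labels)

test_docs = [["cloud", "software"], ["bank"]]
test_labels = ["Technology", "Finance"]
print(accuracy(model, test_docs, test_labels))
```

Accuracy alone can hide weak classes when the label distribution is skewed, so investigating the results would usually also involve looking at per-class errors.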

Monthly Challenge – Ontotext case – Week 3 – Mentor’s Approach

Posted in Prediction systems

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/
Text Representation + Feature selection
At the end of this phase, every team should have converted their datasets into a format suitable for further analysis and for building the classification model. This means that the text should be turned into numbers in order to be able to […]
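
To make "turning text into numbers" concrete, here is a minimal sketch of one common representation, TF-IDF, in plain Python. The smoothed IDF formula is one variant among several and is chosen here as an assumption; the toy documents are invented for the example and do not come from the challenge dataset.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return (vocabulary, vectors) for a list of tokenized documents."""
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: the number of documents each term appears in.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed inverse document frequency (an assumed variant).
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1 for t in vocab}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # guard against empty documents
        # Term frequency times IDF, one value per vocabulary term.
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors

# Toy documents (hypothetical, for illustration only).
docs = [["software", "company"], ["oil", "company"], ["software", "services"]]
vocab, vectors = tf_idf(docs)
print(vocab)
print(vectors[1])
```

Each document becomes a fixed-length vector over the shared vocabulary, and terms that appear in fewer documents receive a higher weight than terms that appear everywhere.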

Monthly Challenge – Ontotext case – Week 2 – Mentor’s Approach

Posted in Prediction systems

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/
Data Preparation
Textual data is highly unstructured; to extract meaningful insights and apply mathematical algorithms, it must first be turned into a format suitable for analysis. This includes the application of a series of transformations on the data which will help you to represent the text […]
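
A typical series of such transformations can be sketched as follows: lowercasing, tokenizing, and removing stop words. This is an illustrative assumption about what the preparation step might include, not the mentors' actual pipeline; the stop-word list is a deliberately tiny, hypothetical one.

```python
import re

# Hypothetical, deliberately small stop-word list; a real project would use a fuller one.
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "is", "for", "to"}

def preprocess(text):
    """Lowercase the text, split it into runs of letters, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

print(preprocess("The company is a leading provider of Semantic Technology."))
```

Further steps such as stemming or lemmatization could be added in the same style, depending on what the later modeling stage needs.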

Monthly Challenge – Ontotext case – Week 1 – Mentor’s Approach

Posted in Classification systems

Monthly Challenge: https://www.datasciencesociety.net/events/text-mining-data-science-monthly-challenge/
Monthly Challenge Case: https://www.datasciencesociety.net/monthly-challenge-ontotext-case/
Mentors’ Weekly Instructions: https://www.datasciencesociety.net/text-mining-data-science-monthly-challenge/
Week 1. Data Understanding + Feature Extraction
At this stage, participants will have to use some visualization techniques to get familiar with the available ‘basic’ features (raw data). This phase is crucial because the observations and hypotheses made here will determine what techniques will be appropriate when building […]

Data Visualization with Python.

Posted in Learn

It is said that a picture is worth a thousand words. This article focuses on data visualization with Python and introduces the most popular data visualization libraries, textbooks, and courses available. Data visualization is a very important and often overlooked part of the process of asking the right question, getting the required data, […]

Monthly Challenge – Ontotext case – Solution – Team epistemi

Posted in Classification systems (4 Comments)

Week 1. Data Understanding + Feature Extraction
We have been provided with a dataset of 277419 records extracted from DBpedia. It has 6 features and 1 label with 32 classes, as summarised in the table below:
ID | Item Name | Item Type | Observations
1 | org | Feature | URL; may be useful for extraction of additional […]