NLP

Datathon – HackNews – Solution – TruthFinders

0
votes

Team Members

Tariq Alhindi ([email protected])

Christopher Hidey ([email protected])

Tuhin Chakrabarty ([email protected])

 

Business Understanding

Automatic Detection of propaganda is essential to build tools that can assist people to navigate the web with more awareness of deliberate or indeliberate messages of what they read.

 

Data Understanding

50000 articles for task 1

21000 sentences for task 2

 

Data Preparation

No preprocessing or feature-engineering done.

 

Modeling

We experimented with deep learning approaches

Task1: BiLSTM with sizes 32, 64, 128 with and without max-pooling. We ended up using a BiLSTM of 128 with max-pooling as it got the highest F1 score on the train-dev set.

Task2: We experimented with ULMfit and BERT models for sentence classification and ended up using BERT.

Evaluation

F1 score on the propaganda class was used to compare between different experiments

Share this

Leave a Reply