OffensEval 2020 - Turkish

Organized by cagri - Current server time: Jan. 21, 2021, 4:03 p.m. UTC

First phase

Sub-task A
Feb. 20, 2020, 12:01 a.m. UTC


Competition Ends
March 5, 2020, 11:59 p.m. UTC
Dataset and detailed task description available at:
This is the competition site for the second edition of OffensEval organized at SemEval 2020 (Task 12). SemEval 2020 will take place on September 13 and 13, 2020 co-located with COLING in Barcelona, Spain. 
This year OffensEval's title is Multilingual Offensive Language Identification in Social Media.
The first OffensEval was organized at SemEval 2019OffensEval 2019 used the Offensive Language Identification dataset (OLID) a dataset containing English tweets annotated using a hierarchical three-level annotation model described in this paperNearly 800 teams signed up to participate in OffensEval 2019. The competition received more than 100 submissions across three sub-tasks.The findings are described in the OffensEval 2019 reportThe response received in 2019 by far exceeded our expectations and motivated us to organize OffensEval 2020. 

Evaluation Criteria

Classification systems will be evaluated using the macro-averaged F1-score.

The unlabeled test set can be obtained here

Submission format

The prediction file format is a simple comma separated file, where each line consist of two fields: the document id and the label (either OFF or NOT). The ID field shoud match the ID's of the documents in the test file. The submitted CSV file should have exactly the same number (3528) of instances as the test file, but it shoud not have a haeder line. Please also include a README.txt file that briefly describes the approach used in the submission. Both files should be packaged together as a zip file and submitted through CodaLab. The file starting kit contains an example submission with a majority class baseline (all labels are OFF).

Terms and Conditions

The data in this competition, OffensEval 2020 - Turkish, is licensed under a Creative Commons Attribution licence (CC-BY). If you use this data, please acknowledge the following work:

A Corpus of Turkish Offensive Language on Social Media, Çağrı Çöltekin (2020), Proceedings of LREC

    title={A Corpus of Turkish Offensive Language on Social Media},
    author={\c{C}\"{o}ltekin, \c{C}a\u{g}r{\i}},
    booktitle={Proceedings of the 12th International Conference on Language Resources and Evaluation},

Sub-task A

Start: Feb. 20, 2020, 12:01 a.m.

Competition Ends

March 5, 2020, 11:59 p.m.

You must be logged in to participate in competitions.

Sign In