Welcome to the Subtask 2 of the MADAR Shared Task on Arabic Fine-Grained Dialect Identification, organized at The Fourth Arabic Natural Language Processing Workshop (WANLP 2019).The goal of Subtask 2 is to predict countries of Twitter users from 21 Arab Countries by using information about tweets posted by the Twitter users.
Systems will be evaluated using Macro Averaged F1-score.
Submission format information is available from the 'Participate' tab above.
The performance of submitted systems will be evaluated on
predictions of country labels for Twitter users in
IMPORTANT: Participants are NOT allowed to use
for training purposes. Participants must report the performance
of their best system on MADAR-Twitter-Subtask-2.DEV.user-label.tsv
in their Shared Task system description paper.
IMPORTANT: Participants can only use the ***text*** of the tweets
obtained through (MADAR-Obtain-Tweets.py) and the specific
information about the tweets provided in
Participants are NOT allowed to use additional tweets, nor
are they allowed to use outside information about the Twitter User.
Specifically -- participants should not use meta
data from Twitter about the users or the tweets, e.g.,
The training data from MADAR-Shared-Task-Subtask-1 is allowed.
External manually labelled data sets are *NOT* allowed.
However, the use of publicly available unlabelled data is allowed.
Copyright 2019 Carnegie Mellon University and New York University Abu
Dhabi. All Rights Reserved.
This work is licensed under the Creative Commons Attribution-NonCommercial-
NoDerivatives 4.0 International License.
If you use this resource, cite:
Bouamor, Houda, Sabit Hassan, Nizar Habash and Kemal Oflazer.
The MADAR Shared Task on Arabic Fine-Grained Dialect Identification.
In Proceedings of the Workshop for Arabic Natural Language Processing.
Florence, Italy, 2019.
Start: April 9, 2019, midnight
Start: May 6, 2019, midnight
May 14, 2019, midnight
You must be logged in to participate in competitions.Sign In