MADAR Shared Task- Subtask 1: MADAR Travel Domain Dialect Identification

Organized by sabithassan - Current server time: Jan. 16, 2019, 6:26 p.m. UTC

Previous

Development Phase
Jan. 1, 2019, midnight UTC

Current

Test Phase
Jan. 15, 2019, midnight UTC

End

Competition Ends
Never

Task

Welcome to the Subtask 1 of the MADAR Shared Task on Arabic Fine-Grained Dialect Identification, organized at The Fourth Arabic Natural Language Processing Workshop (WANLP 2019). In subtask 1, participants are provided with a large-scale collection of parallel sentences in the travel domain covering the dialects of 25 cities from the Arab World plus standard Arabic (MSA). The task is to build systems that predict a dialect class among one of the 26 labels (25+ MSA) for given sentences.

Dates

  • December 10, 2018: first announcement of the shared task
  • January 7, 2019: set up of shared task website
  • January 28, 2019: registration begins and release of initial training sets and scoring script
  • March 18, 2019: final training data release
  • April 29, 2019: registration deadline
  • May 6, 2019: test set available
  • May 13, 2019: systems' outputs collected
  • May 20, 2019: system results due to participants
  • May 27, 2019: shared task system papers due
  • June 10, 2019: reviews due
  • June 17, 2019: notification of acceptance
  • June 24, 2019: camera-ready version of shared task system papers due
  • August 1, 2019: ACL 2019 Workshop in Florence

Task Organisers

  • Houda Bouamor (Fortia Financial Solutions, France)
  • Sabit Hasan (Carnegie Mellon University Qatar, Qatar)
  • Nizar Habash (New York University Abu Dhabi, UAE)

For any questions related to this subtask, please post to this google group, or contact the organizers directly using the following email address: madar.shared.task@gmail.com 

References

  • Bouamor, H., Habash, N., Salameh, M., Zaghouani, W., Rambow, O., et al. (2018). The MADAR Arabic Dialect Corpus and Lexicon. In Proceedings of the 11th International Conference on Language Resources and Evaluation. (PDF)
  • Salameh, M., Bouamor, H. and Habash, N. (2018). Fine-Grained Arabic Dialect Identification. In Proceedings of the 27th International Conference on Computational Linguistics. (PDF)

Evaluation Criteria

Systems will be evaluated using Macro Averaged F1-score.

Submission format information is available from the 'Participate' tab above.

 

License for the MADAR Shared Task Dataset

Copyright 2018 New York University Abu Dhabi and Carnegie Mellon University Qatar. All Rights Reserved. A license to use and copy this software, data and its documentation solely for your internal research and evaluation purposes, without fee and without a signed licensing agreement, is hereby granted upon your download of the software, through which you agree to the following: 1) the above copyright notice, this paragraph and the following three paragraphs will prominently appear in all internal copies and modifications; 2) no rights to sublicense or further distribute this software are granted; 3) no rights to modify this software are granted; and 4) no rights to assign this license are granted. Please Contact the Office of Industrial Liaison, New York University, One Park Avenue, 6th Floor, New York, NY 10016 (212) 263-8178, for commercial licensing opportunities, or for further distribution, modification or license rights.

The dataset was created under the Multi-Arabic Dialect Applications and Resources project (MADAR) -- Lead PI Nizar Habash, Co-Lead PI Kemal Oflazer, and PI Houda Bouamor. This dataset was made possible by grant NPRP 7-290-1-047 from the Qatar National Research Fund (a member of the Qatar Foundation).

 

IN NO EVENT SHALL NYU, OR ITS EMPLOYEES, OFFICERS, AGENTS OR TRUSTEES ("COLLECTIVELY "NYU PARTIES") BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES OF ANY KIND , INCLUDING LOST PROFITS, ARISING OUT OF ANY CLAIM RESULTING FROM YOUR USE OF THIS SOFTWARE, DATA AND ITS DOCUMENTATION, EVEN IF ANY OF NYU PARTIES HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH CLAIM OR DAMAGE.

NYU SPECIFICALLY DISCLAIMS ANY WARRANTIES OF ANY KIND REGARDING THE SOFTWARE and DATA, INCLUDING, BUT NOT LIMITED TO, NON-INFRINGEMENT, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, OR THE ACCURACY OR USEFULNESS, OR COMPLETENESS OF THE SOFTWARE. THE SOFTWARE AND ACCOMPANYING DOCUMENTATION, IF ANY, PROVIDED HEREUNDER IS PROVIDED COMPLETELY "AS IS". REGENTS HAS NO OBLIGATION TO PROVIDE FURTHER DOCUMENTATION, MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.

Development Phase

Start: Jan. 1, 2019, midnight

Test Phase

Start: Jan. 15, 2019, midnight

Competition Ends

Never

You must be logged in to participate in competitions.

Sign In