For this project, your goal is to improve upon a transformer baseline on a simplified version of the WMT21 Machine Translation using Terminologies task.
The goal of this project is to explore methods that incorporate terminologies into either the training or the inference process, in order to improve both the accuracy and consistency of MT systems.
We consider the English-to-French translation task, and evaluation is performed on the TICO-19 dataset, which is part of the overall evaluation for the task in WMT21.
To help you focus on developing interesting methods, we provide a baseline system. To get started, please visit the Github repo.
To simplify the submission process, we accept the prediction output files in the format provided by the baseline code.
To submit a result, make a zip file containing the two files, "predict_results.json" and "eval_results.json" upload the file from the Participate tab.
"predict_results.json" should contain your test bleu, for example:
"eval_results.json" should contain your dev bleu, for example:
Please submit the actual output result of your code only. For reproducibility, we will require code submissions for all final solutions.
Start: June 30, 2013, midnight
Never
You must be logged in to participate in competitions.
Sign In