Triangular MT: Using English to improve Russian-to-Chinese machine translation Forum

Go back to competition Back to thread list Post in this thread

> Question about using public data

Hi Ajay,

I have a question about using public data:

1. Can we use public data (e.g., OPUS, CC25) for pre-training language model (e.g., BERT, XLM)?

Thanks for your help. Have a nice day.

Cheers,
Jeonghyeok Park

Posted by: JeonghyeokPark @ June 8, 2021, 4:18 a.m.

Hi Jeonghyeok,

The rules forbid the use any data other than what we released as part of the competition (even if they are publicly available).

Regards,
Ajay

Posted by: ajaynagesh @ June 11, 2021, 1 p.m.
Post in this thread