In the competition description, I can see: "With this competition, we encourage researcher to extract symbols and descriptions in scientific documents. The data contains documents from 5 domains including: math, physics, biology, computer science, and economy".

Now, my question is: will the test set be split by topic, with each topic explicitly identified (for example, in separate files), or will the test data come as a mixed batch where we do not explicitly know the topic of each input text?


Posted by: bofoghi @ Nov. 24, 2021, 6:06 a.m.

Thanks for your question.

The test data will come in separate files for each topic, with the same files and format as the training data.

Posted by: laiviet @ Dec. 14, 2021, 5:50 p.m.
