Will there be a human evaluation to further evaluate the results for subtaskC? As the information on this website said: "To improve the reliability of the evaluation of Task C, we use a random subset of the test set and will do a human evaluation to further evaluate the systems with relatively high BLEU score." (Learn the Details -> Evaluation)
Posted by: huangxt39 @ March 17, 2020, 8:24 a.m.Yes. We already shared the results on Google Sheet at http://bit.ly/semeval2020-task4-results
Posted by: Shuailong @ March 17, 2020, 8:26 a.m.