Submissions are evaluated on the target test set. We report top-1 and top-5 accuracy on the target test set; accuracy on the source test set is reported for reference only.
To submit your results to the leaderboard, you must construct a submission zip file containing a single file, test.json, which holds the model’s results on the target and source test sets. This file should follow the format detailed in the subsequent section.
The JSON submission format is composed of a single JSON object containing entries for every action in the test set. Specifically, the JSON file should contain:
- 'version' property, set to '0.2'.
- 'challenge' property, which can assume the following values, depending on the challenge: ['domain_adaptation'].
- sls properties (see Supervision Levels Scale (SLS) page for more details):
  - sls_pt: SLS Pretraining level.
  - sls_tl: SLS Training Labels level. Note: this refers to the source domain only. No annotations are allowed in the target domain.
  - sls_td: SLS Training Data level.
- 'results_target' object containing entries for every action in the target test set (e.g. 'P01_101_0' is the first narration ID in the target test set).
- 'results_source' object containing entries for every action in the source test set (e.g. 'P01_11_0' is the first narration ID in the source test set).
Each action segment entry is a nested object composed of two entries: 'verb', specifying the class score for every verb class, and 'noun', specifying the score for every noun class. Action scores are automatically computed by applying softmax to the verb and noun scores and computing the probability of each possible action.
{
  "version": "0.2",
  "challenge": "domain_adaptation",
  "sls_pt": -1,
  "sls_tl": -1,
  "sls_td": -1,
  "results_target": {
    "P01_101_0": {
      "verb": {
        "0": 1.223,
        "1": 4.278,
        ...
        "96": 0.023
      },
      "noun": {
        "0": 0.804,
        "1": 1.870,
        ...
        "299": 0.023
      }
    },
    "P01_101_1": { ... },
    ...
  },
  "results_source": {
    "P01_11_0": {
      "verb": {
        "0": 1.223,
        "1": 4.278,
        ...
        "96": 0.023
      },
      "noun": {
        "0": 0.804,
        "1": 1.870,
        ...
        "299": 0.023
      }
    },
    "P01_11_1": { ... },
    ...
  }
}
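As a rough illustration of the structure above, the following Python sketch assembles the submission object and writes it to test.json. The helper names (make_entry, build_submission, write_submission) and the input layout (a mapping from narration ID to a pair of score lists) are assumptions made for this example and are not part of the challenge tooling. The class counts of 97 verbs and 300 nouns are inferred from the class ids shown in the example above.

```python
import json

NUM_VERBS = 97   # verb class ids "0".."96", inferred from the example above
NUM_NOUNS = 300  # noun class ids "0".."299", inferred from the example above


def make_entry(verb_scores, noun_scores):
    """Turn two score sequences into the nested 'verb'/'noun' entry."""
    if len(verb_scores) != NUM_VERBS or len(noun_scores) != NUM_NOUNS:
        raise ValueError("unexpected number of class scores")
    return {
        "verb": {str(i): float(s) for i, s in enumerate(verb_scores)},
        "noun": {str(i): float(s) for i, s in enumerate(noun_scores)},
    }


def build_submission(target_scores, source_scores, sls_pt, sls_tl, sls_td):
    """Build the top-level object.

    target_scores and source_scores are hypothetical inputs mapping a
    narration ID to a (verb_scores, noun_scores) pair.
    """
    return {
        "version": "0.2",
        "challenge": "domain_adaptation",
        "sls_pt": sls_pt,
        "sls_tl": sls_tl,
        "sls_td": sls_td,
        "results_target": {
            k: make_entry(v, n) for k, (v, n) in target_scores.items()
        },
        "results_source": {
            k: make_entry(v, n) for k, (v, n) in source_scores.items()
        },
    }


def write_submission(submission, path="test.json"):
    """Serialise the submission object to a JSON file."""
    with open(path, "w") as f:
        json.dump(submission, f)
```

The SLS values are passed in unchanged; choose them according to the Supervision Levels Scale page rather than copying the values from the example.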
If you wish to compute your own action scores, you can augment each segment submission with exactly 100 action scores under the key 'action':
{
  ...
  "results_target": {
    "P01_101_0": {
      "verb": {
        "0": 1.223,
        "1": 4.278,
        ...
        "96": 0.023
      },
      "noun": {
        "0": 0.804,
        "1": 1.870,
        ...
        "299": 0.023
      },
      "action": {
        "0,1": 1.083,
        ...
        "96,299": 0.002
      }
    },
    "P01_101_1": { ... },
    ...
  }
}
The keys of the 'action' object are of the form <verb_class>,<noun_class>.
You can provide scores in any float format that numpy is capable of reading (i.e. you do not need to stick to 3 decimal places).
If you do not provide your own action scores, we will compute them as
p(a = (v, n)) = p(v) * p(n)
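The default computation can be sketched in plain Python as follows: apply softmax to the verb and noun scores, multiply the resulting probabilities for every (verb, noun) pair, and key each result as <verb_class>,<noun_class>. The function names are hypothetical, and keeping the 100 most probable pairs is an assumption of this sketch; the text above only states that exactly 100 action scores are expected.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_action_scores(verb_scores, noun_scores, k=100):
    """Return the k most probable (verb, noun) pairs keyed as 'verb,noun'.

    Selecting the top k by probability is an assumption of this sketch.
    """
    p_verb = softmax(verb_scores)
    p_noun = softmax(noun_scores)
    # p(a = (v, n)) = p(v) * p(n) for every verb/noun combination
    pairs = [
        (pv * pn, v, n)
        for v, pv in enumerate(p_verb)
        for n, pn in enumerate(p_noun)
    ]
    pairs.sort(key=lambda t: t[0], reverse=True)
    return {f"{v},{n}": p for p, v, n in pairs[:k]}
```

The returned dictionary could then be stored under the 'action' key of a segment entry, alongside 'verb' and 'noun'.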
To upload your results to CodaLab you have to zip the test file into a flat zip archive (it can’t be inside a folder within the archive).
You can create a flat archive using the following command, provided the JSON file is in your current directory:
$ zip -j my-submission.zip test.json
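If you would rather build the archive from Python, for example at the end of an evaluation script, a minimal sketch using the standard zipfile module is shown below. The function name make_flat_zip is hypothetical; the archive name mirrors the command above.

```python
import zipfile


def make_flat_zip(json_path="test.json", zip_path="my-submission.zip"):
    """Write json_path into zip_path at the archive root (no folders)."""
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        # A fixed arcname drops any directory components of json_path,
        # which keeps the archive flat in the same way `zip -j` does.
        zf.write(json_path, arcname="test.json")
    return zip_path
```

Calling make_flat_zip() with the defaults expects test.json in the current directory and produces my-submission.zip next to it.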
Start: July 14, 2021, midnight
Description: 2021 Open Testing Phase - Unsupervised Domain Adaptation Challenge
Nov. 25, 2021, 11:59 p.m.