EPIC-KITCHENS-55 Action Recognition

Organized by willprice - Current server time: March 30, 2025, 11:21 a.m. UTC

EPIC-KITCHENS-55 Action Recognition (No Challenge)

June 28, 2020, midnight UTC

Current

EPIC-KITCHENS-55 Action Recognition (No Challenge)

June 28, 2020, midnight UTC

End

Competition Ends

May 29, 2020, 11:59 p.m. UTC

Overview
Evaluation
Terms and Conditions
Submission Format

EPIC-Kitchens Action Recognition Challenge - Phase 3 (Nov 2019 - May 2020 [challenge announcements in CVPR])

Welcome to the EPIC-Kitchens Action Recognition challenge. EPIC-Kitchens is an unscripted egocentric action dataset collected from 32 different people from 4 cities across the world.

Dataset details

55 hours of video
11.5M frames
39,594 total action segments
125 verb classes, 352 noun classes
28,472 training action segments
Seen kitchens test set - 8,047 action segments
Unseen kitchens test set - 2,929 action segments

Goal

Classify trimmed action segments from seen and unseen kitchens by action verb and noun.

Maximum number of submissions: 50 (1 per day max)

For further details about the challenge, please see Sec. 4 of [1].

References

[1] D. Damen, H. Doughty, G. M. Farinella, S. Fidler, A. Furnari, E. Kazakos, D. Moltisanti, J. Munro and T. Perrett, W. Price, M. Wray, Scaling Egocentric Vision: The EPIC-KITCHENS Dataset, In European Conference on Computer Vision 2018.

Evaluation Criteria

Submissions are evaluated across 2 test sets:

Seen kitchens (kitchens that have action segments in the training set)
Unseen kitchens (kitchens that have no action segments in the training set)

We evaluate model performance across two sets of metrics:

Aggregate

These metrics are micro-averaged giving weight to each class proportionately to their frequency in the test set under evaluation.
- Top-1 accuracy
- Top-5 accuracy
Per-class

These metrics are macro-averaged giving equal weight to all classes regardless of their prevalence. We compute these for many-shot classes only: these are verb/noun classes that appear more than 100 times in training, and in the case of actions the cross product between the many shot verb and many shot noun classes given that the action appears at least once in training.

Many shot classes can be found on github for verbs, nouns, and actions .
- Precision
- Recall

Terms and Conditions

You agree to us storing your submission results for evaluation purposes.
You agree that if you place in the top-10 at the end of the challenge you will submit your code so that we can check for cheating.
You agree not to distribute the EPIC-Kitchens dataset without prior written permission.

Submissions

To submit your results to the leaderboard you must construct a submission zip file containing two files:

seen.json - Model inference on the seen kitchens test set (S1)
unseen.json - Model inference on the unseen kitchens test set (S2)

Both of these files follow the same format detailed below:

JSON Submission Format

The JSON submission format is composed of a single JSON object containing entries for every action in the test set. Specifically, the JSON file should contain:

a 'version' property, set to '0.1' (the only supported version so far);
a 'challenge' property, which can assume the following values, depending on the challenge: ['action_recognition', 'action_anticipation'];
a 'results' object containing entries for every action in the test set (e.g. '1924' is the first action ID in the seen test set).

Each action segment entry is a nested object composed of two entries: 'verb', specifying the class score for every verb class and the other, 'noun' specifying the score for every noun class. Action scores are automatically computed by applying softmax to the verb and noun scores and computing the probability of each possible action.

{
  "version": "0.1",
  "challenge": "action_recognition",
  "results": {
    "1924": {
      "verb": {
        "0": 1.223,
        "1": 4.278,
        ...
        "124": 0.023
      },
      "noun": {
        "0": 0.804,
        "1": 1.870,
        ...
        "351": 0.023
      }
    },
    "1925": { ... },
    ...
  }
}

If you wish to compute your own action scores, you can augment each segment submission with exactly 100 action scores with the key 'action'

{
  "version": "0.1",
  "challenge": "action_recognition",
  "results": {
    "1924": {
      "verb": {
        "0": 1.223,
        "1": 4.278,
        ...
        "124": 0.023
      },
      "noun": {
        "0": 0.804,
        "1": 1.870,
        ...
        "351": 0.023
      },
      "action": {
        "0,1": 1.083,
        ...
        "124,351": 0.002
      }
    },
    "1925": { ... },
    ...
  }
}

The keys of the action object are of the form <verb_class>,<noun_class>.

You can provide scores in any float format that numpy is capable of reading (i.e. you do not need to stick to 3 decimal places).

If you fail to provide your own action scores we will compute them by

Obtaining softmax probabilites from your verb and noun scores
Find the top 100 action probabilities where p(a = (v, n)) = p(v) * p(n)

Submission archive

To upload your results to CodaLab you have to zip both files into a flat zip archive (they can’t be inside a folder within the archive).

You can create a flat archive using the command providing the JSON files are in your current directory.

$ zip -j my-submission.zip seen.json unseen.json

EPIC-KITCHENS-55 Action Recognition (No Challenge)

Start: June 28, 2020, midnight

Description: Open Submission

Competition Ends

May 29, 2020, 11:59 p.m.

You must be logged in to participate in competitions.

Competition

EPIC-KITCHENS-55 Action Recognition

Previous

Current

End

EPIC-Kitchens Action Recognition Challenge - Phase 3 (Nov 2019 - May 2020 [challenge announcements in CVPR])

Dataset details

Goal

References

Evaluation Criteria

Aggregate

Per-class

Terms and Conditions

Submissions

JSON Submission Format

Submission archive

EPIC-KITCHENS-55 Action Recognition (No Challenge)

Competition Ends