Automatically describing open-domain videos using rich natural sentences is among the most challenging tasks of computer vision, natural language processing and machine learning. To stimulate research on this topic, we propose the Large Scale Movie Description (LSMDC) Challenge, which features a unified version of the recently published large-scale movie datasets (M-VAD and MPII-MD). More information about the datasets can be found here.
In this challenge the task is to generate single sentence descriptions of individual video clips. The challenge consists of two phases: public test set evaluation and blind (where we will not provide the sentence descriptions) test set evaluation.
Other related challenges:
To participate, you should first create an account on CodaLab. In order to submit your results, please, perform these steps:
Note, that we allow up to 5 submissions per day / 100 in total for the public test phase and 1 submision per day / 5 in total for the blind test phase.
We thank the "Microsoft COCO Image Captioning Challenge" organizers for sharing the evaluation code.
The MS COCO Caption Evaluation API is used to evaluate results. The software uses both candidate and reference captions, applies sentence tokenization, and output several performance metrics including BLEU-1, BLEU-2, BLEU-3, BLEU-4, ROUGE-L, METEOR and CIDEr-D. More details can be found in the paper Microsoft COCO Captions: Data Collection and Evaluation Server.
Winners will be selected based on a human evaluation of submissions on the blind test set (second phase of the challenge).
Start: Aug. 1, 2017, 5:48 p.m.
Description: Public Test set
Start: Aug. 1, 2017, 5:48 p.m.
Description: Blind Test set
Oct. 1, 2017, 12:58 p.m.
You must be logged in to participate in competitions.
Sign In# | Username | Score |
---|---|---|
1 | ElTanque | 0.134 |
2 | danieljf24 | 0.168 |
3 | arohrbach | 0.163 |