dataset link: https://github.com/BinaLab/FloodNet-Challenge-EARTHVISION2021
this link fails to download all the data that is mentioned in the description
Track 2 was downloaded completely (5 dirs of 2gb each)
Track 1 wasn't (4 dirs 2gb each downloaded, 1 dir 2gb didn't after multiple attempts)
The numbers in the brackets are the actual ones that we get after extracting the downloaded folders:
## Checking the number of data points (Track 1)
- The whole dataset has 2343 images, divided into training (~60%), validation (~20%), and test (~20%) sets.
- 1343 train (1015)
- 500 valid (230)
- 500 test
- total: 2343 (1245)
## Checking the number of data points (Track 2)
- This track contains a total of 4511 image-question pairs in the training set.
- This track contains a total of 1415 image-question pairs in the validation set.
- 4511 train (1397 images, qns 4511)
- 1415 valid (450 images, qns 1415)
I just checked the google drive. I have found the exact number of images mentioned in the dataset description for track 1 (train: labeled+flooded= 51, labeled+non-flooded= 347, unlabeled= 1047, total = 1445, validation: 450). Please try to download again. Sorry for your inconvenience.
Thank you!
Posted by: binalab @ April 5, 2021, 12:04 a.m.