The task is to perform video action classification on the Breakfast actions dataset. This dataset includes 1712 videos and shows activities related to breakfast preparation.
Run read_datasetBreakfast.py
to process the raw i3D data files. It stores the segments (containing i3D frames) and the corresponding labels into binary pickle
files.
python read_datasetBreakfast.py
Outputs:
raw_training_data.p
: Training data with 7075(segment, label)
tuples.unsorted_training_data.p
: Training data containing 80% ofraw_training_data.p
(5660 samples).training_data.p
: Final training dataset with(segment, label)
sorted in increasing order of segment lengths.validation_data.p
: Validation data with(segment, label)
containing 20% ofraw_training_data.p
(1415 samples).testing_data.p
: Test data with only segments and no labels (1536, but should be 1284).
-
data
: The video data can be downloaded from here. It contains I3D features that are computed for each frame. -
groundTruth/
: The actual action labels for the training data video frames. -
splits/
: Train-test split of the videos. Also contains mapping from action IDs to action names. -
training_data.dat
: Created byread_datasetBreakfast.py
. Contains tuples of video data and action labels, written using thepickle
library.