
Dataset

The DatasetFromCSV class is designed to load video data according to a CSV file. This README explains how to use the dataset class and how to convert annotation formats for new datasets. To make this concrete, we walk through a complete data processing procedure using an open-sourced dataset as an example.

Usage

To use the DatasetFromCSV class, follow these steps:

  1. Import the necessary modules:

    import torch
    from data.datasets import DatasetFromCSV
  2. Initialize the dataset:

    dataset = DatasetFromCSV(
        csv_path='path/to/your/csvfile.csv',
        data_root='path/to/data/root',
        transform=None,  # or provide your own transform functions
        resolution=(256, 256),
        num_frames=16,
        frame_interval=1,
        train=True,
        split_val=False
    )
  3. Use the dataset with a DataLoader:

    from torch.utils.data import DataLoader
    
    dataloader = DataLoader(dataset, batch_size=4, shuffle=True)
    for batch in dataloader:
        videos = batch['video']
        captions = batch['caption']
        # Your training or validation code here

Annotation Format

The CSV file should have the following columns:

  • Basic format:

    path, caption
    path/to/video1, caption1
    path/to/video2, caption2
    ...
    
  • Extended format with additional metadata (for multi-resolution training):

    path, caption, fps, frames, height, width
    path/to/video1, caption1, 30, 100, 512, 512
    path/to/video2, caption2, 30, 50, 1080, 512
    ...
    

Ensure that the paths in the CSV file are either absolute or relative to the data_root provided during initialization.

In addition, to support a new dataset, you need to convert your annotations to the required CSV format.
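
As a rough illustration, the sketch below converts a hypothetical JSONL annotation file with video and caption fields (these field names are assumptions, not part of this repo) into the basic CSV format described above:

    import csv
    import json

    # Hypothetical input: one JSON object per line, e.g.
    # {"video": "clips/000001.mp4", "caption": "A dog runs on the beach."}
    # Adjust the field names to match your own annotation files.
    with open('annotations.jsonl') as fin, open('annotations.csv', 'w', newline='') as fout:
        writer = csv.writer(fout)
        writer.writerow(['path', 'caption'])
        for line in fin:
            item = json.loads(line)
            writer.writerow([item['video'], item['caption']])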

Important Considerations

  • Transform Functions: If no transform functions are provided, default transforms for video and image data are used. Ensure that any custom transform you pass is compatible with the data format (see the sketch after this list).
  • Resolution and Frame Settings: The resolution, num_frames, and frame_interval arguments control the spatial size, number, and spacing of the frames sampled from each video; set them according to your requirements.
  • Training and Validation Split: If split_val is set to True, the dataset is split into training and validation subsets. Ensure that the train parameter is set correctly to indicate whether the instance is used for training or validation.
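
For reference, here is a minimal sketch of passing a custom transform and building matching train/validation instances. It assumes the transform is a torchvision-style callable applied to the sampled clip tensor; check data/datasets.py for the exact interface DatasetFromCSV expects.

    from torchvision import transforms
    from data.datasets import DatasetFromCSV

    # Assumption: the transform is applied to the sampled clip tensor;
    # verify the expected input shape in data/datasets.py.
    custom_transform = transforms.Compose([
        transforms.Resize((256, 256)),
        transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),
    ])

    # With split_val=True, toggle `train` to obtain the two subsets.
    train_set = DatasetFromCSV(
        csv_path='path/to/your/csvfile.csv',
        data_root='path/to/data/root',
        transform=custom_transform,
        resolution=(256, 256),
        num_frames=16,
        frame_interval=1,
        train=True,
        split_val=True,
    )
    val_set = DatasetFromCSV(
        csv_path='path/to/your/csvfile.csv',
        data_root='path/to/data/root',
        transform=custom_transform,
        resolution=(256, 256),
        num_frames=16,
        frame_interval=1,
        train=False,
        split_val=True,
    )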

ToyDataset

You can download the toy dataset from either OneDrive or Google Drive.

The directory structure should be:

VideoTuna/
    ├── Dataset/
        ├── DATASET1
        ├── ToyDataset/
            ├── toydataset.csv
            ├── ToyDataset/
                ├── video1.mp4
                ├── video2.mp4
                ...

The toydataset.csv file contains the path and caption of each video.

path, caption
Dataset/ToyDataset/videos/QNa4eMNKYwk_2.mp4,A woman with a patch on her eye next to a park bench.
Dataset/ToyDataset/videos/J4FJGUugMHM_0.mp4,A group of children and adults are playing with lego toys at an exhibition.
Dataset/ToyDataset/videos/jPu8eStQrnE_2.mp4,A potted plant is growing in a glass container.
...
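
Before training on the toy dataset, it can be useful to sanity-check that every path listed in the CSV resolves on disk. A minimal sketch, assuming you run it from the VideoTuna/ root so that the relative paths resolve:

    import csv
    import os

    # Assumption: executed from the VideoTuna/ root directory.
    with open('Dataset/ToyDataset/toydataset.csv') as f:
        rows = list(csv.DictReader(f, skipinitialspace=True))

    missing = [row['path'] for row in rows if not os.path.exists(row['path'])]
    print(f'{len(rows)} clips listed, {len(missing)} missing')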

An Example with an Open-sourced Dataset

Here we use Vript to illustrate the whole procedure of preparing a dataset.

  1. Download Vript from HuggingFace:

    huggingface-cli download \
        --resume-download Mutonix/Vript \
        --local-dir path/to/Vript \
        --local-dir-use-symlinks False
  2. Unzip the data:

    cd {PROJECT}
    python tools/unzip_vript.py --output_dir path/to/Vript/vript_short_videos_clips_unzip --zip_folder path/to/Vript/vript_short_videos_clips
  3. Generate annotations:

    python tools/vript_anno_converter.py --input_path path/to/Vript/vript_captions/vript_short_videos_captions.jsonl --output_path data/vript_short_videos_captions.csv --video_root path/to/Vript/vript_short_videos_clips_unzip
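
Once the CSV has been generated, it can be plugged into DatasetFromCSV like any other annotation file. A minimal sketch, with paths mirroring the commands above (adjust them to your setup):

    from torch.utils.data import DataLoader
    from data.datasets import DatasetFromCSV

    vript_set = DatasetFromCSV(
        csv_path='data/vript_short_videos_captions.csv',
        data_root='path/to/Vript/vript_short_videos_clips_unzip',
        transform=None,  # or provide your own transform functions
        resolution=(256, 256),
        num_frames=16,
        frame_interval=1,
        train=True,
        split_val=False,
    )
    dataloader = DataLoader(vript_set, batch_size=4, shuffle=True)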

By following the above steps, you can easily integrate Vript into our framework and train your own text-to-video models.