IFR+ Description

Improved version of IFRNet model: combines both promising approaches like optical flow based backward warping and kernel-based image refining. The proposed network architecture allowed to beat SOTA on Vimeo-triplet dataset.

IFRNet Description (CVPR2022)

Prevailing video frame interpolation algorithms, that generate the intermediate frames from consecutive inputs, typically rely on complex model architectures with heavy parameters or large delay, hindering them from diverse real-time applications. In this work, we devise an efficient encoder-decoder based network, termed IFRNet, for fast intermediate frame synthesizing. It first extracts pyramid features from given inputs, and then refines the bilateral intermediate flow fields together with a powerful intermediate feature until generating the desired output. The gradually refined intermediate feature can not only facilitate intermediate flow estimation, but also compensate for contextual details, making IFRNet do not need additional synthesis or refinement module. To fully release its potential, we further propose a novel task-oriented optical flow distillation loss to focus on learning the useful teacher knowledge towards frame synthesizing. Meanwhile, a new geometry consistency regularization term is imposed on the gradually refined intermediate features to keep better structure layout. Experiments on various benchmarks demonstrate the excellent performance and fast inference speed of proposed approaches.

Paper: IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation

Original github repository

Dataset

This work uses the Vimeo-triplet dataset. The dataset consists of 73,171 3-frame sequences with a fixed resolution of 448 x 256, extracted from 15K selected video clips from Vimeo-90K. This dataset is designed for temporal frame interpolation.

Download link.

Environmental requirements

GPU

Hardware (GPU)
- Prepare hardware environment with GPU processor
Framework
- MindSpore
For details, see the following resources:
- MindSpore Tutorial
- MindSpore Python API
Additional python packages:
- Install additional packages manually or using pip install -r requirements.txt command in the model directory.

Ascend 910

Hardware (Ascend)
- Prepare hardware environment with Ascend 910 (cann_6.0.0, euler_2.8, py_3.7)
Framework
- MindSpore Ascend 1.9.0 or later

Performance

Training Performance

Parameters	IFR+ (4xGPU)
Model Version	IFR+
Resources	4x Nvidia V100
Uploaded Date	05 / 26 / 2023 (month/day/year)
MindSpore Version	2.0.0.RC1
Dataset	Vimeo-triplet
Training Parameters	batch_size=6, 100 epochs
Optimizer	AdamW
Speed	1.89 s/step
Total time	4d 17h 11m
Checkpoint for Fine tuning	413.0 MB (.ckpt file)

Evaluation Performance

Parameters	IFR+ (1xNPU, CANN)
Model Version	IFR+
Resources	1x Ascend 910A
Uploaded Date	05 / 26 / 2023 (month/day/year)
MindSpore Version	2.0.0.20221118
Datasets	Vimeo-triplet
Batch_size	1
Inference speed, s	0.15 (448x256)
Inference speed, s	2.00 (1920x1024)
PSNR metric (exported)	36.54
SSIM metric (exported)	0.9710

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ifr_plus.md

ifr_plus.md

Contents

IFR+ Description

IFR+ Description

IFRNet Description (CVPR2022)

Dataset

Environmental requirements

GPU

Ascend 910

Performance

Training Performance

Evaluation Performance

Files

ifr_plus.md

Latest commit

History

ifr_plus.md

File metadata and controls

Contents

IFR+ Description

IFR+ Description

IFRNet Description (CVPR2022)

Dataset

Environmental requirements

GPU

Ascend 910

Performance

Training Performance

Evaluation Performance