DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations

This repo is the official implementation of the AAAI 2024 paper "DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations"

DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations

Requirements

torch == 1.7.1+cu101

numpy == 1.19.2

opencv-python == 4.5.1.48

Data Preparation

The structure of the training data is shown below:

├── Hybrid/

│ ├── Degraded/

│ │ ├── Blur/

│ │ ├── Noise/

│ │ ├── Shadow/

│ │ ├── Watermark/

│ │ ├── WithBack/

To generate the training dataset, run:

python generate_training_dataset.py (Coming soon)

Or download from: Pre-training Dataset (21.5G)

Train & Test

We control our hyper-parameters, such as batch size or learning rate, through exclusive yaml files. They are stored in the options folder. For pre-training, fine-tuning and testing, you should specify an appropriate yaml file. We have provided a sample file in the options folder.

Pre-train

Edit ./options/pretrain.yml
python pretrain.py

Fine-tune

Edit ./options/finetune.yml
python finetune.py

Test

Edit ./options/test.yml
python test.py

Note that the terminal output during the PSNR test is meaningless. In the next step we will evaluate the output images using the standard skimage.metrics.

Model Zoo

Pretrained Model	Pretrained Model
Asymmetric Comparison	One Drive
Symmetric Comparison	One Drive

Acknowledge

Our work is based on the following theoretical works:

and we are benefiting a lot from the following projects:

Citation

@inproceedings{wang2024docnlc,
  title={DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations},
  author={Wang, Ruilu and Xue, Yang and Jin, Lianwen},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={38},
  number={6},
  pages={5563--5571},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
TestImages		TestImages
data		data
metrics		metrics
models		models
options		options
scripts/back_projection		scripts/back_projection
utils		utils
README.md		README.md
finetune.py		finetune.py
pre_train.py		pre_train.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations

Requirements

Data Preparation

Train & Test

Pre-train

Fine-tune

Test

Model Zoo

Acknowledge

Citation

About

Releases

Packages

Languages

RylonW/DocNLC

Folders and files

Latest commit

History

Repository files navigation

DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations

Requirements

Data Preparation

Train & Test

Pre-train

Fine-tune

Test

Model Zoo

Acknowledge

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages