From 95e95d47861ca40dac556d87b19eac6077fab5a2 Mon Sep 17 00:00:00 2001 From: benoit74 Date: Tue, 3 Sep 2024 20:23:54 +0200 Subject: [PATCH] Add warning disclaimer about current scraper status --- README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.md b/README.md index 966a80b..38afdf8 100644 --- a/README.md +++ b/README.md @@ -11,6 +11,9 @@ storing content for offline usage. [![PyPI - Python Version](https://img.shields.io/pypi/pyversions/gutenberg2zim.svg)](https://pypi.org/project/gutenberg2zim/) [![Docker](https://ghcr-badge.deta.dev/openzim/gutenberg/latest_tag?label=docker)](https://ghcr.io/openzim/gutenberg) +> [!WARNING] +> This scraper is now known to have a serious flaw. A critical bug https://github.com/openzim/gutenberg/issues/219 has been discovered which leads to incomplete archives. Work on https://github.com/openzim/gutenberg/issues/97 (complete rewrite of the scraper logic) now seems mandatory to fix these annoying problems. We however currently miss the necessary bandwidth to address these changes. Help is of course welcomed, but be warned this is going to be a significant project (at least 10 man.days to change the scraper logic so that we can fix the issue I would say, so probably the double since human is always bad at estimations). + ## Coding guidelines Main coding guidelines comes from the [openZIM Wiki](https://github.com/openzim/overview/wiki)