Fix tests - streamline testing + multiprocessing backup #179

hadrilec · 2023-12-26T10:46:17Z

No description provided.

tfardet

I think there is a bug in the multiprocessing code (and I don't have time to debug it), please just wrap the original code in the try/except rather than the new one with the func_settings and func, especially as this will eventually go away with #182 anyway.

Otherwise we're back at 3-hour tests.

tfardet · 2023-12-27T07:08:39Z

pynsee/geodata/_get_geodata.py

+            length = len(list_bbox)
+            irange = range(length)

-            data_all = pd.concat(list_data).reset_index(drop=True)
+            func_settings = _set_global_var
+            func = _get_data_with_bbox2

+            try:
+                with multiprocessing.Pool(
+                    initializer=func_settings, initargs=(args,), processes=Nprocesses
+                ) as pool:
+                    list_output = list(
+                        tqdm.tqdm(
+                            pool.imap(func, irange),
+                            total=length
+                        )
+                    )
+            except Exception:
+                func_settings(args)
+                list_output = []
+
+                for p in tqdm.trange(length):
+                    list_output.append(func(p))
+
+                msg = """
+                Multiprocessing failed in the geodata collection,
+                a traditional loop was used instead
+                """
+                logger.warning(msg)
+
+            data_all = pd.concat(list_output).reset_index(drop=True)


Do not include these changes; there is a bug somewhere that leads to the 4h-long tests.

tfardet · 2023-12-27T07:09:03Z

tests/macrodata/test_pynsee_macrodata.py

+            #     df = _build_series_list()
+            #     test = isinstance(df, pd.DataFrame)
+            #     os.environ['pynsee_use_sdmx'] = "False"
+            #     self.assertTrue(test)


Uncomment or remove.

tfardet · 2023-12-27T07:09:10Z

tests/macrodata/test_pynsee_macrodata.py

pynsee/geodata/_get_geodata.py

hadrilec · 2023-12-30T10:43:28Z

@tfardet, shall I merge?

tfardet · 2023-12-30T17:22:38Z

@hadrilec yes. The tests still took 3h, for reasons that I don't understand, but that already seems to be the case for the last 3 commits on master, and checking the coverage showed that the multiprocessing code was used, not the except block.

Could you first merge master into it and fix the conflicts? (since you are using a branch on the pynsee account, I can't do it myself)

hadrilec and others added 4 commits November 19, 2023 18:17

multiprocessing backup solution if it fails

845af1e

Fix bare exception

3c2e92d

tests only on py 3.8+3.11, geopandas version fix

35761cb

normalize tests

8cfb62e

InseeFrLab deleted a comment from codecov-commenter Dec 27, 2023

tfardet requested changes Dec 27, 2023

InseeFrLab deleted a comment from codecov-commenter Dec 27, 2023

hadrilec added the invalid and dont merge labels and removed the invalid label Dec 27, 2023

tfardet reviewed Dec 27, 2023

pynsee/geodata/_get_geodata.py (outdated, resolved)

Test simply wrapping original code

202b643

InseeFrLab deleted a comment from codecov-commenter Dec 28, 2023

tgrandje mentioned this pull request Aug 7, 2024

Multiprocessing sometimes freezes get_geodata #203

Open
