`which_package` to yield unique collection of package records #5108

kenodegard · 2023-12-12T03:21:13Z

Description

The refactor of which_package accidentally resulted in the same package record being returned multiple times (once for each file collision). This results in misleading build warnings.

Instead we only want to return a unique set of package records.

Resolves #5106

Checklist - did you ...

Add a file to the news directory (using the template) for the next release's release notes?
Add / update necessary tests?
~~Add / update outdated documentation?~~

tests/test_inspect_pkg.py

mbargull

gh-5041 actually had two related regressions:

The one fixed here.
conda_build.post._lookup_in_prefix_packages is now printing a List[PrefixRecord] which means it prints repr(PrefixRecord) that have all PathData etc. expanded (leading to very verbose logs with sometimes >1M characters per line).

_lookup_in_prefix_packages needs to be changed to print(..., [str(record) for record in precs_in_reqs], ...) or the like.

news/5108-fix-which_package

tests/test_inspect_pkg.py

dholth · 2023-12-14T17:50:43Z

conda_build/inspect_pkg.py

-        for file in prec["files"]:
-            if samefile(prefix / file, path):
-                yield prec
+        if any(samefile(prefix / file, path) for file in prec["files"]):


(n * m) ? I see this version skips work if true.

Too bad to iterate over all installed files to do the search

worst case yes, same as before

any returns on the first True: https://docs.python.org/3/library/functions.html#any

This is what avoids the duplicated records to begin with, right?

In a subsequent PR, we could see if implementing a cached path -> artifact map would speed this up. It can be a veeery slow part of the post processing stage in conda-build.

This is what avoids the duplicated records to begin with, right?

correct, this is essentially what was happening pre #5041:

conda-build/conda_build/inspect_pkg.py

Lines 57 to 62 in c71c4ab

for dist in fn(prefix):

# dfiles = set(dist.get('files', []))

dfiles = dist_files(prefix, dist)

# TODO :: This is completely wrong when the env is on a case-sensitive FS!

if any(norm_ipp == normcase(w) for w in dfiles):

yield dist

cached path -> artifact map

definitely something to consider

jaimergp · 2023-12-14T18:00:06Z

conda_build/post.py

-            "{}: {} found in {}{}".format(
-                msg_prelude, n_dso_p, [prec.name for prec in precs], and_also
-            ),
+            f"{msg_prelude}: {n_dso_p} found in {[str(prec) for prec in precs]}{and_also}",


This upgrades from name to the full str representation. i.e. from python to defaults::python-3.12.0-h123abc_0, I think? Is that intentional?

yes, this is now more detailed (but still less verbose than str(precs)) than before

the change was inspired by the fix below on line 1214-1215, where precs_in_reqs: List[PackageRecord] was cast to string resulting in a very verbose output, see #5108 (review)

I think it's good that this is now more consistent with the full spec which was already output at https://github.com/conda/conda-build/pull/5108/files#diff-5073339a2e88f80714f82a001af334869aa5df9627b2f00ffd18d7a2035f537eR1190 .

jaimergp · 2023-12-14T18:13:30Z

Do we need to de-dup in line 1180 in post.py or are we confident there will be no dups?

    precs = list(which_package(in_prefix_dso, run_prefix))

kenodegard · 2023-12-14T18:24:46Z

@jaimergp AFAIK no deduping is needed once we merge this PR, with the modifications in which_package it is only possible to return a specific PackageRecord once

tests/test_inspect_pkg.py

tests/conftest.py

mbargull · 2023-12-14T19:26:33Z

conda_build/post.py

-            " (likely) or a missing dependency (less likely)".format(
-                msg_prelude, [prec.name for prec in precs]
-            ),
+            f"{msg_prelude}: .. but {[str(prec) for prec in precs]} not in reqs/run, "


Although I find the change to use str(prec) in the len(precs_in_reqs) == 0 and len(precs) > 0 case above a welcome change, one could argue that we might only want to output the name here so users get a more succinct "you might've missed package in run requirements" message.

But I don't have a strong opinion on this one.

mbargull

Added some minor nit comments, but overall this LGTM, thanks!

Co-authored-by: Marcel Bargull <[email protected]>

conda-bot added the cla-signed [bot] added once the contributor has signed the CLA label Dec 12, 2023

kenodegard mentioned this pull request Dec 12, 2023

How to interpet linking WARNINGs from conda-build 3.28.x? #5106

Closed

kenodegard self-assigned this Dec 12, 2023

jdblischak mentioned this pull request Dec 12, 2023

Require libcurl conda-forge/tiledb-feedstock#225

Merged

4 tasks

kenodegard added 4 commits December 12, 2023 15:45

Add which_package unittest

20d0a12

Only yield package record once

7998d33

Add news

3f19216

Skip using tmp_env in older conda

f955083

kenodegard force-pushed the reduce-which_package branch from 8812608 to f955083 Compare December 12, 2023 21:46

kenodegard changed the base branch from main to 3.28.x December 12, 2023 21:50

jezdez previously approved these changes Dec 13, 2023

View reviewed changes

mbargull reviewed Dec 14, 2023

View reviewed changes

tests/test_inspect_pkg.py Outdated Show resolved Hide resolved

mbargull requested changes Dec 14, 2023

View reviewed changes

kenodegard added 2 commits December 14, 2023 10:47

Refactor test_which_package into a local test

2ce0e20

Prefer str over repr in _lookup_in_prefix_packages

904078a

kenodegard dismissed jezdez’s stale review via 904078a December 14, 2023 16:54

kenodegard commented Dec 14, 2023

View reviewed changes

news/5108-fix-which_package Show resolved Hide resolved

Update news/5108-fix-which_package

424e1f2

kenodegard requested review from mbargull and jezdez December 14, 2023 16:57

mbargull reviewed Dec 14, 2023

View reviewed changes

tests/test_inspect_pkg.py Outdated Show resolved Hide resolved

mbargull reviewed Dec 14, 2023

View reviewed changes

tests/test_inspect_pkg.py Outdated Show resolved Hide resolved

jezdez previously approved these changes Dec 14, 2023

View reviewed changes

Set comparison

89c7bc8

kenodegard dismissed jezdez’s stale review via 89c7bc8 December 14, 2023 17:27

Fix softlink test per feedback

496092c

dholth reviewed Dec 14, 2023

View reviewed changes

jaimergp reviewed Dec 14, 2023

View reviewed changes

mbargull reviewed Dec 14, 2023

View reviewed changes

tests/test_inspect_pkg.py Outdated Show resolved Hide resolved

mbargull reviewed Dec 14, 2023

View reviewed changes

tests/conftest.py Outdated Show resolved Hide resolved

mbargull reviewed Dec 14, 2023

View reviewed changes

mbargull previously approved these changes Dec 14, 2023

View reviewed changes

kenodegard dismissed mbargull’s stale review via 05785c7 December 14, 2023 19:45

Remove tmp_env fixture

05785c7

Co-authored-by: Marcel Bargull <[email protected]>

mbargull approved these changes Dec 14, 2023

View reviewed changes

kenodegard mentioned this pull request Dec 14, 2023

Release 3.28.x #5071

Closed

67 tasks

beeankha approved these changes Dec 14, 2023

View reviewed changes

kenodegard merged commit a22d6ef into conda:3.28.x Dec 14, 2023
24 checks passed

kenodegard deleted the reduce-which_package branch December 14, 2023 23:17

jclarkeSTFC mentioned this pull request Jan 22, 2024

Unpin conda-build mantidproject/mantid#36681

Merged

github-actions bot added the locked [bot] locked due to inactivity label Dec 14, 2024

github-actions bot locked as resolved and limited conversation to collaborators Dec 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`which_package` to yield unique collection of package records #5108

`which_package` to yield unique collection of package records #5108

kenodegard commented Dec 12, 2023 •

edited

Loading

mbargull left a comment

dholth Dec 14, 2023 •

edited

Loading

kenodegard Dec 14, 2023

jaimergp Dec 14, 2023

jaimergp Dec 14, 2023

kenodegard Dec 14, 2023

jaimergp Dec 14, 2023

kenodegard Dec 14, 2023

mbargull Dec 14, 2023

jaimergp commented Dec 14, 2023

kenodegard commented Dec 14, 2023 •

edited

Loading

mbargull Dec 14, 2023

mbargull left a comment

	for dist in fn(prefix):
	# dfiles = set(dist.get('files', []))
	dfiles = dist_files(prefix, dist)
	# TODO :: This is completely wrong when the env is on a case-sensitive FS!
	if any(norm_ipp == normcase(w) for w in dfiles):
	yield dist

which_package to yield unique collection of package records #5108

which_package to yield unique collection of package records #5108

Conversation

kenodegard commented Dec 12, 2023 • edited Loading

Description

Checklist - did you ...

mbargull left a comment

Choose a reason for hiding this comment

dholth Dec 14, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jaimergp commented Dec 14, 2023

kenodegard commented Dec 14, 2023 • edited Loading

Choose a reason for hiding this comment

mbargull left a comment

Choose a reason for hiding this comment

`which_package` to yield unique collection of package records #5108

`which_package` to yield unique collection of package records #5108

kenodegard commented Dec 12, 2023 •

edited

Loading

dholth Dec 14, 2023 •

edited

Loading

kenodegard commented Dec 14, 2023 •

edited

Loading