NEWS

1.5.5 | 2024-12-11 (R-only)

* No user-visible changes. Maintenance work to stay on CRAN.

1.5.4 | 2024-12-07 (R-only)

* No user-visible changes. Maintenance work to stay on CRAN.

1.5.3 | 2023-05-09 (R-only)

* Add a function to set the number of OpenMP threads.

1.5.2 | 2023-04-19 (Python-only)

* Fixed building on non-GNU platforms such as Windows.

1.5.1 | 2023-04-11 (R-only)

* Restored ability to compile with CUDA support on older toolkits which do
  not support C++17 in nvcc.

* Marked contrast_data as incompatible with the CUDA version.

* Fixed the main MDFS function to avoid calling ComputeMaxInfoGains with
  contrast_data with the CUDA version.

1.5.0 | 2023-03-13

* The default (recommended) range value has been changed. All invocations
  without the range parameter set are affected, as are all runs of the
  GetRange function without setting the k parameter. The new default
  (recommendation) is less conservative and fixes the issue which used
  to arise with imbalanced datasets of few objects where the old
  recommended range would not allow to capture the effective split.
  The new default mandates that only k=3 objects are needed (as opposed
  to k=5) and the estimation is done using the total number of objects
  (as opposed to only objects of the less numerous class).

1.4.0 | 2023-01-30

* Contrast variables are now handled separately to achieve the correct
  null hypothesis distribution. Each analysed tuple has at most one
  contrast variable and tuples with such contribute only to contrast
  scores. Actual variables have their max IG computed only among them.
  This has also led to the increase of the default number of contrast
  variables to equal the number of actual variables (but still at least
  30) instead of 10% of them.

* Removed option to output minimum IGs - they were not defined well and
  needlessly complicated the implementation. Users relying on this
  feature are advised to switch to functions outputting tuples instead
  as they give usefully broad view of the IG distribution.

* CPU parallelism efficiency is greatly improved.

* Discretisations are now also parallelised on the CPU.

* AddContrastVariables utility function is deprecated in favor of
  GenContrastVariables which returns only the contrast data.

* The value for the divisions parameter is no longer guessed for 1D.
  Instead, it always defaults to 1 for consistency.

* Add capability to output tuples' entropy and variation of information
  (in case of no decision) and decision's entropy conditioned on tuples
  as well as decision's measure analogous to the variation of information
  but conditional on the tuples (defined as a sum of information gains).
  Variation of information measures are defined only in 2D.

* Optimised 2D version for information gain computation. Now, both decisionful
  and decisionless paths are optimised for 2D (1D was already optimal). This
  results in a speed gain of 30% of computation time in a scenario with 100
  objects and 5000 variables.

* Expose the GetRange function used to calculate the probable optimal value
  of the range parameter.

* Add capability to output values averaged instead of maximised over multiple
  discretisations.

* Disable the warning from K-S test during p-value fitting. It was too strict
  and was warning on perfectly fine fits.

1.3.0 | 2022-04-19

* Adds discrete variant which works (as the name suggests) on
  discrete data (as opposed to continuous data) and skips the
  randomised discretisation procedure entirely. A dedicated
  ComputeMaxInfoGainsDiscrete and ComputeInterestingTuplesDiscrete
  functions are provided for this purpose. The current limitation
  is that all variables must have the same cardinality.

* Adds decisionless mode -- to allow to compute entropy and mutual
  information using the same engine. To compute these, leave the
  decision parameter NULL.

* Restores ability to not set I.lower in ComputeInterestingTuples and
  thus also to search for higher-dimension interesting tuples.

* Removes legacy undocumented fallback to "guessed" pc.xi (when it was
  explicitly set to NULL) as it conflicted with our recommendation.

* Improves validation of input.


1.2.0 | 2021-02-09

* IG threshold equal 0 (the new default value) or below means that there
  is no threshold filtering applied.
  Some IG results might be negative due to logarithm rounding and they
  are passed unchanged and unfiltered to the user now.
  Any positive IG threshold filters normally like it did before.

* Unfiltered tuple IG computation uses an optimised procedure now
  which is considerably faster.
  Speedup ratio is a logarithmic function of the number of variables.
  This brings tuple IG computation's performance close to that of the
  max IG.

* The optimised tuple IG computation procedure allows now to return the
  IG matrix directly.

1.1.1 | 2021-01-21

* Fix compilation on GCC 11.

1.1.0 | 2021-01-06

* Interesting tuples functionality was largely redone. It is now limited
  to 2D and requires inputting 1D IG. However, the results are now
  interpretable.

* Ability to output minimum IG per tuple instead of only maximum.
  NB outputted IG is still maximized over discretizations.

* Minor optimizations.

1.0.5 | 2019-11-10

* fix CUDA failure (issue in linking)

1.0.4 | 2019-10-27

* add function to discretize interesting variables for inspection

* "return.tuples" parameter now also controls returning of relevant
  discretization numbers

* change optional parameter "pseudo.count" name to "pc.xi" to better reflect
  its effect

* fix "interesting.vars" parameter to not require being sorted

* add "require.all.vars" parameter

* optimize IG computation on CPU (up to 4x speedup)

1.0.3 | 2018-11-07

* add note about recommended FDR control method

* fix a possible memory error introduced in 1.0.2 (mismatched new/delete[] operators)

1.0.2 | 2018-10-31

* improve default range estimation

* allow overriding pseudo.count in MDFS function

* set default p.adjust.method to default p.adjust method ("holm")

* set default level to suggested FWER level (0.05)

* fix CUDA version to not abruptly exit R on error

* fix use.CUDA=T in MDFS function

1.0.1 | 2018-06-26

* fix factors as input decision in ComputeMaxInfoGains,
  ComputeInterestingTuples and MDFS

* fix adjusted.p.value in MDFS

* return statistic and p.value in MDFS