deel-ai / puncc
Puncc is a Python library for predictive uncertainty quantification using conformal prediction.
Home Page: https://deel-ai.github.io/puncc/
When conformalizing a pretrained model using the API, the current ConformalPredictor class requires a splitter even though X_fit and y_fit are not necessary. In such a situation, the correct behavior would be to pass a None splitter argument to the ConformalPredictor constructor and provide the calibration data in the call to fit, as sketched below. The ConformalPredictor should allow a None splitter only when the train argument is False.
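A sketch of the usage this would enable (hypothetical, not the current behavior; the exact constructor arguments are illustrative):
# Hypothetical usage once a None splitter is accepted with train=False;
# predictor and calibrator stand for already-constructed puncc objects.
cp = ConformalPredictor(
    predictor=predictor,
    calibrator=calibrator,
    splitter=None,  # no splitter needed: the model is pre-trained
    train=False,
)
cp.fit(X_calib=X_cal, y_calib=y_cal)  # only calibration data is required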
After git clone https://github.com/deel-ai/puncc.git and cd puncc, the following command failed:
$ make prepare-dev
python -m venv puncc-dev-env
make: python: Command not found
make: *** [Makefile:17: prepare-dev] Error 127
Cause: on my local machine (Ubuntu via WSL on Windows), I do not have a python executable installed, only python3 and python3.X.
Possible solutions: none trivial, as far as I know. Using python3 is a bit better because it is explicit about what we want, but python3 could still point to an old version (Python <= 3.7).
Remark: I do not know whether having python3 but no python is standard, but since my machine is set up that way (with no major tinkering with Python), others' machines could be too. One possible fix is sketched below.
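A sketch of a Makefile tweak (the variable name is illustrative, and the target body is reconstructed from the error above): let the interpreter be overridden instead of hard-coding python.
PYTHON ?= python3

prepare-dev:
	$(PYTHON) -m venv puncc-dev-env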
Expected: installation of the prepare-dev virtual environment with puncc and the other development packages.
v0.9
- OS: Ubuntu 20.04.5 LTS (Focal Fossa) on Windows Subsystem for Linux (WSL)
To reproduce: within a (Linux) terminal without python, run:
make prepare-dev
I am training my predictor (LinearRegression()) with my own data, and then I want to create a conformal predictor with SplitCP that takes my pre-trained model and does the conformalization. Here is my problem:
from sklearn.linear_model import LinearRegression
from sklearn.datasets import make_regression
from deel.puncc.api.prediction import BasePredictor
from deel.puncc.regression import SplitCP

# Fit, calibration and test sets drawn from the same distribution
X_fit, y_fit = make_regression(n_samples=200, n_features=1, noise=50, random_state=42, bias=200)
X_cal, y_cal = make_regression(n_samples=200, n_features=1, noise=50, random_state=42, bias=200)
X_test, y_test = make_regression(n_samples=100, n_features=1, noise=50, random_state=42, bias=200)

mod = LinearRegression()
mod.fit(X_fit, y_fit)
print(mod.coef_)
> [85.88287056]
So my mod has been trained correctly. I give two different cases:
base = BasePredictor(mod, is_trained=True)
cp = SplitCP(base)
cp.fit(X_calib=X_cal, y_calib=y_cal)
Now, I expect SplitCP to have "learned" that my mod is ready for prediction (via manually setting is_trained=True).
However, the predicted values returned by cp.predict are unexpected, while cp.predictor.predict(...) behaves as expected (it returns the predictions of mod).
Good:
internal_call_pred = cp.predictor.predict(X_test)
print(internal_call_pred[:10])
> [287.12357943 214.6184216 116.30331871 234.13103249 165.98971046
262.76792038 167.34292778 253.7391835 259.67508504 293.32885547]
Bad:
preds, lo, hi = cp.predict(X_test, 0.1)
print(preds[:10])
>[ nan nan -inf nan -inf nan -inf nan nan nan]
Remark: during a previous run, the code cell above returned small values around 0.0, so it may be returning something that is not initialized properly.
On the other hand, this seems to work correctly:
base = BasePredictor(mod, is_trained=True)
cp = SplitCP(base, train=False)
cp.fit(X_calib=X_cal, y_calib=y_cal)
internal_call_pred = cp.predictor.predict(X_test)
print(internal_call_pred[:10])
>[287.12357943 214.6184216 116.30331871 234.13103249 165.98971046
262.76792038 167.34292778 253.7391835 259.67508504 293.32885547]
preds, lo, hi = cp.predict(X_test, 0.1)
print(preds[:10])
>[287.12357943 214.6184216 116.30331871 234.13103249 165.98971046
262.76792038 167.34292778 253.7391835 259.67508504 293.32885547]
Either is_trained in BasePredictor or train in SplitCP is redundant, or the latter ignores the former. cp.predict returns dummy values while the call to the underlying fitted sklearn model via cp.predictor.predict(...) still works as expected.
Would it be possible to avoid hand-wrapping my own pre-trained sklearn model with BasePredictor? That is, could I pass my model directly to SplitCP, for instance?
See below for a minimal example, which raises an error on cp.fit(...): AttributeError: 'LinearRegression' object has no attribute 'is_trained'
from sklearn.linear_model import LinearRegression
from deel.puncc.regression import SplitCP

mod = LinearRegression()
mod.fit(X_fit, y_fit)

cp = SplitCP(mod, train=False)
cp.fit(X_calib=X_cal, y_calib=y_cal)
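A sketch of how SplitCP could support this internally (a hypothetical helper, not the current API; it assumes a bare estimator passed directly is already trained):
from deel.puncc.api.prediction import BasePredictor

def _as_predictor(model):
    # Pass BasePredictor instances through; wrap bare estimators,
    # assuming they are pre-trained (hence is_trained=True).
    if isinstance(model, BasePredictor):
        return model
    return BasePredictor(model, is_trained=True)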
v0.9
- Python version: Python 3.10.12 via colab
The method deel.puncc.api.utils.quantile is to be updated to compute the [...]
NotImplementedError raised when [...]
Prediction (API)
DualPredictor is supposed to be initialized with an is_trained parameter that should be a list of booleans. There are a few issues related to this variable:
1. is_trained is not updated after the fit method completes:
for count, model in enumerate(self.models):
    if not self.is_trained[count]:
        model.fit(X, y, **dictargs[count])
# end of method
Instead, you may want to rewrite it like this:
for count, model in enumerate(self.models):
    if not self.is_trained[count]:
        model.fit(X, y, **dictargs[count])
        self.is_trained[count] = True
# end of method
2. is_trained is part of the condition of an if statement:
...
if self.train:
    if self.splitter is None:
        raise RuntimeError(
            "The splitter argument is None but train is set to "
            + "True. Please provide a correct splitter to train "
            + "the underlying model."
        )
    logger.info(f"Fitting model on fold {i+cached_len}")
    predictor.fit(X_fit, y_fit, **kwargs)  # Fit K-fold predictor
# Make sure that predictor is already trained if train arg is False
elif self.train is False and predictor.is_trained is False:
    raise RuntimeError(
        "'train' argument is set to 'False' but model is not pre-trained"
    )
else:  # Skipping training
    logger.info("Skipping training.")
...
In the elif statement, predictor.is_trained is used as a boolean, but it can in fact be a list if the predictor is an instance of DualPredictor. In that case it evaluates as truthy whenever the list is non-empty, even though a model may still be untrained.
The solution suggested in point 2 is only partial. I think the best way would be to keep the is_trained variable private (rename it to _is_trained) and introduce an is_trained property which behaves as an instance variable but is implemented as a method under the hood. For DualPredictor it could look like this:
@property
def is_trained(self) -> bool:
    return self._is_trained[0] and self._is_trained[1]
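A slightly more general variant, assuming _is_trained holds one boolean per wrapped model, would also cover predictors with more than two models:
@property
def is_trained(self) -> bool:
    # True only once every wrapped model has been fitted
    return all(self._is_trained)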
is_trained is expected to be a boolean.
is_trained is expected to change after the models are trained.
v0.9
I do not have an example of code that actually fails because of this. The test named test_locally_adaptive_cp does exercise the part of the code with the aforementioned elif statement.
For simplicity, the weight normalization in the case of nonexchangeable CP should be done inside the method deel.puncc.api.calibration.BaseCalibrator.calibrate.
Also, the documentation should clarify whether the weights passed as arguments are expected to be normalized or not. A sketch of the standard normalization is given below.
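For reference, a minimal sketch of the normalization used in the nonexchangeable CP literature (Tibshirani et al., 2019), assuming calibrate receives raw weights for the calibration points and the test point:
import numpy as np

def normalize_weights(calib_weights: np.ndarray, test_weight: float) -> np.ndarray:
    # Each calibration weight is divided by the total mass,
    # including the weight attached to the test point.
    return calib_weights / (calib_weights.sum() + test_weight)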
Add a new module deel.puncc.api.corrections and implement the Bonferroni correction (and other multiple-hypothesis-testing corrections). A minimal sketch is given below.
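A minimal sketch of what such a correction could look like (the function name and signature are illustrative):
import numpy as np

def bonferroni(alpha: float, k: int) -> np.ndarray:
    # Split a global risk level alpha evenly across k simultaneous intervals.
    return np.full(k, alpha / k)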
I have two quantile CatBoost regressor models, but crq.predict(X_test, alpha=0.05) throws an error: TypeError: _quantile_dispatcher() got an unexpected keyword argument 'method'
predictor = DualPredictor(models=[reg_low, reg_high])
crq = CQR(predictor)
# fit trains the model and computes the nonconformity scores
crq.fit(X_fit=X_train, y_fit=y_train, X_calib=X_valid, y_calib=y_valid)
y_pred, y_pred_lower, y_pred_upper = crq.predict(X_test, alpha=0.05)
coverage = regression_mean_coverage(y_test, y_pred_lower, y_pred_upper)
width = regression_sharpness(y_pred_lower=y_pred_lower,
y_pred_upper=y_pred_upper)
print(f"Marginal coverage: {np.round(coverage, 2)}")
print(f"Average width: {np.round(width, 2)}")
It should run
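A likely cause, not confirmed in the thread: np.quantile only accepts the method keyword from NumPy 1.22 onward (it was called interpolation before), and older versions raise exactly this TypeError from the dispatcher. A quick check:
import numpy as np
print(np.__version__)  # needs >= 1.22 for np.quantile(..., method="inverted_cdf")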
v0.9
param = {'loss_function': 'Quantile:alpha=0.05',
'learning_rate': 0.4607417710785185,
'l2_leaf_reg': 0.03572230525884548,
'depth': 4,
'boosting_type': 'Plain',
'bootstrap_type': 'MVS',
'min_data_in_leaf': 8}
reg_low = CatBoostRegressor(task_type="GPU", devices='-1', **param)
param = {'loss_function': 'Quantile:alpha=0.95',
'learning_rate': 0.002097382718709981,
'l2_leaf_reg': 0.07411180923916862,
'depth': 1,
'boosting_type': 'Plain',
'bootstrap_type': 'Bayesian',
'min_data_in_leaf': 5,
'bagging_temperature': 9.119533192831474}
reg_high = CatBoostRegressor(task_type="GPU", devices='-1', **param)
predictor = DualPredictor(models=[reg_low, reg_high])
crq = CQR(predictor)
# fit trains the model and computes the nonconformity scores
crq.fit(X_fit=X_train, y_fit=y_train, X_calib=X_valid, y_calib=y_valid)
y_pred, y_pred_lower, y_pred_upper = crq.predict(X_test, alpha=0.05)
coverage = regression_mean_coverage(y_test, y_pred_lower, y_pred_upper)
width = regression_sharpness(y_pred_lower=y_pred_lower,
y_pred_upper=y_pred_upper)
print(f"Marginal coverage: {np.round(coverage, 2)}")
print(f"Average width: {np.round(width, 2)}")
TODO
The old branch https://github.com/deel-ai/puncc/tree/fix-cv-quantile, not merged into main, contains some corrections to the quantile procedure followed during cross-validation-plus. These must be reimplemented in the current main. It is not worth the time to merge this old code; just rewrite and recheck everything for statistical correctness.
Where:
deel/puncc/api/calibration.py
The imprecise code:
y_lo = (-1) * np.quantile(
(-1) * concat_y_lo, 1 - alpha, axis=1, method="inverted_cdf"
)
y_hi = np.quantile(
concat_y_hi, 1 - alpha, axis=1, method="inverted_cdf"
)
The 1 - alpha should be (1 - alpha)(1 + 1/n), where n is the number of training points (only in the case of jackknife+ and CV+!).
Sources: the q+ and q- formulae. Remark: for small values of alpha, outside the admissible range, the authors set the quantile to infinity. We should consider this and decide what to do in our code.
An infinite prediction interval can be useless in practice, especially for a user not acquainted with conformal prediction.
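A sketch of the corrected computation, reusing alpha, n and concat_y_hi from the code above; when the corrected level exceeds 1 (alpha too small), the quantile is set to infinity as in the paper:
import numpy as np

level = (1 - alpha) * (1 + 1 / n)  # jackknife+/CV+ corrected quantile level
if level > 1:
    y_hi = np.full(concat_y_hi.shape[0], np.inf)  # inadmissible alpha: infinite bound
else:
    y_hi = np.quantile(concat_y_hi, level, axis=1, method="inverted_cdf")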
Hi!
Thank you for creating Puncc. I'm trying to use LocallyAdaptiveCP as described here https://deel-ai.github.io/puncc/regression.html#deel.puncc.regression.LocallyAdaptiveCP
import xgboost as xgb
from deel.puncc.api.prediction import MeanVarPredictor
from deel.puncc.regression import LocallyAdaptiveCP

mu_model = xgb.XGBRegressor()
sigma_model = xgb.XGBRegressor()
# Wrap models in a mean/variance predictor
mean_var_predictor = MeanVarPredictor(models=[mu_model, sigma_model])

cp = LocallyAdaptiveCP(mean_var_predictor)
cp.fit(X_fit=X_train, y_fit=y_train, X_calib=X_test, y_calib=y_test)
But I get an error: All MAD predictions should be positive. Any idea what I am missing?
I think the error comes from puncc/deel/puncc/api/nonconformity_scores.py, line 248 at commit 6e0a8f8:
mean_absolute_deviation = absolute_difference(y_pred, y_true)
if np.any(sigma_pred < 0):
    raise RuntimeError("All MAD predictions should be positive.")
return mean_absolute_deviation / (sigma_pred + EPSILON)
But I don't know how to avoid it. Any pointers would be greatly appreciated!
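One possible workaround, not from the thread: constrain the dispersion model so its predictions cannot go negative before LocallyAdaptiveCP sees them, e.g. with a small wrapper (the class name is illustrative):
import numpy as np
import xgboost as xgb

class NonNegativeRegressor:
    # Delegates to an inner regressor and clips its predictions at zero.
    def __init__(self, model):
        self.model = model

    def fit(self, X, y, **kwargs):
        self.model.fit(X, y, **kwargs)
        return self

    def predict(self, X, **kwargs):
        return np.maximum(self.model.predict(X, **kwargs), 0.0)

sigma_model = NonNegativeRegressor(xgb.XGBRegressor())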
While working with conformal classification with JD, we realized that it makes sense to be able to force non-empty prediction sets, at least from an operational point of view.
For example, in classification via softmax, this corresponds to always including the class whose score is highest (even if very low).
In the case of (R)APS, this means bypassing the randomization step.
The RAPS paper explains this phenomenon and details the algorithm with and without the randomization step. [screenshots from the paper omitted]
I reckon we could achieve this simply by adding a flag to the class at instantiation, such as:
aps_cp = RAPS(class_predictor, [...], avoid_empty_sets=True)
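A toy sketch of the idea (a simplified APS set builder without the full randomization logic; all names are illustrative):
import numpy as np

def aps_set(softmax_scores, qhat, avoid_empty_sets=True):
    order = np.argsort(-softmax_scores)     # classes by decreasing score
    cum = np.cumsum(softmax_scores[order])  # cumulative probability mass
    keep = cum <= qhat                      # randomization/trimming would act here
    if avoid_empty_sets:
        keep[0] = True                      # always keep the top-scoring class
    return order[keep]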
How can we get the conformalizing quantile after cp.fit(...)?
Maybe we could get something like:
cp.get_conformalizer(alpha=0.043)
> 3.095
The nonconformity scores should already be stored somewhere in the object after fit; it would boil down to applying the correct (non-trivial?) quantile formula from within puncc to the array of scores, as sketched below.
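A sketch of that computation, assuming the scores are available as a 1-D array (this is the standard split-CP quantile formula):
import numpy as np

def conformalizing_quantile(scores: np.ndarray, alpha: float) -> float:
    n = len(scores)
    level = np.ceil((n + 1) * (1 - alpha)) / n  # finite-sample corrected level
    if level > 1:
        return np.inf  # alpha too small for the calibration set size
    return np.quantile(scores, level, method="inverted_cdf")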
Calibration (API)
In the case of multivariate regression, we may have a vector of alphas of the same length as the number of features of the Y variable. We could modify the alpha_calib_check function so that it takes a float or an np.ndarray alpha as an argument and checks that all coordinates of alpha satisfy the desired constraint, as sketched below.
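A hedged sketch of the extension; the constraint shown, alpha >= 1/(n+1), is the usual admissibility condition, and the real function may check more:
import numpy as np

def alpha_calib_check(alpha, n):
    # Accept a float or an ndarray of per-coordinate alphas.
    alphas = np.atleast_1d(np.asarray(alpha, dtype=float))
    if np.any(alphas < 1.0 / (n + 1)):
        raise ValueError(f"Each alpha must be >= 1/(n+1) with n = {n} calibration points.")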
v0.9
Calibration (API)
The function alpha_calib_check is only designed to work for the usual CP, where all weights are equal; it should be adapted to the case of weighted CP.
Easy solution: call alpha_calib_check only in the case of usual CP, and allow returning infinity as the quantile in the case of weighted CP.
Better solution: modify alpha_calib_check to take the weighted version of CP into account; a sketch of the underlying condition is given below.
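A sketch of the admissibility condition for weighted CP, assuming normalized weights p_i for the calibration points (the test point carries the remaining mass):
import numpy as np

def weighted_quantile_is_finite(p_calib: np.ndarray, alpha: float) -> bool:
    # The conformal quantile stays finite only if the calibration
    # mass alone can reach the 1 - alpha level.
    return (1 - alpha) <= p_calib.sum()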
v0.9
There is nothing to reproduce.
Add nonconformity score and prediction set functions for conformal multivariate regression in their respective modules.
The data check in splitting.IdSplitter is not exhaustive. Also, it should be moved to the utils module.
Update deel.puncc.calibration.BaseCalibrator for multivariate regression:
- Add a method name_placeholder that returns the [...]
- Add a correction function (callable) as an argument to the calibrate method. By default, use Bonferroni.
- The methods fit and calibrate should be compatible with multivariate prediction. Specifically, calibrate should be allowed to run with a vector of alphas.
- The CvPlusCalibrator should support multivariate prediction.
- Make sure backward compatibility is respected: univariate prediction should be a special case. A toy sketch is given after this list.
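A toy sketch of a multivariate-aware calibration (hypothetical names: one conformal quantile per output coordinate, Bonferroni-corrected by default):
import numpy as np

def multivariate_quantiles(scores: np.ndarray, alpha: float) -> np.ndarray:
    # scores: (n_calib, k) nonconformity scores, one column per Y coordinate.
    n, k = scores.shape
    alphas = np.full(k, alpha / k)  # Bonferroni split of the global alpha
    levels = np.minimum(np.ceil((n + 1) * (1 - alphas)) / n, 1.0)
    return np.array([
        np.quantile(scores[:, j], levels[j], method="inverted_cdf")
        for j in range(k)
    ])

With k = 1 this reduces to the univariate split-CP quantile, which preserves backward compatibility.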