
relax's Issues

Get rid of the PyTorch dependencies in `TabularDataModule`

PyTorch is only needed for loading data. Our library mainly handles tabular data, so data loading is unlikely to be a bottleneck in most scenarios; PyTorch's DataLoader is overkill for our project in most use cases.

Purpose

Write a drop-in NumpyLoader.

ToDo

Delete the PyTorch dependency

https://github.com/BirkhoffG/cfnet/blob/24783713dc787cd5b13e70aa483e455c4856198f/settings.ini#L18

https://github.com/BirkhoffG/cfnet/blob/24783713dc787cd5b13e70aa483e455c4856198f/cfnet/datasets.py#L9

Next, modify the following code so that these classes no longer inherit from PyTorch's Dataset and DataLoader:

https://github.com/BirkhoffG/cfnet/blob/24783713dc787cd5b13e70aa483e455c4856198f/cfnet/datasets.py#L12-L22

https://github.com/BirkhoffG/cfnet/blob/24783713dc787cd5b13e70aa483e455c4856198f/cfnet/datasets.py#L35-L51

Expected Functionalities

NumpyDataset should contain all the input data.

# x, y are jax.numpy arrays with len(x) == len(y)
dataset = NumpyDataset(x, y)

x, y = dataset[:]       # access all the data of x, y
x_5, y_5 = dataset[:5]  # access the first five entries of x, y

NumpyLoader iterates over a NumpyDataset. See the PyTorch DataLoader docs.

batch_size = 128
dataloader = NumpyLoader(
    dataset,               # a `NumpyDataset`
    batch_size=batch_size,
    shuffle=True,          # if True, shuffle the data; otherwise, return it in order
    drop_last=False        # if True, discard the final short batch when len(dataset) % batch_size != 0; otherwise, return it
)

for x, y in dataloader:
    # every batch has batch_size rows, except possibly the last one when drop_last=False
    assert len(x) == len(y) <= batch_size
    ...
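
Below is a minimal sketch of what this drop-in pair could look like, assuming the data lives in NumPy or jax.numpy arrays. The class names follow the issue; everything else is illustrative.

import numpy as np

class NumpyDataset:
    """Holds parallel arrays; supports int, slice, and index-array access."""
    def __init__(self, *arrays):
        assert all(len(arr) == len(arrays[0]) for arr in arrays), \
            "all arrays must have the same length"
        self.arrays = arrays

    def __len__(self):
        return len(self.arrays[0])

    def __getitem__(self, idx):
        # idx can be an int, a slice, or an index array (fancy indexing)
        return tuple(arr[idx] for arr in self.arrays)

class NumpyLoader:
    """Iterates a NumpyDataset in (optionally shuffled) batches."""
    def __init__(self, dataset, batch_size=128, shuffle=True, drop_last=False):
        self.dataset = dataset
        self.batch_size = batch_size
        self.shuffle = shuffle
        self.drop_last = drop_last

    def __iter__(self):
        indices = np.arange(len(self.dataset))
        if self.shuffle:
            np.random.shuffle(indices)
        stop = len(indices)
        if self.drop_last:
            stop -= stop % self.batch_size
        for start in range(0, stop, self.batch_size):
            yield self.dataset[indices[start:start + self.batch_size]]

Since batching only manipulates plain index arrays, jax.numpy arrays can be indexed directly and the PyTorch dependency disappears entirely.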

Refactor util functions

Move

  • binary_cross_entropy in cfnet.methods.vanilla
  • grad_update, cat_normalize in cfnet.training_module

into cfnet.utils
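
After the move, call sites would simply import from the new location:

from cfnet.utils import binary_cross_entropy, grad_update, cat_normalize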

Provide Default Data Configs for `TabularDataModule`

Proposal

data_module = TabularDataModule('adult')

With this call, TabularDataModule automatically loads the data_configs for the adult dataset.

We should also keep allowing users to pass their own configs (i.e., the current argument data_configs: str | dict).
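
A sketch of how the resolution could work; the registry name and its fields are hypothetical:

# Hypothetical registry of built-in dataset configs; field names are illustrative.
DEFAULT_DATA_CONFIGS = {
    'adult': {
        'data_dir': 'assets/data/adult.csv',
        'continuous_cols': ['age', 'hours_per_week'],
        'discrete_cols': ['workclass', 'education'],
    },
}

def resolve_data_configs(data_configs):
    """Accept a dataset name (str) or a full user-defined config (dict)."""
    if isinstance(data_configs, dict):
        return data_configs
    try:
        return DEFAULT_DATA_CONFIGS[data_configs]
    except KeyError:
        raise ValueError(
            f"Unknown dataset '{data_configs}'; available: {list(DEFAULT_DATA_CONFIGS)}"
        )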

CI/CD takes too long

The test suite seems to run some unnecessary steps during testing (e.g., training models), which slows CI/CD down.

Customize DataModule-dependent constraints

We use cat_normalize to encode categorical features, and we clip continuous features to [0, 1]. This works because we one-hot encode categorical features and apply a min-max scaler to continuous features.

If a user wants to use other encoding methods (e.g., standardized continuous features), our current way of handling normalized data no longer applies.
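
One possible direction, sketched below with hypothetical names, is to let the data module accept a user-supplied constraint function instead of hard-coding one-hot normalization plus [0, 1] clipping:

import jax.numpy as jnp

class DataModuleConstraintSketch:
    """Hypothetical: the data module owns a pluggable constraint function."""
    def __init__(self, constraint_fn=None):
        # The default mirrors current behavior: min-max-scaled features live in [0, 1].
        self.constraint_fn = constraint_fn or (lambda x: jnp.clip(x, 0.0, 1.0))

    def apply_constraints(self, x):
        return self.constraint_fn(x)

# A user with standardized (z-scored) continuous features could opt out of clipping:
dm = DataModuleConstraintSketch(constraint_fn=lambda x: x)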

Proposed features:

Pass `seed` and `batch_size` to the dataloader functions in `TabularDataModule`

  1. Pass seed and batch_size to TabularDataModule.train_dataloader, TabularDataModule.val_dataloader, and TabularDataModule.test_dataloader.

  2. batch_size should also be an argument in TrainingConfigs
    https://github.com/BirkhoffG/cfnet/blob/2ee1a3203a9935e89b2ed8adf175ee1150fd9960/cfnet/train.py#L15

  3. Deprecate batch_size in DataConfigs

  4. Finally, pass appropriate arguments (see the sketch after this list):
    https://github.com/BirkhoffG/cfnet/blob/2ee1a3203a9935e89b2ed8adf175ee1150fd9960/cfnet/train.py#L58-L59
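
A sketch of the resulting call path; the names are assumed from the issue, and the shuffling logic is illustrative:

import numpy as np

class DataModuleSketch:
    """Hypothetical: dataloaders take batch_size and seed explicitly."""
    def __init__(self, dataset):
        self.dataset = dataset  # e.g., a NumpyDataset supporting index-array access

    def train_dataloader(self, batch_size: int, seed: int = 42):
        # The seed controls shuffling, so runs are reproducible.
        rng = np.random.default_rng(seed)
        indices = rng.permutation(len(self.dataset))
        for start in range(0, len(indices), batch_size):
            yield self.dataset[indices[start:start + batch_size]]

# The trainer would then read batch_size from TrainingConfigs and pass it through:
# datamodule.train_dataloader(t_configs.batch_size, seed=t_configs.seed)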

Support aux arguments of `pred_fn` to be passed to `generate_cf_explanations`

Currently, we assume pred_fn is a function of a single input x, e.g. something like:

pred_fn = lambda x: 2 * x + 1

However, it is possible that user-defined pred_fn takes other arguments.

Hence, I propose

def generate_cf_explanations(
    cf_module: BaseCFModule,
    datamodule: TabularDataModule,
    pred_fn: Callable[[jnp.DeviceArray], jnp.DeviceArray] = None,
    *,
    t_configs=None,
    pred_fn_args: dict = None,
):
    ...

where, internally, we call pred_fn as

pred_fn(x, **(pred_fn_args or {}))

(falling back to an empty dict when pred_fn_args is None).

This offers additional flexibility for models that are not implemented using our framework.
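
For illustration, a hedged sketch: cf_module, datamodule, and params are assumed to be defined elsewhere, and the model below is made up.

import jax
import jax.numpy as jnp

# A user-defined prediction function that takes auxiliary arguments.
def pred_fn(x, params, temperature=1.0):
    return jax.nn.sigmoid(x @ params / temperature)

# The auxiliary arguments are forwarded via pred_fn_args.
cf_exps = generate_cf_explanations(
    cf_module, datamodule, pred_fn,
    pred_fn_args={'params': params, 'temperature': 0.5},
)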

Support hyper-parameter searching for CF explanation methods

Supporting hyper-parameter search enables us to benchmark the algorithms properly. This issue is a thread for discussing how to support hyper-parameter search in CF explanation methods.

In essence, this is a multi-objective optimization problem (i.e., jointly minimizing invalidity and cost).

Some open-source libraries for hyper-parameter search:
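
One such library is Optuna, which supports multi-objective studies out of the box. A minimal sketch follows; run_cf_method is a hypothetical stand-in for running a CF method and measuring (invalidity, cost):

import optuna

def run_cf_method(lr: float, lam: float) -> tuple:
    # Hypothetical stand-in: run the CF method with these hyper-parameters
    # and return (invalidity, cost) measured on a validation set.
    return lam / (1.0 + lr), lr * lam

def objective(trial):
    lr = trial.suggest_float('lr', 1e-4, 1e-1, log=True)
    lam = trial.suggest_float('lambda', 1e-2, 10.0, log=True)
    return run_cf_method(lr, lam)

# Two minimization directions: invalidity and cost.
study = optuna.create_study(directions=['minimize', 'minimize'])
study.optimize(objective, n_trials=50)
print(study.best_trials)  # Pareto-optimal trials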
