bbotk's Introduction

bbotk - Black-Box Optimization Toolkit

Package website: release | dev


bbotk is a black-box optimization framework for R. It features highly configurable search spaces via the paradox package and optimizes arbitrary user-defined objective functions. The package includes several optimization algorithms, e.g. Random Search, Grid Search, Iterated Racing, Bayesian Optimization (in mlr3mbo) and Hyperband (in mlr3hyperband). bbotk is the base package of mlr3tuning, mlr3fselect and miesmuschel.

Resources

There are several sections about black-box optimization in the mlr3book. Often the sections about tuning are also relevant for general black-box optimization.

Installation

Install the latest release from CRAN.

install.packages("bbotk")

Install the development version from GitHub.

pak::pkg_install("mlr-org/bbotk")

Example

# define the objective function
fun = function(xs) {
  - (xs[[1]] - 2)^2 - (xs[[2]] + 3)^2 + 10
}

# set domain
domain = ps(
  x1 = p_dbl(-10, 10),
  x2 = p_dbl(-5, 5)
)

# set codomain
codomain = ps(
  y = p_dbl(tags = "maximize")
)

# create objective
objective = ObjectiveRFun$new(
  fun = fun,
  domain = domain,
  codomain = codomain,
  properties = "deterministic"
)

# initialize instance
instance = oi(
  objective = objective,
  terminator = trm("evals", n_evals = 20)
)

# load optimizer
optimizer = opt("gensa")

# trigger optimization
optimizer$optimize(instance)
##    x1 x2  x_domain  y
## 1:  2 -3 <list[2]> 10
# best performing configuration
instance$result
##    x1 x2  x_domain  y
## 1:  2 -3 <list[2]> 10
# all evaluated configurations
as.data.table(instance$archive)
##            x1        x2          y           timestamp batch_nr x_domain_x1 x_domain_x2
##  1: -4.689827 -1.278761 -37.716445 2024-08-13 17:52:54        1   -4.689827   -1.278761
##  2: -5.930364 -4.400474 -54.851999 2024-08-13 17:52:54        2   -5.930364   -4.400474
##  3:  7.170817 -1.519948 -18.927907 2024-08-13 17:52:54        3    7.170817   -1.519948
##  4:  2.045200 -1.519948   7.807403 2024-08-13 17:52:54        4    2.045200   -1.519948
##  5:  2.045200 -2.064742   9.123250 2024-08-13 17:52:54        5    2.045200   -2.064742
## ---                                                                                    
## 16:  2.000000 -3.000000  10.000000 2024-08-13 17:52:54       16    2.000000   -3.000000
## 17:  2.000001 -3.000000  10.000000 2024-08-13 17:52:54       17    2.000001   -3.000000
## 18:  1.999999 -3.000000  10.000000 2024-08-13 17:52:54       18    1.999999   -3.000000
## 19:  2.000000 -2.999999  10.000000 2024-08-13 17:52:54       19    2.000000   -2.999999
## 20:  2.000000 -3.000001  10.000000 2024-08-13 17:52:54       20    2.000000   -3.000001

bbotk's People

Contributors

be-marc, berndbischl, github-actions[bot], jakob-r, jemus42, lionel-, mb706, michaelchirico, mllg, pat-s, sebffischer, sumny


bbotk's Issues

make domain of objective optional

Sometimes we only know the search_space (the param_set of the OptimInstance) but not the domain of the objective (the search space after the trafo has been applied). We should not always be obliged to define a domain because it is not used anyway.

Therefore it should be optional to define the domain.
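A minimal sketch of the status quo for a trafo-free search space, where the domain merely duplicates the search space (the names are illustrative, not part of this proposal):

library(bbotk)
library(paradox)

# the space the optimizer searches over
search_space = ps(
  x1 = p_dbl(-10, 10),
  x2 = p_dbl(-5, 5)
)

# today the objective still needs a domain, even when it is identical to the
# search space; making it optional would remove this duplication
objective = ObjectiveRFun$new(
  fun = function(xs) list(y = xs$x1^2 + xs$x2^2),
  domain = search_space$clone(),
  codomain = ps(y = p_dbl(tags = "minimize"))
)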

Terminator stagnation and an occasionally failing learner return an error during tuning

Hi,

I stumbled on this by chance, not sure if it can be classified as bug but I thought you should know about it.

When combined with a learner that fails occasionally, the stagnation terminator returns the error:

Error in if (self$terminator$is_terminated(self)) { : 
  missing value where TRUE/FALSE needed

Example:

library(mlr3)
library(mlr3pipelines)
library(mlr3filters)
library(mlr3tuning)
library(paradox)

lrn_rpart <- lrn("classif.rpart")
ig <- po("filter", flt("information_gain"))

ps <- ParamSet$new(list(
  ParamDbl$new("classif.rpart.cp", lower = 0, upper = 0.05),
  ParamInt$new("information_gain.filter.nfeat", lower = 20L, upper = 60L),
  ParamFct$new("information_gain.type",
               levels = c("infogain", "gainratio")) # I know gainratio does not work well with Sonar
))

glrn <- ig %>>% lrn_rpart
glrn <- GraphLearner$new(glrn)

# encapsulate so that individual failures do not abort the tuning
glrn$encapsulate <- c(train = "evaluate", predict = "evaluate")

cv5 <- rsmp("cv", folds = 5)
tsk <- mlr_tasks$get("sonar")

instance <- TuningInstance$new(
  task = tsk,
  learner = glrn,
  resampling = cv5,
  measures = msr("classif.ce"),
  param_set = ps,
  terminator = term("stagnation", iters = 5, threshold = 0)
)

tuner <- TunerRandomSearch$new()
set.seed(123)
tuner$tune(instance)

After 6 evaluated configurations the error occurs; I suspect it is due to the NaN in the performance measure:

instance$archive()
   nr batch_nr  resample_result task_id                     learner_id resampling_id iters params tune_x warnings errors classif.ce
1:  1        1 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      0  0.2648084
2:  2        2 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      0  0.2596980
3:  3        3 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      5        NaN
4:  4        4 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      0  0.2454123
5:  5        5 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      0  0.2501742
6:  6        6 <ResampleResult>   sonar information_gain.classif.rpart            cv     5 <list> <list>        0      0  0.2737515

All the best,

Milan

eval_batch(xdt): xdt should be able to contain more than x cols

If we call OptimInstance$eval_batch(xdt) from inside an optimizer, we might have more information than just the x values (e.g. the AcqFunction value that led to this x value in MBO).

Either we allow more information in xdt, or we allow adding more info afterwards (which would be a bit cumbersome).
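A hedged sketch of what the first option could look like from an optimizer's point of view; the acq_ei column is illustrative and not an existing feature, and instance is assumed to be an existing OptimInstance:

library(data.table)

# proposed points plus the acquisition value that produced them;
# currently eval_batch() only accepts the search space columns
xdt = data.table(
  x1 = c(1.2, -0.4),
  x2 = c(0.3,  2.1),
  acq_ei = c(0.81, 0.45)  # extra optimizer information to be archived
)

instance$eval_batch(xdt)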

RS as part of bbotk

It would make sense to have Random Search as a reference optimizer implementation in bbotk.
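For reference, a random search shipped with bbotk can serve exactly this purpose today; a minimal sketch, reusing the instance from the README example above:

library(bbotk)

optimizer = opt("random_search", batch_size = 10)
optimizer$optimize(instance)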

Write Tutorial/Vignette

  • include a basic example
  • include MOO (multi-objective optimization)
  • include parallelization
  • include how to handle y + "extra" returns from the objective (see the sketch after this list)
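A minimal sketch of the last point: an objective whose function returns the target value y plus extra information; values whose names are not in the codomain end up as extras in the archive (the runtime name is illustrative):

library(bbotk)
library(paradox)

objective = ObjectiveRFun$new(
  fun = function(xs) {
    start = Sys.time()
    y = -(xs$x1 - 2)^2
    # "y" matches the codomain; "runtime" is an extra stored alongside it
    list(y = y, runtime = as.numeric(Sys.time() - start))
  },
  domain = ps(x1 = p_dbl(-10, 10)),
  codomain = ps(y = p_dbl(tags = "maximize"))
)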

mlr3pipelines: Optimizers for threshold tuning

I implemented two Optimizers for threshold tuning: OptimizerNloptr and OptimizerGenSA.
You can find them here.

In general, I would need to separate the threshold tuning logic from the optimization logic, but this should be trivial.

It would be cool if they could be added to bbotk or mlr3tuning.

See also: mlr-org/mlr3tuning#231

Rename param_set of the OptimInstance

  • param_set is used in many objects to control how that object behaves
  • here, the param_set is actually the search space
  • therefore: name it search_space?

Other suggestions welcome.

Structure of instance$result unclear

The result structure should be defined properly, as should the signature of assign_result.

Suggestion:

list(
  xdt,  # data.table with one or multiple rows,
        # a subset of the archive, meaning xdt is a subset of the search_space
  y,    # numerical vector for single-crit, data.table for multi-crit
  x_opt # list (of lists) with one or multiple elements,
        # transformed x values, a subset of the domain of the objective
)

Alternative: the result is just a data.table in the exact same format as archive$data, basically a subset of archive$data; in case the optimizer returns a result that was not previously evaluated, such a data.table has to be constructed.

Actually I would prefer the alternative suggestion because it feels simpler and more coherent.
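A sketch of what the alternative would mean in practice, assuming the result simply mirrors rows of the archive (the instance and column names are reused from the README example; archive$best() and assign_result() are existing bbotk API):

# the best row(s) of the archive, same columns as archive$data
best = instance$archive$best()

# single-crit signature: search space columns plus the named outcome
instance$assign_result(
  xdt = best[, c("x1", "x2"), with = FALSE],
  y = c(y = best$y)
)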

OptimInstance should have an optional sampler slot

That would be the right place to put a ParamSet-specific sampler that could be used for Hyperband, RandomSearch, etc.

Alternatively we could have a subclass, but I don't see a big need for that here.

Problem: the user does not directly see whether the sampler is actually used, i.e. some tuners like GridSearch would just use the ParamSet directly and ignore the sampler.
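A hedged sketch of how such a slot might be used; the $sampler field is hypothetical, while SamplerUnif, $sample() and $data are existing paradox API:

library(paradox)

search_space = ps(
  x1 = p_dbl(-10, 10),
  x2 = p_dbl(-5, 5)
)

# hypothetical: attach a sampler to the instance so that Hyperband,
# RandomSearch, etc. could draw candidate points from it
# instance$sampler = SamplerUnif$new(search_space)

# what an optimizer could then do internally
design = SamplerUnif$new(search_space)$sample(5)
design$data  # data.table with 5 sampled configurations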

plot for tuning instance?

It would be nice to have a quick plot function that shows the tuning curve and, for 1d and 2d problems, a response surface with simple interpolation.

Would that be part of mlr3viz?

Add terminator stagnation batch

For sequential feature selection, a terminator that stops when the performance does not improve by more than a threshold over the last batch would be useful.
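For comparison, the existing evaluation-based stagnation terminator; the batch-based variant proposed here would behave analogously but measure improvement per batch (the "stagnation_batch" key and its parameters are the proposed addition, not guaranteed API):

library(bbotk)

# existing: stop when the best y does not improve by more than
# `threshold` within the last `iters` evaluations
trm("stagnation", iters = 10, threshold = 0)

# proposed: same idea, but measured over the last batch
# trm("stagnation_batch", n = 1, threshold = 0)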

Implement basic optimizers

FIXME: we could add some basic, simple optimizers from R here. Connecting them here would enable them for many tasks in optimization, not only mlr3tuning. Think then about how mlr3mbo extends this system / registers itself.

I think this goes beyond bbotk?
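A rough sketch of the extension mechanism this would rely on; the mlr_optimizers dictionary and opt() are existing bbotk API, while the "optim_nm" key and OptimizerOptimNM class are hypothetical:

library(bbotk)

# optimizers live in a dictionary; packages like mlr3mbo extend bbotk
# by registering their own implementations in it
as.data.table(mlr_optimizers)  # list the currently registered optimizers

# hypothetical registration of a wrapper around a base-R optimizer
# mlr_optimizers$add("optim_nm", OptimizerOptimNM)
# opt("optim_nm")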

annoying "redundancies" with mlr3: future and encapsulation

Not sure how to handle this:

The package should allow for the following: if multiple points are evaluated, this should be parallelized (by future) and encapsulated (by callr). Now it seems reasonable to copy over / do something similar as in mlr3. NB: I have no problem with copy-pasting that code, that's not the issue here!

If I do that, and bbotk is used in mlr3tuning, we now have these features twice. That seems confusing to the user? Example: I could now switch on the parallel option for bbotk, but I could also switch it on for mlr3. The same goes for encapsulation.

What would be the best way out here? @mllg

Check what happens with non-persistent extras

At the moment we only actively support extras going into the instance and coming out of the objective to be stored in the archive if they always have the same names. What happens if some extras are only added for some evaluations?
