
mljtuning.jl's Introduction

MLJTuning

Hyperparameter optimization for MLJ machine learning models.

See Tuning Models · MLJ for usage examples.


Contents

Note: This component of the MLJ stack applies to MLJ versions 0.8.0 and higher. Prior to 0.8.0, tuning algorithms resided in MLJ.

Who is this repo for?

This repository is not intended to be directly imported by the general MLJ user. Rather, MLJTuning is a dependency of the MLJ machine learning platform, which allows MLJ users to perform a variety of hyperparameter optimization tasks from there.

MLJTuning is the place for developers to integrate hyperparameter optimization algorithms (here called tuning strategies) into MLJ, either by adding code to /src/strategies, or by importing MLJTuning into a third-party package and implementing MLJTuning's tuning strategy interface.

MLJTuning is a component of the MLJ stack which does not have MLJModels as a dependency (no ability to search and load registered MLJ models). It does however depend on MLJBase and, in particular, on the resampling functionality currently residing there.

What is provided here?

This repository contains:

  • a tuning wrapper called TunedModel for transforming arbitrary MLJ models into "self-tuning" ones - that is, into models which, when fit, automatically optimize a specified subset of the original hyperparameters (using cross-validation or some other resampling strategy) before training the optimal model on all supplied data

  • an abstract tuning strategy interface to allow developers to conveniently implement common hyperparameter optimization strategies, such as:

    • search models generated by an arbitrary iterator, eg models = [model1, model2, ...] (built-in Explicit strategy)

    • grid search (built-in Grid strategy)

    • Latin hypercubes (built-in LatinHypercube strategy, interfacing the LatinHypercubeSampling package)

    • random search (built-in RandomSearch strategy)

    • particle swarm optimization (MLJParticleSwarmOptimization.jl)

    • bandit

    • simulated annealing

    • Bayesian optimization using Gaussian processes

    • structured tree Parzen estimators (MLJTreeParzenTuning from TreeParzen.jl)

    • multi-objective (Pareto) optimization

    • genetic algorithms

    • AD-powered gradient descent methods

  • a selection of implementations of the tuning strategy interface, currently all those accessible from MLJ itself.

  • the code defining the MLJ functions learning_curve! and learning_curve, as these are essentially one-dimensional grid searches

How do I implement a new tuning strategy?

This document assumes familiarity with the Evaluating Model Performance and Performance Measures sections of the MLJ manual. Tuning itself, from the user's perspective, is described in Tuning Models.

Overview

What follows is an overview of tuning in MLJ. After the overview is an elaboration on those terms given in italics.

All tuning in MLJ is conceptualized as an iterative procedure, each iteration corresponding to a performance evaluation of a single model. Each such model is a mutated clone of a fixed prototype. In the general case, this prototype is a composite model, i.e., a model with other models as hyperparameters; while the type of the prototype (and hence of its mutated clones) is fixed, the types of the sub-models are allowed to vary.

When all iterations of the algorithm are complete, the optimal model is selected by applying a selection heuristic to a history generated according to the specified tuning strategy. Iterations are generally performed in batches, which are evaluated in parallel (sequential tuning strategies degenerating into semi-sequential strategies, unless the batch size is one). At the beginning of each batch, both the history and an internal state object are consulted, and, on the basis of the tuning strategy, a new batch of models to be evaluated is generated. On the basis of these evaluations, and the strategy, the history and internal state are updated.

The tuning algorithm initializes the state object before iterations begin, on the basis of the specific strategy and a user-specified range object.

  • Recall that in MLJ a model is an object storing the hyperparameters of some learning algorithm indicated by the name of the model type (e.g., DecisionTreeRegressor). Models do not store learned parameters.

  • An evaluation is an object E returned by some call to the evaluate! method, when passed the resampling strategy (e.g., resampling=CV(nfolds=9)) and a battery of user-specified performance measures (e.g., measures=[cross_entropy, accuracy]). An evaluation object E contains a list of measures E.measure and a list of corresponding measurements E.measurement, each of which is the aggregate of measurements for each resampling of the data, which are stored in E.per_fold (a vector of vectors). In the case of a measure that reports a value for each individual observation (to obtain the per-fold measurement, by aggregation), the per-observation values can be retrieved from E.per_observation. This last object includes missing entries for measures that do not report per-observation values (reports_per_observation(measure) = false) such as auc. See Evaluating Model Performance for details. There is a trait for measures called orientation which is :loss for measures you ordinarily want to minimize, and :score for those you want to maximize. See Performance measures for further details.

  • A tuning strategy is an instance of some subtype S <: TuningStrategy, the name S (e.g., Grid) indicating the tuning (optimization) algorithm to be applied. The fields of the tuning strategy - called tuning hyperparameters - are those tuning parameters specific to the strategy that do not refer to specific models or specific model hyperparameters. So, for example, a default resolution to be used in a grid search is a hyperparameter of Grid, but the resolution to be applied to a specific hyperparameter (such as the maximum depth of a decision tree) is not. This latter parameter would be part of the user-specified range object. A multi-objective tuning strategy is one that consults the measurements of all measures specified by the user; otherwise only the first measure is consulted, although measurements for all measures are nevertheless reported.

  • A selection heuristic is a rule describing how the outcomes of the model evaluations will be used to select the best (optimal) model, after all iterations of the optimizer have concluded. For example, the default NaiveSelection() heuristic simply selects the model whose evaluation E has the smallest or largest E.measurement[1] value, according to whether the metric E.measure[1] is a :loss or :score. Most heuristics are generic in the sense they will apply no matter what tuning strategy is applied. A selection heuristic supported by a multi-objective tuning strategy must select some "best" model (e.g., a random Pareto optimal solution).

  • The history is a vector of identically-keyed named tuples, one tuple per iteration (a sketch of one such entry follows this list). The tuple keys include:

    • model: for the MLJ model instance that has been evaluated

    • measure, measurement, per_fold: for storing the values of E.measure, E.measurement and E.per_fold, where E is the corresponding evaluation object.

    • metadata: for any tuning strategy-specific information required to be recorded in the history but not intended to be reported to the user (for example an implementation-specific representation of model).

    There may be additional keys for tuning-specific information that is to be reported to the user (such as temperature in simulated annealing).

  • A range is any object whose specification completes the specification of the tuning task, after the prototype, tuning strategy, resampling strategy, performance measure(s), selection heuristic, and total iteration count are given. This definition is intentionally broad and the interface places no restriction on the allowed types of this object, although all strategies should support the one-dimensional range objects defined in MLJBase (see below). Generally, a range may be understood as the "space" of models being searched plus strategy-specific data explaining how models from that space are actually to be generated (e.g., grid resolutions or probability distributions specific to particular hyper-parameters). For more on range types see Range types below.
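For concreteness, here is a hedged sketch of what a single history entry might look like for a single-measure evaluation. The hyperparameter name, the measure name and all numbers below are illustrative placeholders, not output of any actual MLJ run:

# Hypothetical history entry; all values below are placeholders:
entry = (model       = (max_depth = 3,),      # stand-in for the evaluated model clone
         measure     = [:cross_entropy],      # stand-in for E.measure
         measurement = [0.41],                # E.measurement (aggregate over folds)
         per_fold    = [[0.39, 0.44, 0.40]],  # E.per_fold
         metadata    = nothing)               # strategy-specific, hidden from the user

entry.measurement[1]  # 0.41, the value the default selection heuristic consults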

Interface points for user input

Recall, for context, that in MLJ tuning is implemented as a model wrapper. A model is tuned by fitting the wrapped model to data (which also trains the optimal model on all available data). This process determines the optimal model, as defined by the selection heuristic (see above). To use the optimal model one predicts using the wrapped model. For more detail, see the Tuning Models section of the MLJ manual.

In setting up a tuning task, the user constructs an instance of the TunedModel wrapper type, which has these principal fields (a construction sketch follows the list):

  • model: the prototype model instance mutated during tuning (the model being wrapped)

  • tuning: the tuning strategy, an instance of a concrete TuningStrategy subtype, such as Grid

  • resampling: the resampling strategy used for performance evaluations, which must be an instance of a concrete ResamplingStrategy subtype, such as Holdout or CV

  • measure: a measure (loss or score) or vector of measures available to the tuning algorithm, the first of which is optimized in the common case of single-objective tuning strategies

  • selection_heuristic: some instance of SelectionHeuristic, such as NaiveSelection() (default)

  • range: as defined above - roughly, the space of models to be searched

  • n: the number of iterations, which is the number of distinct model evaluations that will be added to the history, unless the tuning strategy's supply of models is exhausted (e.g., Grid). This is not to be confused with an iteration count specific to the tuning strategy (e.g., Particle Swarm Optimization).

  • acceleration: the computational resources to be applied (e.g., CPUProcesses() for distributed computing and CPUThreads() for multi-threaded processing)

  • acceleration_resampling: the computational resources to be applied at the level of resampling (e.g., in cross-validation)
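To make these fields concrete, here is a hedged construction sketch. It assumes MLJ and MLJDecisionTreeInterface are installed; the choice of atomic model and of the hyperparameter being tuned is arbitrary:

using MLJ

Tree = @load DecisionTreeClassifier pkg=DecisionTree  # assumes MLJDecisionTreeInterface is installed
tree = Tree()

r = range(tree, :max_depth, lower=1, upper=10)

tuned_tree = TunedModel(model=tree,
                        tuning=Grid(resolution=10),
                        resampling=CV(nfolds=6),
                        measure=cross_entropy,
                        range=r,
                        acceleration=CPUThreads())

# Fitting the wrapper runs the search and then retrains the best model on all data:
X, y = @load_iris
mach = machine(tuned_tree, X, y) |> fit!
predict(mach, X)            # predictions from the optimal model
report(mach).best_model     # the optimal model selected by the heuristic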

Implementation requirements for new tuning strategies

For sample implementations, see /src/strategies/.

Summary of functions

Several functions are part of the tuning strategy API:

  • clean!: for validating and resetting invalid fields in tuning strategy (optional)

  • setup: for initialization of state (compulsory)

  • extras: for declaring and formatting additional user-inspectable information going into the history

  • tuning_report: for declaring any other strategy-specific information to report to the user (optional)

  • models: for generating batches of new models and updating the state (compulsory)

  • default_n: to specify the total number of models to be evaluated when n is not specified by the user

  • supports_heuristic: a trait used to encode which selection heuristics are supported by the tuning strategy (only needed if you define a strategy-specific heuristic)

Important note on the history. The initialization and update of the history is carried out internally, i.e., is not the responsibility of the tuning strategy implementation. The history is always initialized to nothing, rather than an empty vector.

The above functions are discussed further below, after discussing types.

The tuning strategy type

Each tuning algorithm must define a subtype of TuningStrategy whose fields are the hyperparameters controlling the strategy that do not directly refer to models or model hyperparameters. These would include, for example, the default resolution of a grid search, or the initial temperature in simulated annealing.

The algorithm implementation must include a keyword constructor with defaults. Here's an example:

mutable struct Grid <: TuningStrategy
	goal::Union{Nothing,Int}
	resolution::Int
	shuffle::Bool
	rng::Random.AbstractRNG
end

# Constructor with keywords
Grid(; goal=nothing, resolution=10, shuffle=true,
	 rng=Random.GLOBAL_RNG) =
	Grid(goal, resolution, MLJBase.shuffle_and_rng(shuffle, rng)...)

Range types

Generally new types are defined for each class of range object a tuning strategy would like to handle, and the tuning strategy functions to be implemented are dispatched on these types. It is recommended that every tuning strategy support at least these types:

  • one-dimensional ranges r, where r is a MLJBase.ParamRange instance

  • (optional) pairs of the form (r, data), where data is extra hyper-parameter-specific information, such as a resolution in a grid search, or a distribution in a random search

  • abstract vectors whose elements are of the above form

Recall that ParamRange has two concrete subtypes NumericRange and NominalRange, whose instances are constructed with the MLJBase extension to the range function.

Note in particular that a NominalRange has a values field, while NumericRange has the fields upper, lower, scale, unit and origin. The unit field specifies a preferred length scale, while origin a preferred "central value". These default to (upper - lower)/2 and (upper + lower)/2, respectively, in the bounded case (neither upper = Inf nor lower = -Inf). The fields origin and unit are used in generating grids or fitting probability distributions to unbounded ranges.

A ParamRange object is always associated with the name of a hyperparameter (a field of the prototype in the context of tuning), which is recorded in its field attribute, a Symbol, but for composite models this might instead be an Expr, such as :(atom.max_depth).

Use the iterator and sampler methods to convert ranges into one-dimensional grids or for random sampling, respectively. See the tuning section of the MLJ manual or doc-strings for more on these methods and the Grid and RandomSearch implementations.
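For instance (a hedged sketch; the hyperparameter name :max_depth is just a stand-in, the model-free form of range is used for brevity, and Distributions is assumed to be installed):

using MLJBase, Distributions, Random

r = range(Int, :max_depth, lower=1, upper=8)          # a one-dimensional NumericRange

iterator(r, 4)                                        # a 4-point grid over the range
rand(MersenneTwister(123), sampler(r, Uniform), 3)    # 3 random samples from the range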

The clean! method: For validating tuning strategy

MLJTuning.clean!(tuning::MyTuningStrategy)

Some tuning strategies have mutable fields that only support a specific set of values: a particle swarm strategy, for instance, should have at least three agents for the algorithm to work. As such, it is recommended to implement the clean! method to warn the user about, and correct, invalid tuning hyperparameters. The method should return a string message if some fields have been reset, or an empty string otherwise, and it will be called internally whenever a TunedModel machine is fit!.

The default fallback for clean! returns an empty string.
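For example, a hypothetical particle-swarm-like strategy with an n_particles field might implement the following (the strategy type and its field are assumptions used only for illustration, not part of MLJTuning):

import MLJTuning

mutable struct MyParticleSwarm <: MLJTuning.TuningStrategy   # hypothetical strategy
    n_particles::Int
end

function MLJTuning.clean!(tuning::MyParticleSwarm)
    message = ""
    if tuning.n_particles < 3
        message *= "`n_particles` must be at least 3. Resetting to 3. "
        tuning.n_particles = 3
    end
    return message
end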

The setup method: To initialize state

state = setup(tuning::MyTuningStrategy, model, range, n, verbosity)

The setup function is for initializing the state of the tuning algorithm (available to the models method). Be sure to make this object mutable if it needs to be updated by the models method. The arguments model and n are what the user has specified in their TunedModel instance; recall model is the prototype to be cloned and mutated, and n the total number of mutations to be generated.

The state is a place to record the outcomes of any necessary initialization of the tuning algorithm (performed by setup) and a place for the models method to save and read transient information that does not need to be recorded in the history.

The setup function is called once only, when a TunedModel machine is fit! the first time, and not on subsequent calls (unless force=true). (Specifically, MLJBase.fit(::TunedModel, ...) calls setup but MLJBase.update(::TunedModel, ...) does not.)

The verbosity is an integer indicating the level of logging: 0 means logging should be restricted to warnings, and -1 means completely silent.

The fallback for setup is:

setup(tuning::TuningStrategy, model, range, n, verbosity) = range

However, a tuning strategy will generally want to implement a setup method for each range type it is going to support:

MLJTuning.setup(tuning::MyTuningStrategy, model, range::RangeType1, n, verbosity) = ...
MLJTuning.setup(tuning::MyTuningStrategy, model, range::RangeType2, n, verbosity) = ...
etc.
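As a hedged illustration, here is a sketch of a hypothetical strategy, call it MyShuffledGrid, whose setup materializes an n-point grid over a single one-dimensional numeric range and shuffles it up-front; the state is a named tuple recording the hyperparameter name and the remaining candidate values. The strategy type and its rng field are assumptions used only in this and the later sketches:

import MLJTuning, MLJBase
using Random

mutable struct MyShuffledGrid <: MLJTuning.TuningStrategy   # hypothetical strategy
    rng::Random.AbstractRNG
end
MyShuffledGrid(; rng=Random.GLOBAL_RNG) = MyShuffledGrid(rng)

function MLJTuning.setup(tuning::MyShuffledGrid, model, r::MLJBase.NumericRange, n, verbosity)
    candidates = collect(MLJBase.iterator(r, n))
    verbosity > 0 && @info "Generated $(length(candidates)) candidate values."
    return (field = r.field, values = shuffle!(tuning.rng, candidates))
end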

The extras method: For adding user-inspectable data to the history

MLJTuning.extras(tuning::MyTuningStrategy, history, state, E) -> named_tuple

This method should return any user-inspectable information to be included in a new history entry, in addition to the model, measure, measurement and per_fold data. This method must return a named tuple, human readable if possible. Each key of the returned named tuple becomes a key of the new history entry.

Here E is the full evaluation object for model and history is the current history (before adding the new entry).

The fallback for extras returns an empty named tuple.
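For example, a simulated-annealing-style strategy might expose its current temperature in the history (a sketch; the strategy type, and the assumption that its state carries a temperature field, are hypothetical):

import MLJTuning

struct MySimulatedAnnealing <: MLJTuning.TuningStrategy end   # hypothetical strategy

# Assumes this strategy's `state` is a named tuple with a `temperature` key:
MLJTuning.extras(tuning::MySimulatedAnnealing, history, state, E) =
    (temperature = state.temperature,)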

The models method: For generating model batches to evaluate

MLJTuning.models(tuning::MyTuningStrategy, model, history, state, n_remaining, verbosity)
	-> vector_of_models, new_state

This is the core method of a new implementation. Given the existing history and state, it must return a vector ("batch") of new model instances vector_of_models to be evaluated, and the updated state. Any number of models may be returned (and this includes an empty vector or nothing) and the evaluations will be performed in parallel (using the mode of parallelization defined by the acceleration field of the TunedModel instance).

Important note. Parallelization means the order in which the history gets extended after models(...) returns its list of new candidates is generally not the same order in which the candidates are returned. Some implementations may therefore need to attach extra "labeling" metadata to each model, as explained below, so that the existing history can be suitably interpreted.

If more models are returned than needed (because including them would create a history whose length exceeds the user-specified number of iterations tuned_model.n) then the surplus models are saved, for use in a "warm restart" of tuning, when the user increases tuned_model.n. The remaining models are then evaluated and these evaluations are added to the history. In any warm restart, no new call to models will be made until all saved models have been evaluated, and these evaluations added to the history.

If the tuning algorithm exhausts its supply of new models (because, for example, there is only a finite supply, as in a Grid search) then vector_of_models should be an empty vector or nothing. The interface has no fixed "batch-size" parameter, and the tuning algorithm is happy to receive any number of models; a surplus is handled as explained above, while a shortfall will trigger an early stop (so that the final history has length less than tuned_model.n).

If needed, extra metadata may be attached to each model returned; see below.

Sequential tuning strategies generating models non-deterministically (e.g., simulated annealing) might choose to include a batch size hyperparameter, and arrange that models returns batches of the specified size (to be evaluated in parallel when acceleration is set appropriately). However, the evaluations and history updates do not occur until after the models call, so it may be complicated or impossible to preserve the original (strictly) sequential algorithm in that case, which should be clearly documented.

Some simple tuning strategies, such as RandomSearch, will want to return as many models as possible in one hit. To this end, the variable n_remaining is passed to the models call; this is the difference between tuned_model.n and the current length of the history.
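Continuing the hypothetical MyShuffledGrid sketch from the setup section above, a models implementation might look like the following. It assumes the state is the (field, values) named tuple built by setup, that the range field is a plain Symbol (not an Expr), and that cloning via deepcopy plus setproperty! is an acceptable way to generate mutations:

import MLJTuning

function MLJTuning.models(tuning::MyShuffledGrid, model, history, state, n_remaining, verbosity)
    values = state.values
    isempty(values) && return nothing, state        # supply exhausted: triggers an early stop
    k = min(n_remaining, length(values))
    vector_of_models = map(values[1:k]) do v
        m = deepcopy(model)                         # clone the prototype ...
        setproperty!(m, state.field, v)             # ... and mutate the tuned hyperparameter
        m
    end
    new_state = (field = state.field, values = values[k+1:end])
    return vector_of_models, new_state
end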

Including model metadata

If a tuning strategy implementation needs to record additional metadata in the history, for each model generated, then instead of model instances, vector_of_models should be vector of tuples of the form (m, metadata), where m is a model instance, and metadata the associated data. To access the metadata for the jth element of the existing history, use history[j].metadata.

The tuning_report method: To add to the user-inspectable report

As with any model, fitting a TunedModel instance generates a user-accessible report. Note that the fallback report already includes additions to the history created by the extras method mentioned above. To add more strategy-specific information to the report, one overloads tuning_report.

Specifically, the report generated by fitting a TunedModel is constructed with this code:

report1 = (best_model         = best_model,
		   best_history_entry = best_user_history_entry,
		   history            = user_history,
		   best_report        = best_report)

report = merge(report1, tuning_report(tuning, history, state))

where:

  • best_model is the best model instance (as selected according to the user-specified selection heuristic).

  • best_user_history_entry is the corresponding entry in the history with metadata removed.

  • best_report is the report generated when fitting the best_model on all available data.

  • user_history is the full history with metadata entries removed.

  • tuning_report(::MyTuningStrategy, ...) is a method the implementer may overload; it must return a named tuple, preferably human readable.

The fallback for tuning_report returns an empty named-tuple.
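For instance, the hypothetical MyShuffledGrid strategy sketched earlier might report the candidate values that were never evaluated (again a sketch, not part of MLJTuning):

import MLJTuning

MLJTuning.tuning_report(tuning::MyShuffledGrid, history, state) =
    (remaining_candidates = state.values,)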

The default_n method: For declaring the default number of iterations

MLJTuning.default_n(tuning::MyTuningStrategy, range)

The models method, which is allowed to return multiple models in its first return value vector_of_models, is called until one of the following occurs:

  • The length of the history matches the number of iterations specified by the user, namely tuned_model.n where tuned_model is the user's TunedModel instance. If tuned_model.n is nothing (because the user has not specified a value) then default_n(tuning, range) is used instead.

  • vector_of_models is empty or nothing.

The fallback is

default_n(tuning::TuningStrategy, range) = DEFAULT_N

where DEFAULT_N is a global constant. Do using MLJTuning; MLJTuning.DEFAULT_N to check the current value.
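For example, the hypothetical MyShuffledGrid strategy might prefer a fixed default grid size when the user gives no n (a sketch; the value 20 is arbitrary):

import MLJTuning, MLJBase

MLJTuning.default_n(tuning::MyShuffledGrid, range::MLJBase.NumericRange) = 20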

The supports_heuristic trait

If you define a selection heuristic SpecialHeuristic (see below) and that heuristic is specific to a tuning strategy SomeTuningStrategy, then you must define

MLJTuning.supports_heuristic(::SomeTuningStrategy, ::SpecialHeuristic) = true

Sample implementations

A number of built-in tuning strategy implementations of the MLJTuning API can be found at /src/strategies.

The Explicit strategy is the simplest, but is also an odd case, as the range is just an iterator of MLJ models. These models need not share a common type and model is never cloned.

For a slightly less trivial example, see the Grid search code.

How do I implement a new selection heuristic?

Recall that a selection heuristic is a rule which decides on the "best model" given the model evaluations in the tuning history. New heuristics are introduced by defining a new struct SomeHeuristic subtyping SelectionHeuristic and implementing a method

MLJTuning.best(heuristic::SomeHeuristic, history) -> history_entry

where history_entry is the entry in the history corresponding to the model deemed "best".

Below is a simplified version of the code defining the default heuristic NaiveSelection(), which simply chooses the model with the lowest (or highest) aggregated performance estimate, based on the first measure specified by the user in their TunedModel construction (more than one may be specified).

struct NaiveSelection <: MLJTuning.SelectionHeuristic end

function best(heuristic::NaiveSelection, history)
	measurements = [h.measurement[1] for h in history]
	measure = first(history).measure[1]
	if orientation(measure) == :score
		measurements = -measurements
	end
	best_index = argmin(measurements)
	return history[best_index]
end

Because this selection heuristic is generic (applies to all tuning strategies) we additionally define

MLJTuning.supports_heuristic(strategy, heuristic::NaiveSelection) = true

For strategy-specific selection heuristics, see above on how to set this trait.

mljtuning.jl's People

Contributors

ablaom, beastyblacksmith, davnn, dilumaluthge, github-actions[bot], juliatagbot, lhnguyen-vn, ludoro, okonsamuel, olivierlabayle, pebeto, pitmonticone, rikhuijzer, tlienart, vollmersj


mljtuning.jl's Issues

combination of acceleration and acceleration_resampling

@ablaom Ok. I have had a look at issues #15 and #14. Here are some of my concerns about the combination of the acceleration and acceleration_resampling parameters.
All combinations of those parameters seemed fine to me except:

  1. acceleration = CPUProcesses() and acceleration_resampling = CPUProcesses(). In this case the same worker cores used to accelerate the tuning process are also used to accelerate the resampling process. This may result in a slowdown. Depending on the case, using either (acceleration = CPU1() and acceleration_resampling = CPUProcesses()) or (acceleration = CPUProcesses() and acceleration_resampling = CPU1()) would be better.
  2. acceleration = CPUThreads() and acceleration_resampling = CPUProcesses(). (Same issue as in 1.)
    My suggestion would be to not allow users to use the above two cases.
    Any objections?

Incrementing `n` by 1 is adding 2 models to history

dtc = DecisionTreeClassifier()
r   = range(dtc, :max_depth, lower=1, upper=50);

tmodel = TunedModel(model=dtc, ranges=[r, ],
                tuning=Grid(resolution=50),
                measure=cross_entropy,
                n=48);
mach = machine(tmodel, (@load_iris)...)

julia> fit!(mach, verbosity=2);
[ Info: Updating Machine{ProbabilisticTunedModel{Grid,},} @969.
[ Info: Attempting to add 1 models to search, bringing total to 49. 
measurement: 9.611640903764574
measurement: 9.611640903764574
[ Info: Training Machine{DecisionTreeClassifier,} @695.

@assert length(report(mach).history) == 48

tmodel.n = 49
fit!(mach, verbosity=2);

julia> length(report(mach).history)
50

Weird thing is that if I use an initial n=10 and increment to n=11, the issue is absent.

replace @distributed with pmap

Currently in MLJ acceleration with CPUThreads is implemented using @distributed. This effectively splits the given range (1:nfolds or 1:nmetamodels) into equal chunks and sends them off to all workers loaded with addprocs. This is great if each chunk runs in the same amount of time; otherwise some overhead is experienced. Also, the user lacks the ability to specify the actual workers to be used in computing. (This might not be a big deal.)
A pmap implementation allows the user more control (if they wish) over how these tasks are sent to the workers (due to the batch_size and AbstractWorkerPool options it exposes).
Previously the main reason for not adopting pmap was that nested pmap hangs; see JuliaLang/Distributed.jl#62 (there is a workaround for this stated there).
The only limitation left in adopting this is that calling pmap from within Threads.@spawn sometimes hangs (although I don't think it is practical to call pmap from threads; what is more common is calling threads from processes); see JuliaLang/Distributed.jl#69

Skipping parts of search space?

Hi,

There are some parts of the search space where my model will fail to return a usable result, and this triggers a BoundsError at the prediction step. The parts of the search space where this will occur are not clearly defined (and are probabilistic), so clean! is not applicable. I am wondering how I may wrap the tuning with a try-catch so that certain errors are simply returned as an infinite loss, rather than breaking the entire tuning and exiting?

Thanks!
Miles

Machines wrapping `TunedModel` instances should never cache data

Since TunedModel is just a wrapper, caching data might create an unnecessary copy.

Here tmodel::TunedModel wraps an EvoTreeClassifier object and X is a DataFrame.

mach = machine(tmodel, X, y) |> fit!
julia> mach.data # "outer" unnecessary cached data
(3×2 DataFrame
 Row │ a        b          
     │ Float64  Float64    
─────┼─────────────────────
   1 │     1.0  0.136725
   2 │     2.0  0.00546956
   3 │     3.0  0.947711, CategoricalArrays.CategoricalValue{Char, UInt32}['a', 'a', 'a'])

julia> mach.cache[end].fitresult.machine.data # atomic model specific cached data
((matrix = [1.0 0.13672511011651545; 2.0 0.005469560151032837; 3.0 0.9477113320687569], names = [:a, :b]), CategoricalArrays.CategoricalValue{Char, UInt32}['a', 'a', 'a'])

The matrix is Tables.matrix(X) and so is a copy, not a view.

Remedy: declare MLJBase.caches_data_by_default(::Type{<:TunedModel}) = false.

Tuned Model interface doesn't have class_weights

The TunedModel interface supports the weights parameter but not class_weights.
For classification problems with highly imbalanced classes, we need tuned-model measures working with class weights.

Intermittent failure of CI for Julia 1.3

Sometimes CI fails for Julia 1.3 (never 1.0). Maybe this is related to #48 and will go away when that is resolved (unsafe multithreading). For the record, the problem is noted here:

ERROR: LoadError: On worker 2:
LoadError: LoadError: EOFError: read end of file
read at ./iostream.jl:361
parse_cache_header at ./loading.jl:1334
stale_cachefile at ./loading.jl:1413
_require_search_from_serialized at ./loading.jl:752
_require at ./loading.jl:1001
require at ./loading.jl:922
require at ./loading.jl:917
include at ./boot.jl:328 [inlined]
include_relative at ./loading.jl:1105
include at ./Base.jl:31 [inlined]
include at /home/travis/build/alan-turing-institute/MLJTuning.jl/test/models.jl:7
top-level scope at /home/travis/build/alan-turing-institute/MLJTuning.jl/test/models.jl:14
include at ./boot.jl:328 [inlined]
include_relative at ./loading.jl:1105
include at ./Base.jl:31
include at ./client.jl:424
top-level scope at none:0
eval at ./boot.jl:330
#105 at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Distributed/src/process_messages.jl:290
run_work_thunk at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Distributed/src/process_messages.jl:79
run_work_thunk at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Distributed/src/process_messages.jl:88
#98 at ./task.jl:333
in expression starting at /home/travis/build/alan-turing-institute/MLJTuning.jl/test/models/DecisionTree.jl:7
in expression starting at /home/travis/build/alan-turing-institute/MLJTuning.jl/test/models.jl:14
Stacktrace:
 [1] sync_end(::Array{Any,1}) at ./task.jl:300
 [2] macro expansion at ./task.jl:319 [inlined]
 [3] remotecall_eval(::Module, ::Array{Int64,1}, ::Expr) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Distributed/src/macros.jl:217
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Distributed/src/macros.jl:201
 [5] include at ./boot.jl:328 [inlined]
 [6] include_relative(::Module, ::String) at ./loading.jl:1105
 [7] include(::Module, ::String) at ./Base.jl:31
 [8] include(::String) at ./client.jl:424
 [9] top-level scope at none:6

cc: @OkonSamuel

Non-thread safe use of resampling machines

Strictly speaking, as currently implemented, calling fit! on a resampling machine mach mutates the value of mach.model.resampling if resampling has an RNG as a field (all of them do, I believe). It may make sense to modify this behaviour by changing the interface point for RNGs in resampling (in MLJBase). However, in the meantime, I suggest we insert a deep copy on the right hand side of https://github.com/alan-turing-institute/MLJTuning.jl/blob/db5ab433c0570641eade702db0dca55379643dbe/src/tuned_models.jl#L447

cc @OkonSamuel

Allow hyper-parameter tuning for immutable models.

Some context: JuliaML/TableTransforms.jl#67

I don't think this would be too bad, and useful preparation for making the MLJ model interface more flexible later.

The MLJTuning API doesn't really touch on this point. A tuning strategy needs to implement a models method to generate models to evaluate, but the API doesn't say how the models are generated. They needn't be mutations of a single object. However, the MLJ model interface currently states that models must be mutable, so some tuning strategies do use mutation to generate their models.

TODO:

  • To see if the change would be breaking, update this table:
tuning strategy         assumes model types are mutable   pkg providing strategy
Grid                    yes                               MLJTuning
RandomSearch            yes                               MLJTuning
LatinHypercube          yes                               MLJTuning.jl
MLJTreeParzenTuning()   ?                                 TreeParzen.jl
ParticleSwarm           ?                                 MLJParticleSwarmOptimization.jl
AdaptiveParticleSwarm   ?                                 MLJParticleSwarmOptimization.jl
Explicit()              no                                MLJTuning.jl

cc @juliohm

For a 0.5 release

  • Modify API to accommodate user-specified "selection heuristic" (#75)

  • Bump version

Decouple training weights from weights for measures, in TunedModel

This brings the tuning in line with the proposal in JuliaAI/MLJBase.jl#405 .

This would be breaking for clients but does not break the API for tuning strategy implementations.

However, the corresponding change in MLJBase (an explicit dependency of this one) will only become live in a new minor release of that package (0.15). So, by lifting MLJTuning's [compat] of MLJBase to this minor version at the same time as implementing the current issue, we can safely release the change as a patch.

Add particle swarm strateg(ies)

Particle swarm optimization (PSO) is a simple and computationally cheap optimization method that explores the search space with a coordinated swarm. Since most available tuning strategies only generate a list of models without any optimization heuristics, PSO would be a valuable addition to the existing collection of strategies.

For our purposes of hyperparameter tuning, a good implementation should be able to support both NumericRanges and NominalRanges, or any combinations of the two. To this end categorical hyperparameters would be embedded as probability vectors to extend usual PSO variants.

A naive algorithm would expose some meta-hyperparameters to the user (e.g. inertia, cognitive and social coefficients), but an adaptive scheme such as in Optim.jl could automate this choice.

Given the population of the swarm, each agent maps to a model to be evaluated at each iteration. The total number of iterations is calculated from the set number of models n of the TunedModel. In cases where the strategy generates a surplus of models (e.g. 3 agents and a maximum number of 10 models would leave 2 models unevaluated in the last iteration), the extra models would be saved for later warm-starts (see #131 for more detail).

Typo in error message for `TunedModel` missing arguments

julia> t = TunedModel(ConstantClassifier())
ERROR: ArgumentError: You need to specify `range=...`, unless `tuning=Explicit` and and `models=...` is specified instead. 
Stacktrace:

This should read "tuning=Explicit()" (parentheses forgotten).

Score measure maximization double sign flip

There is a double sign flip when optimizing for a score measure.

As I understand it, lines 51-53 should be removed to fix score measure maximization in the method best(heuristic::NaiveSelection, history).

https://github.com/alan-turing-institute/MLJTuning.jl/blob/3e989d5e7d677d73b4c06d087f9a10c6fadcd511/src/selection_heuristics.jl#L46-L58

The first sign flip is done at line 50 (where weights[1] is negative, from the method measure_adjusted_weights) and then a second flip reverts the sign at line 53.

learning_curve is not using "smart" fitting (performance issue)

I have observed that when increasing the number of trees in a random forest the algorithm does not generate evaluations at uniform intervals of time, but slows down. This strongly suggests that the model is being refit from scratch every time.

x1 = rand(100);
x2 = rand(100);
x3 = rand(100);
X = (x1=x1, x2=x2, x3=x3);
y = 2*x1 .+ 5*x2 .- 3*x3 .+ 0.2*rand(100);
atom = DecisionTreeRegressor()
ensemble = EnsembleModel(atom=atom, n=50)
mach = machine(ensemble, X, y)
r_n = range(ensemble, :n, lower=10, upper=10000)

Now run the next line and observe the slow down as the output is generated:

learning_curve(mach, range=r_n, verbosity=2)

For a 0.3 release

Merged onto dev:

  • (breaking) Add enhancements to the tuning strategy interface. This should not break use of TunedModel, and the built-in strategies Grid and Explicit are updated with no new behaviour (#21)

  • (breaking) Add n_remaining argument to models! method to give access to number of evaluations remaining. This should not break use of TunedModel, and the built-in strategies Grid and Explicit are updated with no new behaviour (#23 PR #26)

Frameworks for HP optimization

Julia HP optimization packages:

Other HP optimization packages:

There are projects that benchmark different AutoML systems: https://openml.github.io/automlbenchmark/
From our conversation: JuliaAI/MLJ.jl#416 (comment)
I wanted to tell you guys about Optuna (repo & paper), a new framework for HP optimization.
A nice comparison w/ Hyperopt shows what can be done for HP visualization:
https://neptune.ai/blog/optuna-vs-hyperopt


A 3 minute clip: https://www.youtube.com/watch?v=-UeC4MR3PHM

It would really be amazing for MLJ to incorporate this!

Higher dimensional ranges and nested ranges specification

One-dimensional ranges live in MLJBase; how does that fit with MLJTuning, and with the generalisation where you may want to specify "spaces" for sets of parameters?

It might be interesting to see how this is done in other optimisation packages in Julia, such as JuMP.

Looking beyond Julia, there is MLRMBO, which can handle seriously complex parameter spaces (see example). MLR3 has a parameter package called paradox; nested conditions can be described as outlined here: nested parameter conditions.

Improve the `Explicit` strategy

This strategy (originally added for testing purposes only) has not been publicised, but needs some improvements before doing so:

Problems:

  • Currently the explicit list (or iterator) of models is specified as range and all models need to have the same type - which is extremely restrictive, as the main use case is for evaluating multiple models of different types.

  • Currently the user has to specify some model=... , say the first in the list, which is clunky

Suggested resolution:

  • Add a new keyword models=... whose specification flags the strategy automatically as Explicit, and whose value is the iterator of models to be compared. Internally models is copied to range. An error is thrown if models and range are both specified and not the same, or if model and models are both specified but model isa eltype(models) is not true. This maintains backwards compatibility.

Implementation:

To get around the design issue requiring all models to have the same type, we can apply a thin wrapper to all models in the list and give a wrapped model the same setproperty!/getproperty interface as the original. The alternative (I'm guessing, from memory) is to remove the model type parameter M from MLJBase.Resampler{M} which could be painful, but worth checking as this would be less of a hack. Any loss of performance in dropping the type parameter is likely trivial in 99% of use cases.

Add `n_remaining` argument to models! (breaking)

Some simple tuning strategies, such as RandomSearch (which I am working on), will want to
return as many models as possible in one hit, for evaluation in parallel. Unfortunately, the number of iterations set by the user in their TunedModel instance is not currently available to the models! method.

I propose we pass a new argument n_remaining to the models! method:

MLJTuning.models!(tuning::MyTuningStrategy, model, history, state, n_remaining, verbosity)

Here n_remaining is the difference tuned_model.n - length(history).

Are GridSearch using the update! method?

Hi everyone,

While benchmarking some toy grid searches, I obtained odd results, and it seemed to me that performing a grid search using a TunedModel is slower than it should be.

The idea is to run a grid search over a model that implements the update method, and avoid re-fitting models from scratch for each sampled hyper-parameter set. More precisely, by arranging the grid search so it only changes 1 hyper-parameter per iteration.

Here is a sample code on a toy problem, using EnsembleModel and DecisionTree. The idea is to play with the number of estimators of the Ensemble, and find the optimal one. While a naïve approach would be to restart training from scratch for each new number of estimators, a smarter approach would be to start at the lowest number, and add 1 estimator at each iteration. The updating cost of the ensemble model is then very low (only 1 new estimator to fit) and we expect the grid search to be much faster.

using MLJ, BenchmarkTools, MLJModels

X = MLJ.table(rand(100, 10));
y = 2X.x1 - X.x2 + 0.05*rand(100);
tree_model = @load  DecisionTreeRegressor
RNG = 90125

Solving this problem using the MLJ interface:

# Tuned Model
forest_model = EnsembleModel(atom=tree_model, rng=RNG)
r = range(forest_model, :n; values=[i for i in 4:103]);
all_rows = collect(1:100)

self_tuning_forest_model = TunedModel(model=forest_model,
                                      tuning=Grid(shuffle=false),
                                      resampling=[(all_rows, all_rows)],
                                      range=r,
                                      measure=rms);

self_tuning_forest = machine(self_tuning_forest_model, X, y);
fit!(self_tuning_forest, verbosity=1)
m1 = self_tuning_forest.report.best_history_entry.measurement[1]
n1 = self_tuning_forest.report.best_history_entry.model.n

@btime begin
    self_tuning_forest = machine(self_tuning_forest_model, X, y);
    fit!(self_tuning_forest, verbosity=0)
end

Then, I have implemented 2 manual grid searches. The first is not intelligent and will restart from scratch; the second will only mutate the n field of the EnsembleModel and update the associated machine.

# Get the ranges values for n_estimator
values = self_tuning_forest.report.plotting.parameter_values

# Dumb Grid
results = Vector{Float64}(undef, 100)
forest_model = EnsembleModel(atom=tree_model, rng=RNG)
for i in values
    forest_model.n = i
    mach = machine(forest_model, X, y)        
    fit!(mach, verbosity=0)
    rms(predict(mach, X), y)
    results[i-3] = rms(predict(mach, X), y)
end

m3, ind = findmin(results)
n3 = values[ind]

@btime begin
    for i in values        
        forest_model.n = i
        mach = machine(forest_model, X, y)        
        fit!(mach, verbosity=0)
        rms(predict(mach, X), y)
    end
end

# Smart retraining
results = Vector{Float64}(undef, 100)
forest_model = EnsembleModel(atom=tree_model, rng=RNG)
mach = machine(forest_model, X, y)
for i in values
    forest_model.n = i
    fit!(mach, verbosity=0)
    results[i-3] = rms(predict(mach, X), y)
end

m2, ind = findmin(results)
n2 = values[ind]

@btime begin
    mach = machine(forest_model, X, y)
    for i in values
        forest_model.n = i
        fit!(mach, verbosity=0)
        rms(predict(mach, X), y)
    end
end

The obtained results are the following:

Measure               Tuned Model   Dumb Grid   Smart Grid
Fitting Time          737ms         737ms       79ms
Metric (rms)          0.097         0.097       0.099
Optimal n_estimator   11            11          4

Given those results, it seems to me that the Grid Search using a TunedModel is just performing a naïve search by retraining every new model from scratch, instead of re-fitting them. We can also see that we can improve the speed of the grid search by a factor of 10 on this toy example.

I started delving into the implementation details, and found that the problem was not coming from the Grid implementation. The Grid creates a list of models to train by cloning and mutating them, but if we mutate the model field of a machine and set it to a new one, the machine should still update itself, as in this example:

### Cloning model, keeping the machine
forest_model = EnsembleModel(atom=tree_model, rng=RNG)
mach = machine(forest_model, X, y)        
fit!(mach)

forest_model_2 = deepcopy(forest_model)
forest_model_2.n +=1
mach.model = forest_model_2
fit!(mach)

Then I started looking at the TunedModel code, but things are becoming much more complicated and I'm afraid I would not be able to understand it alone.

As always, thanks for the time and support you provide me.

`learning_curve` throwing nested task error

X, y = make_blobs()

model = (@load RandomForestClassifier pkg=DecisionTree)()
mach = machine(model, X, y)

r = range(model, :n_trees, lower=10, upper=70, scale=:log10)
many_curves = learning_curve(mach,
                             range=r,
                             resampling=Holdout(),
                             measure=cross_entropy,
                             rng_name=:rng,
                             rngs=1)

Evaluating Learning curve with 1 rngs:   0%[>                 ]  ETA: N/A┌ Error: Problem fi
tting the machine Machine{RandomForestClassifier,}.                              
└ @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:533
[ Info: Running type checks... 
[ Info: Type checks okay. 
┌ Error: Problem fitting the machine Machine{Resampler{Holdout},}. 
└ @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:533
[ Info: Running type checks... 
[ Info: Type checks okay. 
┌ Error: Problem fitting the machine Machine{ProbabilisticTunedModel{Grid,},}. 
└ @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:533
[ Info: Running type checks... 
[ Info: Type checks okay. 
ERROR: TaskFailedException
Stacktrace:
  [1] wait
    @ ./task.jl:322 [inlined]
  [2] threading_run(func::Function)
    @ Base.Threads ./threadingconstructs.jl:34
  [3] macro expansion
    @ ./threadingconstructs.jl:93 [inlined]
  [4] build_forest(labels::Vector{UInt32}, features::Matrix{Float64}, n_subfeatures::Int64, 
n_trees::Int64, partial_sampling::Float64, max_depth::Int64, min_samples_leaf::Int64, min_samples_split::Int64, min_purity_increase::Float64; rng::Random.MersenneTwister)        
    @ DecisionTree ~/.julia/packages/DecisionTree/iWCbW/src/classification/main.jl:223
  [5] fit(m::MLJDecisionTreeInterface.RandomForestClassifier, verbosity::Int64, X::DataFrames.DataFrame, y::CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}})                           
    @ MLJDecisionTreeInterface ~/.julia/packages/MLJDecisionTreeInterface/RZmUr/src/MLJDecisionTreeInterface.jl:200                                                             
  [6] fit_only!(mach::Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}; rows::
Vector{Int64}, verbosity::Int64, force::Bool)                                              
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:531
  [7] #fit!#103
    @ ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:598 [inlined]
  [8] fit_and_extract_on_fold
    @ ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:1088 [inlined]
  [9] (::MLJBase.var"#276#277"{MLJBase.var"#fit_and_extract_on_fold#299"{Vector{Tuple{Vector{Int64}, Vector{Int64}}}, Nothing, Nothing, Int64, Vector{LogLoss{Float64}}, Vector{typeof(predict)}, Bool, Bool, CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}}, DataFrames.DataFrame}, Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}, Int64, ProgressMeter.Progress})(k::Int64)
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:932
 [10] mapreduce_first
    @ ./reduce.jl:392 [inlined]
 [11] _mapreduce(f::MLJBase.var"#276#277"{MLJBase.var"#fit_and_extract_on_fold#299"{Vector{Tuple{Vector{Int64}, Vector{Int64}}}, Nothing, Nothing, Int64, Vector{LogLoss{Float64}}, Vector{typeof(predict)}, Bool, Bool, CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}}, DataFrames.DataFrame}, Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}, Int64, ProgressMeter.Progress}, op::typeof(vcat), #unused#::IndexLinear, 
A::UnitRange{Int64})                                                                       
    @ Base ./reduce.jl:403
 [12] _mapreduce_dim
    @ ./reducedim.jl:318 [inlined]
 [13] #mapreduce#672
    @ ./reducedim.jl:310 [inlined]
 [14] mapreduce
    @ ./reducedim.jl:310 [inlined]
 [15] _evaluate!(func::MLJBase.var"#fit_and_extract_on_fold#299"{Vector{Tuple{Vector{Int64}, Vector{Int64}}}, Nothing, Nothing, Int64, Vector{LogLoss{Float64}}, Vector{typeof(predict)}, Bool, Bool, CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}}, DataFrames.DataFrame}, mach::Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}, #unused#::CPU1{Nothing}, nfolds::Int64, verbosity::Int64)
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:931
 [16] evaluate!(mach::Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}, resampling::Vector{Tuple{Vector{Int64}, Vector{Int64}}}, weights::Nothing, class_weights::Nothing, rows::Nothing, verbosity::Int64, repeats::Int64, measures::Vector{LogLoss{Float64}}, operations::Vector{typeof(predict)}, acceleration::CPU1{Nothing}, force::Bool)              
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:1126
 [17] evaluate!(::Machine{MLJDecisionTreeInterface.RandomForestClassifier, true}, ::Holdout, ::Nothing, ::Nothing, ::Nothing, ::Int64, ::Int64, ::Vector{LogLoss{Float64}}, ::Vector{typeof(predict)}, ::CPU1{Nothing}, ::Bool)                                                 
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:1193
 [18] fit(::Resampler{Holdout}, ::Int64, ::DataFrames.DataFrame, ::CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}})                           
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/resampling.jl:1337
 [19] fit_only!(mach::Machine{Resampler{Holdout}, false}; rows::Nothing, verbosity::Int64, force::Bool)                                                                                
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:531
 [20] #fit!#103
    @ ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:598 [inlined]
 [21] event!(metamodel::MLJDecisionTreeInterface.RandomForestClassifier, resampling_machine::Machine{Resampler{Holdout}, false}, verbosity::Int64, tuning::Grid, history::Nothing, state
::NamedTuple{(:models, :fields, :parameter_scales, :models_delivered), Tuple{Vector{MLJDecisionTreeInterface.RandomForestClassifier}, Vector{Symbol}, Vector{Symbol}, Bool}})
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/tuned_models.jl:395
 [22] #35
    @ ~/.julia/packages/MLJTuning/efiDR/src/tuned_models.jl:433 [inlined]
 [23] iterate
    @ ./generator.jl:47 [inlined]
 [24] _collect(c::Vector{MLJDecisionTreeInterface.RandomForestClassifier}, itr::Base.Generator{Vector{MLJDecisionTreeInterface.RandomForestClassifier}, MLJTuning.var"#35#36"{Machine{Resampler{Holdout}, false}, Int64, Grid, Nothing, NamedTuple{(:models, :fields, :parameter_scales, :models_delivered), Tuple{Vector{MLJDecisionTreeInterface.RandomForestClassifier}, Vector{Symbol}, Vector{Symbol}, Bool}}, ProgressMeter.Progress}}, #unused#::Base.EltypeUnknown, 
isz::Base.HasShape{1})                                                                     
    @ Base ./array.jl:695
 [25] collect_similar
    @ ./array.jl:606 [inlined]
 [26] map
    @ ./abstractarray.jl:2294 [inlined]
 [27] assemble_events!(metamodels::Vector{MLJDecisionTreeInterface.RandomForestClassifier}, 
resampling_machine::Machine{Resampler{Holdout}, false}, verbosity::Int64, tuning::Grid, history::Nothing, state::NamedTuple{(:models, :fields, :parameter_scales, :models_delivered), Tuple{Vector{MLJDecisionTreeInterface.RandomForestClassifier}, Vector{Symbol}, Vector{Symbol}, Bool}}, acceleration::CPU1{Nothing})
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/tuned_models.jl:432
 [28] build!(history::Nothing, n::Int64, tuning::Grid, model::MLJDecisionTreeInterface.RandomForestClassifier, model_buffer::Channel{Any}, state::NamedTuple{(:models, :fields, :parameter_scales, :models_delivered), Tuple{Vector{MLJDecisionTreeInterface.RandomForestClassifier}, Vector{Symbol}, Vector{Symbol}, Bool}}, verbosity::Int64, acceleration::CPU1{Nothing}, resampling_machine::Machine{Resampler{Holdout}, false})                                     
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/tuned_models.jl:625
 [29] fit(::MLJTuning.ProbabilisticTunedModel{Grid, MLJDecisionTreeInterface.RandomForestClassifier}, ::Int64, ::DataFrames.DataFrame, ::CategoricalVector{Int64, UInt32, Int64, CategoricalValue{Int64, UInt32}, Union{}})                           
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/tuned_models.jl:704
 [30] fit_only!(mach::Machine{MLJTuning.ProbabilisticTunedModel{Grid, MLJDecisionTreeInterface.RandomForestClassifier}, true}; rows::Nothing, verbosity::Int64, force::Bool)
    @ MLJBase ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:531
 [31] #fit!#103
    @ ~/.julia/packages/MLJBase/HZmTU/src/machines.jl:598 [inlined]
 [32] (::MLJTuning.var"#61#62"{Machine{MLJTuning.ProbabilisticTunedModel{Grid, MLJDecisionTreeInterface.RandomForestClassifier}, true}, Nothing, Symbol, Int64, ProgressMeter.Progress})
(rng::Random.MersenneTwister)                                                              
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/learning_curves.jl:231
 [33] mapreduce_first
    @ ./reduce.jl:392 [inlined]
 [34] _mapreduce(f::MLJTuning.var"#61#62"{Machine{MLJTuning.ProbabilisticTunedModel{Grid, MLJDecisionTreeInterface.RandomForestClassifier}, true}, Nothing, Symbol, Int64, ProgressMeter.Progress}, op::typeof(MLJTuning._collate), #unused#::IndexLinear, A::Vector{Random.MersenneTwister})                                                                   
    @ Base ./reduce.jl:403
 [35] _mapreduce_dim
    @ ./reducedim.jl:318 [inlined]
 [36] #mapreduce#672
    @ ./reducedim.jl:310 [inlined]
 [37] mapreduce
    @ ./reducedim.jl:310 [inlined]
 [38] _tuning_results(rngs::Vector{Random.MersenneTwister}, acceleration::CPU1{Nothing}, tuned::Machine{MLJTuning.ProbabilisticTunedModel{Grid, MLJDecisionTreeInterface.RandomForestClassifier}, true}, rows::Nothing, rng_name::Symbol, verbosity::Int64)
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/learning_curves.jl:229
 [39] learning_curve(::MLJDecisionTreeInterface.RandomForestClassifier, ::MLJBase.Source, ::
Vararg{MLJBase.Source, N} where N; resolution::Int64, resampling::Holdout, weights::Nothing, measures::Nothing, measure::LogLoss{Float64}, rows::Nothing, operation::Nothing, ranges::Nothing, range::MLJBase.NumericRange{Int64, MLJBase.Bounded, Symbol}, repeats::Int64, acceleration::CPU1{Nothing}, acceleration_grid::CPU1{Nothing}, verbosity::Int64, rngs::Int64, rng_name::Symbol, check_measure::Bool)                                                      
    @ MLJTuning ~/.julia/packages/MLJTuning/efiDR/src/learning_curves.jl:173
 [40] #learning_curve#58
    @ ~/.julia/packages/MLJTuning/efiDR/src/learning_curves.jl:92 [inlined]
 [41] top-level scope
    @ REPL[44]:1

    nested task error: AssertionError: length(ints) == 501
    Stacktrace:
      [1] mt_setfull!(r::Random.MersenneTwister, #unused#::Type{UInt64})
        @ Random /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/RNGs.jl:260
      [2] reserve1
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/RNGs.jl:291 [inlined]
      [3] mt_pop!
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/RNGs.jl:296 [inlined]
      [4] rand
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/RNGs.jl:464 [inlined]
      [5] rand
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/Random.jl:256 [inlined]
      [6] rand(rng::Random.MersenneTwister, sp::Random.SamplerRangeNDL{UInt64, Int64})
        @ Random /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/generation.jl:332
      [7] rand!
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/Random.jl:271 [inlined]                              
      [8] rand!(rng::Random.MersenneTwister, A::Vector{Int64}, X::UnitRange{Int64})
        @ Random /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/Random.jl:266
      [9] rand
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/Random.jl:279 [inlined]
     [10] rand
        @ /Users/julia/buildbot/worker/package_macos64/build/usr/share/julia/stdlib/v1.6/Random/src/Random.jl:282 [inlined]
     [11] macro expansion
        @ ~/.julia/packages/DecisionTree/iWCbW/src/classification/main.jl:224 [inlined]
     [12] (::DecisionTree.var"#62#threadsfor_fun#22"{Random.MersenneTwister, Vector{UInt32}, Matrix{Float64}, Int64, Int64, Int64, Float64, DecisionTree.var"#20#21"{Vector{Float64}}, Vector{Union{DecisionTree.Leaf{UInt32}, DecisionTree.Node{Float64, UInt32}}}, Int64, Int64, UnitRange{Int64}})(onethread::Bool)
        @ DecisionTree ./threadingconstructs.jl:81
     [13] (::DecisionTree.var"#62#threadsfor_fun#22"{Random.MersenneTwister, Vector{UInt32}, Matrix{Float64}, Int64, Int64, Int64, Float64, DecisionTree.var"#20#21"{Vector{Float64}}, Vector{Union{DecisionTree.Leaf{UInt32}, DecisionTree.Node{Float64, UInt32}}}, Int64, Int64, UnitRange{Int64}})()
        @ DecisionTree ./threadingconstructs.jl:48
(MachineLearningInJulia2020) pkg> status
      Status `~/Google Drive/Julia/MLJ/MachineLearningInJulia2020/Project.toml`
  [336ed68f] CSV v0.9.6
  [324d7699] CategoricalArrays v0.10.1
  [ed09eef8] ComputationalResources v0.3.2
  [a93c6f00] DataFrames v1.2.2
  [7806a523] DecisionTree v0.10.11
  [31c24e10] Distributions v0.25.18
  [f6006082] EvoTrees v0.8.4
  [98b081ad] Literate v2.9.3
  [add582a8] MLJ v0.16.9
  [a7f614a8] MLJBase v0.18.23
  [d354fa79] MLJClusteringInterface v0.1.4
  [094fc8d1] MLJFlux v0.2.5
  [6ee0df7b] MLJLinearModels v0.5.6
  [d491faf4] MLJModels v0.14.12
  [1b6a4a23] MLJMultivariateStatsInterface v0.2.2
  [5ae90465] MLJScikitLearnInterface v0.1.10
  [b8a86587] NearestNeighbors v0.4.9
  [a03496cd] PlotlyBase v0.8.18
  [91a5bcdd] Plots v1.22.4
  [321657f4] ScientificTypes v2.3.0
  [2913bbd2] StatsBase v0.33.10
  [bd369af6] Tables v1.6.0
  [b8865327] UnicodePlots v2.4.6
  [9a3f8284] Random

Julia 1.6.3

TagBot trigger issue

This issue is used to trigger TagBot; feel free to unsubscribe.

If you haven't already, you should update your TagBot.yml to include issue comment triggers.
Please see this post on Discourse for instructions and more details.

Error message not propagated on setting acceleration=CPUThreads

When setting acceleration=CPUThreads(), the error message, which should be defined in clean!:
https://github.com/alan-turing-institute/MLJTuning.jl/blob/c64bff7d8e68c25f63ac49820b884314046b30d3/src/tuned_models.jl#L190-L191

is not shown properly:

ERROR: LoadError: MethodError: Cannot `convert` an object of type ComputationalResources.CPU1{Nothing} to an object of type ComputationalResources.CPUThreads{Nothing}
Closest candidates are:
  convert(::Type{S}, ::T) where {S, T<:(Union{CategoricalString{R}, CategoricalValue{T,R} where T} where R)} at /home/darren/.julia/packages/CategoricalArrays/dmrjI/src/value.jl:103
  convert(::Type{T}, ::T) where T at essentials.jl:168
  ComputationalResources.CPUThreads{Nothing}(::Any) where T at /home/darren/.julia/packages/ComputationalResources/vtKSz/src/ComputationalResources.jl:69
Stacktrace:
 [1] setproperty!(::MLJTuning.ProbabilisticTunedModel{Grid,MyLDAPipe,ComputationalResources.CPUThreads{Nothing},ComputationalResources.CPU1{Nothing}}, ::Symbol, ::ComputationalResources.CPU1{Nothing}) at ./Base.jl:21
 [2] clean!(::MLJTuning.ProbabilisticTunedModel{Grid,MyLDAPipe,ComputationalResources.CPUThreads{Nothing},ComputationalResources.CPU1{Nothing}}) at /home/darren/.julia/packages/MLJTuning/65NXe/src/tuned_models.jl:192
 [3] #TunedModel#5(::MyLDAPipe, ::Grid, ::StratifiedCV, ::Nothing, ::MLJBase.CrossEntropy{Float64}, ::Nothing, ::Function, ::Array{Any,1}, ::Array{Any,1}, ::Bool, ::Int64, ::Nothing, ::ComputationalResources.CPUThreads{Nothing}, ::ComputationalResources.CPU1{Nothing}, ::Bool, ::typeof(TunedModel)) at /home/darren/.julia/packages/MLJTuning/65NXe/src/tuned_models.jl:170
 [4] (::MLJTuning.var"#kw##TunedModel")(::NamedTuple{(:model, :tuning, :resampling, :measure, :acceleration, :ranges),Tuple{MyLDAPipe,Grid,StratifiedCV,MLJBase.CrossEntropy{Float64},ComputationalResources.CPUThreads{Nothing},Array{Any,1}}}, ::typeof(TunedModel)) at ./none:0
 [5] top-level scope at /home/darren/project/PTSDClassifier/train_classifier_local.jl:240
 [6] include at ./boot.jl:328 [inlined]
 [7] include_relative(::Module, ::String) at ./loading.jl:1105
 [8] include(::Module, ::String) at ./Base.jl:31
 [9] include(::String) at ./client.jl:424
 [10] top-level scope at REPL[1]:1
in expression starting at /home/darren/project/PTSDClassifier/train_classifier_local.jl:230

This might be closed when we add CPUThreads as an option (#15)?

Make a tuned model's number of iterations, n, available to setup method

Latin hypercube strategy needs to know the total iteration count up-front but it cannot get this information. The temporary solution in #96 is to introduce a separate n_max parameter for the Latin strategy, but there is little use in this having a different value.

So the proposal is to add n to the signature of setup:

state = setup(tuning::MyTuningStrategy, model, range, n, verbosity)
