jameslamb commented on June 4, 2024

Thanks for using LightGBM.

I've reformatted your post a bit to make the difference between code, output from code, and your own words a bit clearer. If you're new to GitHub and/or writing markdown, please see https://docs.github.com/en/get-started/writing-on-github/getting-started-with-writing-and-formatting-on-github/basic-writing-and-formatting-syntax#quoting-code to learn about how I did that.


I don't immediately see what's wrong with your model file but in general... don't manually merge files like this. There are just too many ways for it to go wrong (mixed line endings, accidental inclusion of special characters, failure to update all parameters correctly, etc.).

If it's acceptable not to train both models at the same time, do this:

  1. train the first model
  2. save it to a text file with .save_model()
  3. move that file over to the other machine
  4. train another model on that machine, with new data, passing the path to the first model file to init_model

That will cause LightGBM to begin boosting from the end of the first model, using your new dataset. It's not exactly the same as the process you described (where you train two models from scratch and then combine them), but it's a good way to incorporate learnings from new data into an already-trained model.

Like this pattern:

```python
import lightgbm as lgb
from sklearn.metrics import mean_absolute_error

# assumes params is a dict of training parameters and
# lgb_train / lgb_eval are lgb.Dataset objects

# train the first model and save it to a text file
init_gbm = lgb.train(params, lgb_train, num_boost_round=20)
model_name = "model.txt"
init_gbm.save_model(model_name)

# continue boosting from that file, for 30 more rounds
evals_result = {}
gbm = lgb.train(
    params,
    lgb_train,
    num_boost_round=30,
    valid_sets=lgb_eval,
    # custom eval metric: returns (name, value, is_higher_better)
    feval=(lambda p, d: ("custom_mae", mean_absolute_error(p, d.get_label()), False)),
    callbacks=[lgb.record_evaluation(evals_result)],
    init_model=model_name,
)
```
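Because init_model supplies the first 20 rounds and num_boost_round adds 30 more, the final gbm here contains 50 trees in total.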

There are some more details on how to do this at https://stackoverflow.com/questions/73664093/lightgbm-train-vs-update-vs-refit/73669068#73669068
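One of the alternatives covered there is Booster.refit(), which keeps the tree structure of an existing model fixed and only re-estimates its leaf values on new data. A minimal sketch, assuming the gbm booster from above and hypothetical new arrays X_new / y_new:

```python
# refit: keep every tree's structure, recompute leaf values from new data
# (X_new / y_new are placeholders for your new features and labels)
refitted_gbm = gbm.refit(data=X_new, label=y_new)
refitted_gbm.save_model("refitted_model.txt")
```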


LightGBM's C API has a function, LGBM_BoosterMerge(), that I believe can be used to take 2 models and combine them without additional training, but it's not currently exposed in the Python package.

LightGBM/src/c_api.cpp, lines 2008 to 2009 at commit 28536a0:

```cpp
int LGBM_BoosterMerge(BoosterHandle handle,
                      BoosterHandle other_handle) {
```

We could consider exposing that in the Python package.
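In the meantime, it's technically possible to reach that C function from Python through lightgbm's private internals. This is an unofficial sketch, not a supported API: LGBM_BoosterMerge itself is real (see the snippet above), but the Python wiring here, including lightgbm.basic._LIB and the Booster's private handle attribute, is an assumption that may break between versions.

```python
# UNOFFICIAL sketch: relies on lightgbm's private internals, which can
# change between releases. Use at your own risk.
from lightgbm.basic import _LIB, _safe_call

def merge_boosters(booster_a, booster_b):
    """Merge booster_b's trees into booster_a via LGBM_BoosterMerge, no training."""
    # the Booster's C handle attribute is private and version-dependent:
    # older releases expose .handle, newer ones ._handle
    handle_a = getattr(booster_a, "_handle", None) or getattr(booster_a, "handle", None)
    handle_b = getattr(booster_b, "_handle", None) or getattr(booster_b, "handle", None)
    _safe_call(_LIB.LGBM_BoosterMerge(handle_a, handle_b))
    return booster_a
```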
