This repository houses the supporting code for the paper *Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent*.
The Dense Gradient Tree (DGT) technique supports learning decision trees of a given height for both multi-class classification and regression, under either standard supervised feedback or bandit feedback. In the bandit-feedback setting, the true loss function is unknown to the learning algorithm; the learner can only query the loss of a given prediction. The goal is then to learn decision trees in an online manner: at each round the learner maintains a tree model, makes a prediction for the presented features, receives a loss, and updates the tree model, as sketched below.
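The bandit-feedback loop can be pictured with a minimal sketch. Everything here is a hypothetical stand-in rather than this repository's API: `predict` replaces the tree's forward pass, the loss oracle hides its target from the learner, and the update uses a generic two-point zeroth-order gradient estimate (the paper's quantized gradient-descent algorithm differs, and a strict bandit setting allows only one loss query per round):

```python
import torch

torch.manual_seed(0)
dim, lr, mu = 8, 0.05, 1e-2
theta = torch.zeros(dim)                  # stand-in for the tree model's parameters

def predict(theta, x):
    # Stand-in predictor; in DGT this would be the tree's forward pass.
    return theta @ x

def loss_oracle(y_hat, x):
    # The environment reveals only the loss of the submitted prediction;
    # the target used here is hidden from the learner.
    target = x.sum()
    return (y_hat - target) ** 2

for t in range(500):                      # online rounds
    x = torch.randn(dim)                  # features presented at round t
    u = torch.randn(dim)                  # random perturbation direction
    loss_plus = loss_oracle(predict(theta + mu * u, x), x)
    loss_minus = loss_oracle(predict(theta - mu * u, x), x)
    grad_est = (loss_plus - loss_minus) / (2 * mu) * u   # zeroth-order gradient estimate
    theta = theta - lr * grad_est         # update the model from loss values alone
```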
- Install necessary packages

  Create a new conda environment named `dgt_env` with `python==3.6.8` and `pytorch==1.7.0`, and install all dependencies inside it:

  ```
  $ conda env create -f dgt_env.yml
  $ conda activate dgt_env
  ```
- Change working directory to `src`:

  ```
  $ cd src
  ```
- Run the algorithm

  To reproduce some of our results, please run `bash run.sh`.

  - The script by default runs our algorithm with height 6 on `ailerons`. Commands for `abalone`, `satimage`, and `pendigits` are commented out.
  - To change the height of the learnt tree, change the argument to the `--height` flag (a sample invocation is sketched after this list).
  - The `--proc_per_gpu` option denotes how many processes to run per GPU. It defaults to 4, which suits a typical GPU, but on a GPU with little memory it may need to be reduced.
  - The `--num_gpu` option denotes how many GPUs to parallelize over (and assumes the device ordinals of the GPUs start at 0). It defaults to 1.

  Note: For the `abalone` dataset we report the final performance across 5 different shuffles.
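For reference, a run might combine the flags above as in the following hypothetical invocation; the entry point `main.py` and the `--dataset` flag are illustrative assumptions (the actual command, including any extra arguments, is in `run.sh`):

```
# Hypothetical command; consult run.sh for the real entry point and arguments.
$ python main.py --dataset abalone --height 8 --proc_per_gpu 2 --num_gpu 1
```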
- Check results

  Final scores, i.e., mean test RMSE/accuracy and standard deviation, can be found in the file `./out/exp@{dataset}_{height}@{start_time}/meanstd-exps/meanstd-run-summary.csv` under the columns `test_acc_mean` and `test_acc_std`.
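To read these two numbers programmatically, here is a minimal sketch assuming pandas is available in the environment; the run directory below is a made-up example standing in for the `{dataset}`, `{height}`, and `{start_time}` placeholders:

```python
import pandas as pd

# The directory name below is an example; use an actual run directory under ./out/.
summary = pd.read_csv(
    "./out/exp@abalone_6@2021-06-01_12-00-00/meanstd-exps/meanstd-run-summary.csv"
)
print(summary[["test_acc_mean", "test_acc_std"]])
```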
Contributors:

- Ajaykrishna Karthikeyan
- Naman Jain
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.