Comments (5)
-
For kfold, yes! With the help of
carefree-data
,carefree-learn
is able to performKRandom
andKFold
easily. In fact, I've provided some example codes (./tests/benchmark/test_openml.py
) to illustrate how to utilize these kinds of splits.For instance, you can perform kfold with:
import cflearn
from cfdata.tabular import *
model = "fcnn"
config = {}
x, y = TabularDataset.iris().xy
benchmark = cflearn.Benchmark(
"iris",
TaskTypes.CLASSIFICATION,
models=model,
)
results = benchmark.k_fold(
10, # num repeat
x, y,
run_tasks=True,
benchmarks={model: {"custom": config}},
)
results.comparer.log_statistics()
Which will produce:
================================================================================================================================
| metrics | acc | auc |
--------------------------------------------------------------------------------------------------------------------------------
| | mean | std | score | mean | std | score |
--------------------------------------------------------------------------------------------------------------------------------
| fcnn_iris_custom | 0.873333 | 0.105198 | 0.768135 | 0.977333 | 0.024074 | 0.953259 |
================================================================================================================================
BTW The above codes will create a folder called
__iris__
in your cwd, and you can walk around the files to see what's going on under the hood 😄
-
For time series, currently it is not supported, but will be supported in the near future in
carefree-data
😉Once
carefree-data
supports time series,carefree-learn
will follow up with some time series models ASAP 😆
from carefree-learn.
Hi @carefree0910 , Thank you for all the work and looking forward to testing the updates. I will be looking at them soon. Thanks again!
from carefree-learn.
That is fantastic news! I am looking forward to this update :)
from carefree-learn.
Hi @jmrichardson ! Since time series datasets have lots of formats, so I'm just wondering whether it is convinient for you to provide me a toy example of your dataset (e.g. 100 samples from the original dataset which are already desensitized), so I can design the APIs better, thanks in advance! 😆
from carefree-learn.
Hi @jmrichardson ! I've updated some codes and now time series datasets are supported! Hope you enjoy these examples, and if they don't meet your demand, feel free to re-open this issue at any time! 😆
from carefree-learn.
Related Issues (20)
- Introduce callbacks to `Trainer`
- Enhance `LossBase`
- Make sure that models are always in eval mode in inference
- Support specifying `resource_config` of `Parallel` in `Experiment`
- Try to accelerate `DNDF` with `Function`
- Text fields support? HOT 8
- Depend `batch_size` on dataset size
- Plateau monitoring should depend on running mean HOT 1
- Should log checkpoints earlier when using `warmup`
- Support `use_final_bn` in `FCNNHead`
- Fix `cflearn.deepspeed`
- Try to support customizing `Pipeline`.
- Newly registered models could not be used in distributed training.
- Support directly registering an `nn.Module`
- Beautify `tqdm` display in distributed training.
- Add unittest for #78
- Bugs may occur when ctrl+c at inference stage.
- nb HOT 1
- Examples won't run HOT 2
- What version is working? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from carefree-learn.