Comments (4)
Thanks for your issue. Is the order of the channel-cfgs
in step 4 the same as that of algorithm.channel_cfg
in step 3?
from mmrazor.
Thanks for your issue. Is the order of the
channel-cfgs
in step 4 the same as that ofalgorithm.channel_cfg
in step 3?
Yes. the channel-cfgs
in step 4 is the same as that of algorithm.channel_cfg
in step 3, the same file.
python -m torch.distributed.launch --nproc_per_node=8 ./tools/mmcls/train_mmcls.py
configs/pruning/autoslim/autoslim_mbv2_subnet_8xb256_in1k.py
--cfg-options algorithm.channel_cfg=/root/work/mmrazor/work_dirs/autoslim_mbv2_search_8xb1024_in1k/subnet_199761362.yaml
--launcher pytorch
By the way, for step3 if 'algorithm.channel_cfg' is 3 files which is the same as 'channel-cfgs' in step 4, that's all right.
But if 'algorithm.channel_cfg' is only 1 file which is the same as 'channel-cfgs' in step 4, the test result in step 5 is wrong.
from mmrazor.
I'm a little confused. Maybe you do not need to split checkpoint when you only retrain one subnet('algorithm.channel_cfg' is only 1 file). Only when you retrain multiple subnets together (just like autoslim), you need to split the checkpoint before you test subnets with different flops.
By the way, experiments show that replacing LabelSmoothLoss
with CrossEntropyLoss
may lead to better results. And we will fix it later.
from mmrazor.
When i only retrain one subnet, i dont need to split checkpoint, I knew it when I tried to retrain 3 subnets later.
But when the first time i used it, i did not know. you should fix the split step or include a detailed explanation.
I'll try the CrossEntropyLoss. How good?
Is autoslim suitable for mmdet.FasterRCNN or yolo? @HIT-cwh
Thank you for your great work.
from mmrazor.
Related Issues (20)
- I can't reproduce dfad results
- How to get started??
- [Bug] TypeError: 'NoneType' object is not iterable
- Try to reproduce CWD in VOC data set
- [Bug] (suggested temporary fix) Pytorch >= 2 causes mmrazor.engine to fail HOT 4
- [Bug] (suggested fix) `nn.Parameter` are not added to root after being traced in `mmrazor.models.task_utils.tracer.fx.custom_tracer.build_graphmodule()` HOT 2
- [Bug] (suggested fix) `mmrazor.models.algorithms.quantization.mm_architecture.MMArchitectureQuant.sync_qparams()` fails if there are modules present in other modes but not in forward `mode='tensor'` HOT 4
- I want to obtain the current epoch value and associate it with the custom distillation loss
- cannot use recorder to obtain panoptic_head info from mask2former
- [Bug] `mmrazor.engine.runner.quantization_loops.QATValLoop` calls `after_val_epoch` hook twice with different keys, causing `mmengine.hooks.checkpoint_hook._save_best_checkpoint()` to fail with `KeyError` for the `save_best` config
- [Bug] Custom Distillation MMSeg CWD loss nan problem
- When I use methodoutputs to access the results of assigner, I only obtain one sample
- Regarding tables and accuracy
- [Bug] (suggested fix) `mmrazor.models.algorithms.mm_architecture.MMArchitectureQuant.get_deploy_model()` fails if `predict` mode lacks nodes from the `model.quantizer.tracer.skipped_methods` configuration, but the architecture `quantizer.prepare(fp32_model)` has these nodes. HOT 4
- Is this a dead project ? HOT 1
- 我在用mmrazor通过yolov5-x蒸馏yolov5-s时候遇到了问题 HOT 1
- No Sign of activation quantization with QAT HOT 1
- MAP is stucked at 0 for Mobilenet V2 SSD QAT without pretrained model [Bug]
- [Docs] A100算力加持!书生大模型实战营第3期全面升级,趣味闯关模式等你开启
- Missing keys after RTMDET knowledge distillation HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mmrazor.