Comments (3)
Can you set both hq_token_only = True and multiask_ Output= False? And can you show your training loss log? did you train with the box prompt as input?
from sam-hq.
This is the training loss as follows:
At the beginning
Finished epoch: 0
Averaged stats: training_loss: 0.4607 (0.6271) loss_dice: 0.3408 (0.4383) loss_mask: 0.1234 (0.1887
Validating...
valid_dataloader len: 59
[ 0/59] eta: 0:01:52 val_boundary_iou_0: 0.3348 (0.3348) val_iou_0: 0.4443 (0.4443) time: 1.899 max mem: 10998
[58/59] eta: 0:00:00 val_boundary_iou_0: 0.3429 (0.3641) val_iou_0: 0.4028 (0.4534) time: 0.702 max mem: 10998
Total time: 0:00:42 (0.7280 s / it)
Averaged stats: val_boundary_iou_0: 0.3429 (0.3641) val_iou_0: 0.4028 (0.4534)
valid_dataloader len: 35
[ 0/35] eta: 0:00:16 val_boundary_iou_1: 0.5491 (0.5491) val_iou_1: 0.6967 (0.6967) time: 0.472 max mem: 10998
[34/35] eta: 0:00:00 val_boundary_iou_1: 0.5697 (0.5639) val_iou_1: 0.7347 (0.7131) time: 0.160 max mem: 10998
Total time: 0:00:06 (0.1767 s / it)
Averaged stats: val_boundary_iou_1: 0.5697 (0.5639) val_iou_1: 0.7347 (0.7131)
valid_dataloader len: 36
[ 0/36] eta: 0:01:05 val_boundary_iou_2: 0.5984 (0.5984) val_iou_2: 0.7327 (0.7327) time: 1.813 max mem: 10998
[35/36] eta: 0:00:00 val_boundary_iou_2: 0.5396 (0.5335) val_iou_2: 0.6921 (0.6936) time: 0.742 max mem: 10998
Total time: 0:00:33 (0.9207 s / it)
Averaged stats: val_boundary_iou_2: 0.5396 (0.5335) val_iou_2: 0.6921 (0.6936)
valid_dataloader len: 63
[ 0/63] eta: 0:00:54 val_boundary_iou_3: 0.2103 (0.2103) val_iou_3: 0.3468 (0.3468) time: 0.864 max mem: 10998
[62/63] eta: 0:00:00 val_boundary_iou_3: 0.2943 (0.2850) val_iou_3: 0.4127 (0.4070) time: 0.302 max mem: 10998
Total time: 0:00:19 (0.3136 s / it)
Averaged stats: val_boundary_iou_3: 0.2943 (0.2850) val_iou_3: 0.4127 (0.4070)
come here save at /home/quchunguang/datasets/sam-hq/output1/epoch_0.pth
At the end
epoch: 99 learning rate: 1.0000000000000006e-12
[ 0/50] eta: 0:01:27 training_loss: 0.2335 (0.2335) loss_dice: 0.1677 (0.1677) loss_mask: 0.065: 1.7565 data: 0.9814 max mem: 11005
[49/50] eta: 0:00:00 training_loss: 0.2519 (0.2657) loss_dice: 0.1849 (0.1967) loss_mask: 0.067: 0.7031 data: 0.0039 max mem: 11005
Total time: 0:00:36 (0.7278 s / it)
Finished epoch: 99
Averaged stats: training_loss: 0.2519 (0.2657) loss_dice: 0.1849 (0.1967) loss_mask: 0.0675 (0.0690
Validating...
valid_dataloader len: 59
[ 0/59] eta: 0:01:51 val_boundary_iou_0: 0.3998 (0.3998) val_iou_0: 0.4573 (0.4573) time: 1.888 max mem: 11005
[58/59] eta: 0:00:00 val_boundary_iou_0: 0.2846 (0.3288) val_iou_0: 0.3507 (0.3955) time: 0.681 max mem: 11005
Total time: 0:00:42 (0.7137 s / it)
Averaged stats: val_boundary_iou_0: 0.2846 (0.3288) val_iou_0: 0.3507 (0.3955)
valid_dataloader len: 35
[ 0/35] eta: 0:00:15 val_boundary_iou_1: 0.4165 (0.4165) val_iou_1: 0.4103 (0.4103) time: 0.455 max mem: 11005
[34/35] eta: 0:00:00 val_boundary_iou_1: 0.4708 (0.4933) val_iou_1: 0.5607 (0.5773) time: 0.158 max mem: 11005
Total time: 0:00:06 (0.1734 s / it)
Averaged stats: val_boundary_iou_1: 0.4708 (0.4933) val_iou_1: 0.5607 (0.5773)
valid_dataloader len: 36
[ 0/36] eta: 0:00:59 val_boundary_iou_2: 0.5588 (0.5588) val_iou_2: 0.6405 (0.6405) time: 1.651 max mem: 11005
[35/36] eta: 0:00:00 val_boundary_iou_2: 0.5378 (0.5127) val_iou_2: 0.6370 (0.6172) time: 0.755 max mem: 11005
Total time: 0:00:32 (0.8922 s / it)
Averaged stats: val_boundary_iou_2: 0.5378 (0.5127) val_iou_2: 0.6370 (0.6172)
valid_dataloader len: 63
[ 0/63] eta: 0:00:52 val_boundary_iou_3: 0.2597 (0.2597) val_iou_3: 0.4398 (0.4398) time: 0.835 max mem: 11005
[62/63] eta: 0:00:00 val_boundary_iou_3: 0.3006 (0.3051) val_iou_3: 0.3893 (0.4143) time: 0.301 max mem: 11005
Total time: 0:00:19 (0.3076 s / it)
Averaged stats: val_boundary_iou_3: 0.3006 (0.3051) val_iou_3: 0.3893 (0.4143)
come here save at /home/quchunguang/datasets/sam-hq/output1/epoch_99.pth
The training loss ranges from 0.46 to 0.23, and the network converges; Or increase epoch to obtain smaller training loss?
My training input did not use box prompt. Just use box prompt for coarse target localization during local testing. According to your prompt, the segmentation effect after modifying the parameters is as follows, but the effect is not very ideal:
from sam-hq.
Then what's the used prompt during your network training? How many training images are there?
from sam-hq.
Related Issues (20)
- Data Loader throwing FileNotFound error After few epochs of training HOT 1
- About custom dataset. HOT 2
- I wanted to train the model with my images HOT 3
- How do I run the training script if I only have one graphics card? HOT 6
- Can the weight format generated by your training code be directly used in GroundedSAM HOT 2
- Hello author, how do you measure loss_mask and loss dice, do you have any references? HOT 2
- When I use ''multimask_output=True'', only one mask is generated HOT 1
- Killing Subprocess during training HOT 2
- Why does my implementation of sam-hq light takes longer time? HOT 1
- When using my own data for training, the first epoch reached an IOU value of over 0.9. May I ask if there is a problem? The data is 5000 HOT 1
- Where is the training script for Light HQ-SAM? HOT 1
- DecompressionBombWarning HOT 1
- How to export the tensorrt model? HOT 1
- Performing batch image detection using a for loop can lead to memory leakage. HOT 2
- Error when trying to load embedding image in browser using onnx HOT 3
- How to generate embedding file using SAMHQ? HOT 2
- how to generate a sliding comparison gif file in the Visual comparison part HOT 2
- The fine-tuned HQ-SAM model shows a significant improvement in accuracy. However, the val_iou_0 accuracy of HQ-SAM during training is very low. HOT 2
- how to train the model from previous saved checkpoints HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sam-hq.