Comments (3)
Any value other than 16 for this setting triggers this error, which is strange.
from transunet.
Hi,
I tried setting vit_patches_size to 8 instead of 16, but got the error below (note that the image size is 224):
Input In [25], in Transformer.__init__(self, config, img_size, vis)
    257 def __init__(self, config, img_size, vis):
    258     super(Transformer, self).__init__()
--> 259     self.embeddings = Embeddings(config, img_size=img_size)
    260     self.encoder = Encoder(config, vis)

Input In [25], in Embeddings.__init__(self, config, img_size, in_channels)
    140     patch_size = (img_size[0] // b_size // grid_size[0], img_size[1] // b_size // grid_size[1])
    141     patch_size_real = (patch_size[0] * b_size, patch_size[1] * b_size)
--> 142     n_patches = (img_size[0] // patch_size_real[0]) * (img_size[1] // patch_size_real[1])
    143     self.hybrid = True
    144 else:

ZeroDivisionError: integer division or modulo by zero
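For anyone hitting the same error, the integer arithmetic on lines 140 to 142 of the traceback can be replayed by hand. The sketch below is not the repository's code. It assumes a square image, that b_size is 16 (the backbone's downsampling factor), and that the grid side is img_size // vit_patches_size. The function name hybrid_patch_geometry is made up for illustration.

```python
def hybrid_patch_geometry(img_size, vit_patches_size, b_size=16):
    """Replay the integer arithmetic from the traceback for a square image.

    Assumptions (not confirmed by the thread): b_size is 16, and the grid
    side is img_size // vit_patches_size.
    """
    grid = img_size // vit_patches_size             # assumed grid side
    patch_size = img_size // b_size // grid         # traceback line 140
    patch_size_real = patch_size * b_size           # traceback line 141
    n_patches = (img_size // patch_size_real) ** 2  # traceback line 142
    return grid, patch_size, patch_size_real, n_patches


for size in (16, 8):
    try:
        print(size, hybrid_patch_geometry(224, size))
    except ZeroDivisionError as exc:
        # With 8, the grid side (28) exceeds 224 // 16 = 14, so
        # patch_size floors to 0 and line 142 divides by zero.
        print(size, "fails:", exc)
```

Under these assumptions, any vit_patches_size that makes the grid side larger than img_size // b_size (14 here) floors patch_size to 0, which would explain the ZeroDivisionError for a value of 8.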
May I ask whether it runs properly for you with image_size 224 and patch_size 16? On my side I get the following error:
RuntimeError: Calculated padded input size per channel: (14 x 14). Kernel size: (16 x 16). Kernel size can't be greater than actual input size
I checked and found that after the convolutional stages in the encoder, the feature map has shape (1024, 14, 14), ignoring the batch dimension. At that point the feature map is only 14x14, which is too small for a ViT patch embedding with a patch_size of 16. Am I doing something wrong?
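The size mismatch in this RuntimeError can be checked with the usual convolution output-size formula. This is only a sketch: conv_output_size is a hypothetical helper, dilation is ignored, and it is an assumption that the patch embedding is a convolution whose kernel and stride both equal the patch size.

```python
def conv_output_size(in_size, kernel, stride=1, padding=0):
    """Output side length of a convolution along one axis (dilation ignored)."""
    padded = in_size + 2 * padding
    if kernel > padded:
        # Same condition the RuntimeError reports.
        raise ValueError(f"kernel {kernel} is larger than padded input {padded}")
    return (padded - kernel) // stride + 1


feature_map = 224 // 16  # 14, the side of the (1024, 14, 14) feature map

# A 1x1 patch on the 14x14 feature map keeps all 14 * 14 = 196 positions.
print(conv_output_size(feature_map, kernel=1, stride=1))

# A 16x16 kernel fits the raw 224x224 image, but not the 14x14 feature map.
print(conv_output_size(224, kernel=16, stride=16))
try:
    conv_output_size(feature_map, kernel=16, stride=16)
except ValueError as exc:
    print("fails:", exc)
```

If that assumption holds, the error is consistent with a 16x16 patch kernel being applied to the already downsampled 14x14 feature map instead of to the 224x224 input. The thread does not confirm the cause, so it may be worth checking which patch size the embedding layer actually receives.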
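"Any value other than 16 for this setting triggers this error, which is strange."
Friend, may I ask whether it runs properly with an image size of 224 and a patch_size of 16?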