Comments (7)
To get 19_large_depthwise_conv2d_torch_extension installed correctly on Windows 10, you can try the steps below.
- Modify the `cast.h` file according to BachiLi/diffvg#26 (comment). This file can be found at C:\Users\****\AppData\Roaming\Python\Python39\site-packages\torch\include\pybind11\cast.h, where pybind11 is located.
Specifically, change this part of the code:
template <typename T1, typename T2> class type_caster<std::pair<T1, T2>> : public tuple_caster<std::pair, T1, T2> {};
to:
template <typename T1, typename T2> class type_caster<std::pair<T1, T2>> {
    typedef std::pair<T1, T2> type;
public:
    bool load(handle src, bool convert) {
        if (!isinstance<sequence>(src))
            return false;
        const auto seq = reinterpret_borrow<sequence>(src);
        if (seq.size() != 2)
            return false;
        return first.load(seq[0], convert) && second.load(seq[1], convert);
    }
    static handle cast(const type &src, return_value_policy policy, handle parent) {
        auto o1 = reinterpret_steal<object>(make_caster<T1>::cast(src.first, policy, parent));
        auto o2 = reinterpret_steal<object>(make_caster<T2>::cast(src.second, policy, parent));
        if (!o1 || !o2)
            return handle();
        tuple result(2);
        PyTuple_SET_ITEM(result.ptr(), 0, o1.release().ptr());
        PyTuple_SET_ITEM(result.ptr(), 1, o2.release().ptr());
        return result.release();
    }
    static constexpr auto name = _("Pair");
    template <typename T> using cast_op_type = type;
    operator type() & { return type(cast_op<T1>(first), cast_op<T2>(second)); }
    operator type() && { return type(cast_op<T1>(std::move(first)), cast_op<T2>(std::move(second))); }
protected:
    make_caster<T1> first;
    make_caster<T2> second;
};
- Modify the CUDA files cutlass\19_large_depthwise_conv2d_torch_extension\*.cu:
19_large_depthwise_conv2d_torch_extension\backward_data_fp16.cu
19_large_depthwise_conv2d_torch_extension\backward_data_fp32.cu
19_large_depthwise_conv2d_torch_extension\backward_filter_fp16.cu
19_large_depthwise_conv2d_torch_extension\backward_filter_fp32.cu
19_large_depthwise_conv2d_torch_extension\forward_fp16.cu
19_large_depthwise_conv2d_torch_extension\forward_fp32.cu
In each of these files, change
options.update({input.size(0), input.size(2), input.size(3), input.size(1)},
{weight.size(0), weight.size(2), weight.size(3), 1});
to
options.update({(int)input.size(0), (int)input.size(2), (int)input.size(3), (int)input.size(1)},
{(int)weight.size(0), (int)weight.size(2), (int)weight.size(3), 1});
- Run `python setup.py install --user`.
- Finish.
Tested on Windows 10, Python 3.9, VS2022.
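A note on why the `(int)` casts in the .cu step may be needed: the thread does not state the reason, but a likely explanation is that `input.size(i)` returns a 64-bit integer, while the coordinates passed to `options.update` hold plain `int` values. Converting a non-constant 64-bit value to `int` inside a braced initializer is a narrowing conversion, which MSVC rejects as an error while GCC typically only warns. The sketch below illustrates the same pattern in isolation. `Coord4`, `to_int`, and `make_coord` are hypothetical names invented for this example; they are not part of CUTLASS or PyTorch.

```cpp
#include <cassert>
#include <climits>
#include <cstdint>

// Hypothetical stand-in for a 4-D coordinate type with int members.
// The real CUTLASS type is not reproduced here.
struct Coord4 {
    int n, h, w, c;
};

// Explicitly narrow a 64-bit size to int; the range is checked in debug builds.
inline int to_int(std::int64_t v) {
    assert(v >= INT_MIN && v <= INT_MAX);
    return static_cast<int>(v);
}

// Hypothetical helper: build a coordinate from 64-bit sizes, such as the
// values a tensor's size(i) accessor would return.
inline Coord4 make_coord(std::int64_t n, std::int64_t h,
                         std::int64_t w, std::int64_t c) {
    // Writing Coord4{n, h, w, c} here would narrow int64_t to int inside
    // braces. The explicit conversions below avoid that.
    return Coord4{to_int(n), to_int(h), to_int(w), to_int(c)};
}
```

Calling `make_coord(8, 224, 224, 3)` yields a `Coord4` whose fields are 8, 224, 224, and 3. The C-style `(int)` casts in the step above have the same effect as `static_cast<int>`, only without the range check.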
from replknet-pytorch.
Hi, we have not tried Windows, but this seems to be a CUDA-version-related error. We used both CUDA 10.2 and 11.2, and it worked fine.
from replknet-pytorch.
Hey, add me on QQ so we can chat; I'm on Windows too. 1183654643
from replknet-pytorch.
Did you manage to solve it? My Windows 10 install fails with error: command 'C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Tools\MSVC\14.29.30133\bin\HostX86\x64\link.exe' failed with exit status 1181
from replknet-pytorch.
Solved it by installing Linux. Training is about 4x faster, but it is still fairly slow.
from replknet-pytorch.
Hey, how did you solve it? Is your contact info correct? I can't find you on QQ.
from replknet-pytorch.
I deleted my QQ account. WeChat: liu_yi_fei_lao_gong
from replknet-pytorch.