Comments (4)
@Costigan Thank you for submitting this bug report. It seems to be an internal ILGPU compiler issue that should not occur and is not related to an error in your program. I have already investigated the described problem and I guess that it has been caused by a synchronization issue in the ABI
class (fixed in d149410). After fixing this issue I can load the kernel using the CudaAccelerator
and CLAccelerator
classes without any problems.
Can you test whether the latest commits solve this problem? Furthermore, rolling back to version 0.6 should circumvent this problem as well.
from ilgpu.
Some additional remarks: It looks like you are not using any Group
synchronization instructions after loading data into shared memory. Is this intended?
Note that using shared memory in combination with implicitly grouped kernels can lead to undefined and unintended behavior. We should add an error message to ILGPU that simply forbids such kernels. Please use an explicitly grouped kernel to have fine-grained control regarding the number of threads per group.
from ilgpu.
I'll close this issue for now, since the above-mentioned commit solves this problem on all test machines. Should you encounter further problems, we can always take up this topic again.
from ilgpu.
@m4rs-mt I'm sorry for not responding quickly. I ended up going with a multi-core solution, and performance was good enough. However, getting this implementation going will help me in the future. I'll get it running and add another comment confirming that. Wrt the synchronization instructions, you're right. I should have included one after loading shared memory. Thank you.
from ilgpu.
Related Issues (20)
- Debug.Assert is not take effect on opencl HOT 4
- Compilation fails if there are more than two loops,why ? HOT 28
- System.AccessViolationException when adding useless if statement in kernel HOT 8
- Intel GPUs ,float64 type is not supported on this device HOT 5
- Are vector data types supported? HOT 3
- A Tensor Library HOT 6
- VelocityDevice and MaxGridSize HOT 3
- Sample of "AlgorithmsRadixSort" failed on OpenCL device HOT 4
- `NullReferenceException` when passing empty `ArrayView`s to OpenCL kernel HOT 1
- XMath.Pow() only work on CPU HOT 3
- Better error messages when kernel program failed to run. HOT 1
- Is it possible to use a stored dataset on GPU again and again with throwing extra data to GPU, and even change the value of the established dataset? HOT 1
- Add a CPU-GPU-Shared MemoryBuffer for systems that support it HOT 2
- Iteration of value with loops on GPU slows down significantly HOT 5
- Feature request: cudaStreamWaitEvent HOT 7
- Higher precision float (decimal) support? HOT 2
- Passing Int128 as kernel parameter is not working HOT 3
- System.BadImageFormatException in System.Reflection.Metadata.dll HOT 4
- OpenCL.CLException HOT 2
- [QUESTION]: Exception in Accelerator.Synchronize on CUDA HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ilgpu.