Comments (4)
I ran a few preliminary comparisons myself, using double-precision complex transforms at FFT sizes 2^9, 2^18, and 2^21, and in all cases clFFT was much faster.
I think improving performance to achieve at least parity with clFFT is paramount.
from rocfft.
Thanks for your comment. Yes, we are working to get there!
Three years later, did rocFFT reach performance parity with clFFT?
Yes, for some of the cases! And it is faster for some of the non-power-of-2 cases.