Comments (2)
Splitting it in two functions could remove the boxing:
# ╔═╡ 633d1ddc-131d-465a-ac8c-c382c972dc31
function radon_tullio(I, θs, zs)
sinogram = similar(I, length(zs), length(θs))
fill!(sinogram, 0)
midpoint = size(I, 1) ÷ 2 + 1
if I isa CuArray
I_2 = linear_interpolation((1:size(I, 1), 1:size(I, 2)), I)
I_int = adapt(CuArray{Float32}, I_2)
@show "cuda"
else
I_int = linear_interpolation((1:size(I, 1), 1:size(I, 2)), I)
end
sθ = sin.(θs)
cθ = cos.(θs)
t(θs, zs, sinogram, midpoint, I_int)
return sinogram ./ maximum(sinogram)
end
# ╔═╡ 4e0f0aa8-614d-4bc3-bc93-94ba1235b1d7
function t(θs, zs, sinogram, midpoint, I_int)
sθ = sin.(θs)
cθ = cos.(θs)
@tullio sinogram[is, iθ] = @inbounds(begin
x = zs[z] * sθ[iθ] + zs[is] * cθ[iθ] + midpoint
y = -zs[z] * cθ[iθ] + zs[is] * sθ[iθ] + midpoint
I_int(y, x)
end)
end
from tullio.jl.
So it works now but performance it not better than the threaded nested for loops on the CPU.
Is an 200x200 array and ranges of 200 to small?
from tullio.jl.
Related Issues (20)
- Reporting a bug when Tullio being included with LoopVectorization HOT 1
- [Question] Is it possible to create a vector of SVectors from a Matrix using Tullio? HOT 2
- [Question] How to change summation order? HOT 5
- Use package extensions HOT 1
- How finalizers `|>` work HOT 5
- Method error when broadcast and sum of matrices HOT 1
- Upgrade to CUDA.CUDAKernels HOT 9
- Bug when using Tullio + LoopVectorization HOT 5
- Add Finch.jl backend HOT 4
- CUDA v4 support HOT 2
- Using threads, vs setting threads=false gives different result HOT 3
- Issue with vectorized functions on GPU HOT 3
- Error when specifying the range of an index with a UnitRange HOT 4
- Scalar indexing with CUDA HOT 10
- Please update dep of FillArrays to v1.
- Bad interaction with Enzyme? HOT 6
- Zygote with Tullio gives wrong gradients/pullbacks using CUDA HOT 1
- Use EllipsisNotation ? HOT 2
- Error when using Loopvectorization
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tullio.jl.