Skip to content

Pull requests: JuliaGPU/CUDA.jl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Enzyme: add make_zero of cuarrays
#2600 opened Dec 18, 2024 by wsmoses Loading…
Use GPUArrays caching allocator cuda array Stuff about CuArray. performance How fast can we go?
#2593 opened Dec 15, 2024 by pxl-th Draft
make CUDA randn work with Zygote
#2581 opened Dec 9, 2024 by bgctw Loading…
Directed rounding
#2576 opened Dec 3, 2024 by orkolorko Loading…
[CUSPARSE] Fix constructor of sparse empty matrices bugfix This gets something working again. cuda libraries Stuff about CUDA library wrappers.
#2575 opened Dec 2, 2024 by amontoison Loading…
Add function for recursively printing parameter memory cuda kernels Stuff about writing CUDA kernels. enhancement New feature or request speculative Not sure about this one yet.
#2560 opened Nov 21, 2024 by charleskawczynski Loading…
WIP: Native I/O. cuda kernels Stuff about writing CUDA kernels. speculative Not sure about this one yet.
#2485 opened Sep 5, 2024 by maleadt Draft
Bugfix for batched gemv bugfix This gets something working again. needs changes Changes are needed.
#2481 opened Aug 28, 2024 by kose-y Draft
High Level Wrapper for Fused Matmul + Bias + Activation cuda libraries Stuff about CUDA library wrappers. enhancement New feature or request
#2360 opened May 4, 2024 by avik-pal Draft
Mark more methods as device methods enhancement New feature or request
#2336 opened Apr 19, 2024 by vchuravy Draft
Use PrecompileTools to warmup CUDA.jl enhancement New feature or request needs changes Changes are needed.
#2325 opened Apr 15, 2024 by vchuravy Draft
Add a dispatch for LinearAlgebra.norm2 cuda array Stuff about CuArray. needs changes Changes are needed.
#2302 opened Mar 22, 2024 by sharanry Draft
KA 0.10 API changes
#2084 opened Sep 19, 2023 by vchuravy Draft
Use TaskLocalValues
#2075 opened Sep 8, 2023 by vchuravy Draft
Support FFT adjoint plans and test cuda libraries Stuff about CUDA library wrappers. enhancement New feature or request
#2073 opened Sep 4, 2023 by gaurav-arya Draft
Add contract through FastmathOverlays.jl cuda kernels Stuff about writing CUDA kernels. enhancement New feature or request
#2037 opened Aug 16, 2023 by vchuravy Draft
WIP: Add an index typevar to CuDeviceArray. enhancement New feature or request help wanted Extra attention is needed performance How fast can we go?
#1895 opened May 3, 2023 by maleadt Draft
Add an experimental opaque closure type. cuda kernels Stuff about writing CUDA kernels. enhancement New feature or request speculative Not sure about this one yet.
#1853 opened Apr 4, 2023 by maleadt Draft
Add wrappers for NVPERF
#1823 opened Mar 22, 2023 by vchuravy Draft
Use Atomix
#1790 opened Mar 10, 2023 by vchuravy Draft
4 tasks
Support for qr of strided inputs (non-contiguous views) cuda libraries Stuff about CUDA library wrappers. enhancement New feature or request
#1764 opened Feb 9, 2023 by evelyne-ringoot Draft
ProTip! Exclude everything labeled bug with -label:bug.