Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix template parameter
IterationsUnroll
type from int to bool
#1534
opened May 11, 2024 by
peakcrosser7
Loading…
Update half.h - typo at line 138(unnecessary space before '1')
#1527
opened May 8, 2024 by
sjbae1999
Loading…
add publication: ‘EVT: Accelerating Deep Learning Training with Epilo…
#1526
opened May 7, 2024 by
reed-lau
Loading…
feat: support kFactor 8 used in mma tensor op tile iterator
#1512
opened Apr 29, 2024 by
gavinchen430
Loading…
Fix C++17 version detection in helper_macros.hpp
#1479
opened Apr 12, 2024 by
nickjeliopoulos
Loading…
Fix device thread
gemm.h
constructor
inactive-30d
#1473
opened Apr 11, 2024 by
luliyucoordinate
Loading…
Add Faster Neighborhood Attention to PUBLICATIONS
inactive-30d
#1471
opened Apr 11, 2024 by
alihassanijr
Loading…
Add missing #include <memory> for definition of std::addressof.
#1470
opened Apr 10, 2024 by
Gregory-Meyer
Loading…
Fix B operand variable name and comments
inactive-30d
#1458
opened Apr 6, 2024 by
andylolu2
Loading…
Refactor to use FastDivmod for predicated strided dgrad iterators.
inactive-30d
#1453
opened Apr 3, 2024 by
ZelboK
Loading…
add a new epilogue for the case that the output is not packed
inactive-30d
#1437
opened Mar 28, 2024 by
hwu36
Loading…
Allow setting a custom TmaDescriptor for TMAStore.
inactive-30d
#1428
opened Mar 26, 2024 by
ipiszy
Loading…
Add support for mixed 4-bit/8-bit data types GEMM
#1413
opened Mar 19, 2024 by
alexsamardzic
Loading…
Add couple configs into generator.py for mixed input MM
inactive-30d
#1350
opened Feb 16, 2024 by
alexsamardzic
Loading…
Add support for dynamic offsets to DefaultEpilogue
inactive-30d
inactive-90d
#1274
opened Dec 19, 2023 by
ezhulenev
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.