DefTruth/CUDA-Learn-Notes Events in 2024 - Ecosyste.ms: Timeline

Wenzha0Wu starred DefTruth/CUDA-Learn-Notes

October 17, 2024 2:33am

DefTruth published a release on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:24am

v2.4.12 SGEMM TF32 Block Swizzle

## What's Changed * [SGEMM] SGEMM TF32 Thread Block Swizzle by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/84 * [HGEMM] mma4x4_warp4x4_stages with swizzle by @DefTruth in https...

DefTruth created a tag on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:24am

v2.4.12 - 🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.

DefTruth created a branch on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

opt-hgemm-mma - 🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.

DefTruth deleted a branch DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

opt-hgemm-mma

DefTruth pushed 1 commit to main DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

[SGEMM] Update SGEMM TF32 Benchmark (#87) * Update README.md * Update hgemm_wmma_stage.cu * Update README.md ... 8c6922b

Ecosyste.ms: Timeline

DefTruth/CUDA-Learn-Notes

Wenzha0Wu starred DefTruth/CUDA-Learn-Notes

October 17, 2024 2:33am

DefTruth published a release on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:24am

v2.4.12 SGEMM TF32 Block Swizzle

DefTruth created a tag on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:24am

DefTruth created a branch on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

DefTruth deleted a branch DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

DefTruth pushed 1 commit to main DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

DefTruth closed a pull request on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

[SGEMM] Update SGEMM TF32 Benchmark

DefTruth opened a pull request on DefTruth/CUDA-Learn-Notes

October 17, 2024 2:07am

[SGEMM] Update SGEMM TF32 Benchmark

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 17, 2024 2:05am

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 17, 2024 2:04am

Dylan-cx starred DefTruth/CUDA-Learn-Notes

October 17, 2024 2:03am

charlotteLive starred DefTruth/CUDA-Learn-Notes

October 17, 2024 1:48am

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 17, 2024 1:46am

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 17, 2024 1:43am

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 17, 2024 1:37am

DefTruth created a branch on DefTruth/CUDA-Learn-Notes

October 17, 2024 1:25am

DefTruth deleted a branch DefTruth/CUDA-Learn-Notes

October 17, 2024 1:25am

DefTruth pushed 1 commit to main DefTruth/CUDA-Learn-Notes

October 17, 2024 1:23am

DefTruth closed a pull request on DefTruth/CUDA-Learn-Notes

October 17, 2024 1:23am

[SWISH] support Swish F32/F16 kernel

DefTruth created a review on a pull request on DefTruth/CUDA-Learn-Notes

October 17, 2024 1:22am

ywq880611 starred DefTruth/CUDA-Learn-Notes

October 16, 2024 1:41pm

DefTruth deleted a branch DefTruth/CUDA-Learn-Notes

October 16, 2024 12:51pm

DefTruth pushed 1 commit to main DefTruth/CUDA-Learn-Notes

October 16, 2024 12:49pm

DefTruth closed a pull request on DefTruth/CUDA-Learn-Notes

October 16, 2024 12:49pm

[HGEMM] mma4x4_warp4x4_stages with swizzle

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:38pm

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:37pm

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:28pm

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:25pm

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:19pm

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

October 16, 2024 12:16pm