Ecosyste.ms: Timeline

Browse the timeline of events for every public repo on GitHub. Data updated hourly from GH Archive.

DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-hgemm-mma DefTruth/CUDA-Learn-Notes
  • Update hgemm_wmma_stage.cu 01d4710

DefTruth created a review comment on a pull request on DefTruth/CUDA-Learn-Notes
typo: relu -> swish ?

DefTruth created a review comment on a pull request on DefTruth/CUDA-Learn-Notes
Align the pragma unroll with the for loop

DefTruth created a review comment on a pull request on DefTruth/CUDA-Learn-Notes
Code style: this repository uses 2 spaces for indentation

DefTruth created a review on a pull request on DefTruth/CUDA-Learn-Notes

DefTruth created a comment on a pull request on DefTruth/CUDA-Learn-Notes
LGTM ~

DefTruth deleted a branch DefTruth/CUDA-Learn-Notes

opt-sgemm-swizzle

MenglingD starred DefTruth/CUDA-Learn-Notes
DefTruth created a branch on DefTruth/CUDA-Learn-Notes

opt-hgemm-mma - 🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.

DefTruth pushed 1 commit to main DefTruth/CUDA-Learn-Notes
  • [SGEMM] SGEMM TF32 Thread Block Swizzle (#84) * Update sgemm.py * Update sgemm_wmma_tf32_stage.cu * Update sge... a10bcb4

DefTruth closed a pull request on DefTruth/CUDA-Learn-Notes
[SGEMM] SGEMM TF32 Thread Block Swizzle
DefTruth opened a pull request on DefTruth/CUDA-Learn-Notes
[SGEMM] SGEMM TF32 Thread Block Swizzle
DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

RenShuhuai-Andy starred DefTruth/CUDA-Learn-Notes
DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes
  • Update hgemm_wmma_stage.cu 9c03d0f

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes
  • Update sgemm_wmma_tf32_stage.cu 9a49ac7

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes
  • Update sgemm_wmma_tf32_stage.cu cfed000

DefTruth pushed 1 commit to opt-sgemm-swizzle DefTruth/CUDA-Learn-Notes
