Ecosyste.ms: Timeline

Browse the timeline of events for every public repo on GitHub. Data updated hourly from GH Archive.

YangWang92

YangWang92 starred stanfordnlp/pyreft
YangWang92 starred aitomatic/semikong
YangWang92 starred AllAboutAI-YT/cursor_prompts
YangWang92 created a comment on an issue on microsoft/VPTQ
I think it is a bug, and can you help me to pull a request to fix it. You can be a contributor of the project. Thanks!

YangWang92 created a comment on an issue on microsoft/VPTQ
Let me check, thanks for your feedback!

YangWang92 created a comment on an issue on deepseek-ai/DeepSeek-V3
I found that they have some configurations https://github.com/deepseek-ai/DeepSeek-V3/tree/main/inference/configs, which include 16B/236B/671B.

YangWang92 forked sgl-project/sglang

YangWang92/sglang

YangWang92 created a comment on a pull request on deepseek-ai/DeepSeek-V3
I've uploaded the converted bf16 model here for everyone to use freely: https://huggingface.co/collections/opensourcerelease/deepseek-v3-bf16-676d7fa1b3f500d39f8f559b

YangWang92 pushed 1 commit to patch-1 YangWang92/DeepSeek-V3
  • Add CUDA cache clearing in memory management. Added torch.cuda.empty_cache() to free up unused memory on the GPU. 65d8f5f

YangWang92 pushed 1 commit to patch-1 YangWang92/DeepSeek-V3
  • sort filename to reduce memory costs e6e66fd

YangWang92 opened a pull request on deepseek-ai/DeepSeek-V3
handle missing scale_inv_name
Fixed an issue where `weight` and `weight_scale_inv` (e.g. `model.layers.39.mlp.experts.92.gate_proj.weight` and `model.layers.39.mlp.experts.92.gate_proj.weight_scale_inv`) were not in the same Sa...
YangWang92 pushed 1 commit to patch-1 YangWang92/DeepSeek-V3
  • handle missing scale_inv_name Fixed an issue where `weight` and `weight_scale_inv` (e.g. `model.layers.39.mlp.expert... 1e3a836

YangWang92 starred deepseek-ai/DeepSeek-V3
YangWang92 pushed 1 commit to master VPTQ/hessian_collector
  • add save inv hessian code 612a131

YangWang92 pushed 25 commits to master VPTQ/hessian_collector

YangWang92 closed a pull request on VPTQ/hessian_collector
M300
YangWang92 pushed 5 commits to m300 VPTQ/hessian_collector

YangWang92 opened a pull request on VPTQ/hessian_collector
M300
YangWang92 pushed 1 commit to main microsoft/VPTQ
  • fix: a small bug fix for the initialization of the residual index tensor. (#147) * Fixed a small bug in the initiali... 170770c

YangWang92 closed a pull request on microsoft/VPTQ
fix: a small bug fix for the initialization of the residual index tensor.
* Fixed a small bug in the initialization of the residual index tensor.
* Modified the README to prevent a single line of code from being too long to display on a single line.
YangWang92 starred hkust-nlp/mstar
YangWang92 starred bytedance/Valley
YangWang92 pushed 1 commit to main microsoft/VPTQ
  • Update README.md (#146) add algorithm link c951bf5

YangWang92 closed a pull request on microsoft/VPTQ
Update README.md
add algorithm link
YangWang92 pushed 1 commit to patch-4 YangWang92/VPTQ

YangWang92 opened a pull request on microsoft/VPTQ
Update README.md
add algorithm link
YangWang92 pushed 1 commit to patch-4 YangWang92/VPTQ

YangWang92 starred yt-dlp/yt-dlp
YangWang92 starred byjlw/video-analyzer