YangWang92 Events in 2024 - Ecosyste.ms: Timeline

YangWang92 pushed 1 commit to main microsoft/VPTQ

December 7, 2024 3:16pm

Update vptq_example.ipynb (#138) fix vptq install in ipynb 171af83

View on GitHub

YangWang92 closed a pull request on microsoft/VPTQ

December 7, 2024 3:16pm

YangWang92 opened a pull request on microsoft/VPTQ

December 7, 2024 3:16pm

YangWang92 created a branch on microsoft/VPTQ

December 7, 2024 3:15pm

YangWang92-patch-1 - VPTQ, A Flexible and Extreme low-bit quantization algorithm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 7, 2024 3:13pm

As we have updated quant_config in config.json, and you may need to update `config.json` file or VPTQ installation. Thanks for point this out, and let me update ipynb. ```bash pip install vptq -U ...

View on GitHub

YangWang92 pushed 1 commit to m300 VPTQ/hessian_collector

December 7, 2024 12:54pm

fix bug 001b476

View on GitHub

YangWang92 pushed 2 commits to m300 VPTQ/hessian_collector

December 7, 2024 12:18pm

fix llm hessian 9434824
add cli data 3bf1599

View on GitHub

YangWang92 forked HanGuo97/flute

December 6, 2024 12:41pm

YangWang92/flute

YangWang92 starred lzd19981105/quip-sharp-qwenvl

December 6, 2024 5:44am

YangWang92 starred AXERA-TECH/ax650n_bsp_sdk

December 5, 2024 7:00am

YangWang92 starred WangXuan95/FPGA-FixedPoint

December 4, 2024 2:26pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 1:46pm

Hi @Duncan1115, I just noticed that you are the author of LLM-CODEBOOK. I’d like to ask if you are interested in continuing to improve VPTQ? Thank you!

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 1:33pm

I have some early results on FP8, and I’ll share them here this week~ Thanks for following~

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 6:18am

Yes, I encountered the same issue at the time. It seemed to be related to the data type handling in NCCL. My workaround was to save the unpacked model as a .pt file and use approaches like viewing ...

View on GitHub

YangWang92 starred linux-msm/dsp-binaries

December 4, 2024 3:13am

YangWang92 starred quic/fastrpc

December 4, 2024 3:10am

YangWang92 starred abdelfattah-lab/attamba

December 3, 2024 2:37pm

YangWang92 starred Tencent/HunyuanVideo

December 3, 2024 12:45pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 3, 2024 9:19am

> Firstly, thank you for your great effort to make this project. > > When will the fine-tuning code be released? If it's delayed, could you please let me know the learning rate used in the end-to-...

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

December 3, 2024 9:17am

> Firstly, thank you for your great effort to make this project. > > When will the fine-tuning code be released? If it's delayed, could you please let me know the learning rate used in the end-to-...

View on GitHub

YangWang92 forked pytorch/ao

December 3, 2024 8:58am

YangWang92/ao

YangWang92 starred ADaM-BJTU/O1-CODER

December 3, 2024 7:58am

YangWang92 starred aialt/awesome-mobile-agents

December 2, 2024 2:47pm

YangWang92 starred PallavAg/claude-computer-use-macos

December 2, 2024 5:43am

YangWang92 starred PrimeIntellect-ai/prime

December 2, 2024 2:20am

YangWang92 pushed 12 commits to main VPTQ/compressed-tensors

December 1, 2024 4:00pm

Observer Restructure: Remove Observers, `calibration`, and applying `frozen` steps from lifecycle (#189) * temporary... 2b79056
Clean up observer defaulting logic, better error message (#200) 37df2dd
apply style and quality (#201) a43dad2
fix group quant bug (#203) db6ccb2
bump version (#204) Signed-off-by: Dipika <[email protected]> ff121cc
skip accelerate tests (#208) 9010372
remove QuantizationScheme.default_scheme (#202) Signed-off-by: Kyle Sayers <[email protected]> 7103a27
Allow ModelCompressor.from_pretrained to load from quantization_config, not compression config (#207) a26c03a
Quantization Scheme Validation (#209) * add model validator to quantization scheme Signed-off-by: Kyle Sayers <ky... c6197ce
Fix uninitialized variable in quantized compressors (#205) Both compressors have a can_quantize() check, which if ev... 525ef3a
Implement aliasable mixin and alias activation ordering (#213) * implement aliasable mixin and alias activation orde... 724d5ce
Revert "Implement aliasable mixin and alias activation ordering (#213)" (#217) This reverts commit 724d5cedc53097107... 8571339

View on GitHub

YangWang92 pushed 31 commits to main VPTQ/llm-compressor

December 1, 2024 3:21pm

Support Model Offloading Tied Tensors Patch (#872) * update parameter of offloaded modules Signed-off-by: Kyle Sa... c62f2e3
add advice about dealing with non-invertable hessians (#875) Signed-off-by: Kyle Sayers <[email protected]> a268a25
seed commit workflow (#877) * seed commit workflow Signed-off-by: andy-neuma <[email protected]> * tickle ... 08125e2
[Observer Restructure]: Add Observers; Add `calibration` and `frozen` steps to `QuantizationModifier` (#837) * updat... 18e9a9f
Bugfix get observer from name (#883) Signed-off-by: Rahul Tuli <[email protected]> 60c766f
BugFix: Fix Sparsity Reload Testing (#882) * fix * fix remaining test cases * add comments * fix d7f09c1
Use custom unique test names for e2e tests (#892) * Include `testconfig_path` in parsed config data Signed-off-by... 10facf2
Revert "Use custom unique test names for e2e tests (#892)" (#893) This reverts commit 10facf2633e58778e82d5d53bd661d... 1c0af10
Move config["testconfig_path"] assignment (#895) * Use custom unique test names for e2e tests (#892) * Include `t... 9c10486
cap accelerate version to avoid bug (#897) Signed-off-by: Kyle Sayers <[email protected]> f622450
Fix observing offloaded weight (#896) * load weight within onloading Signed-off-by: Kyle Sayers <kylesayrs@gmail.... f54989e
Update image in README.md (#861) Co-authored-by: Dipika Sikka <[email protected]> 067c27c
update accelerate version (#899) Signed-off-by: Kyle Sayers <[email protected]> 8bc2293
[GPTQ] Iterative Parameter Updating (#863) * Implement iterative parameter updating Signed-off-by: Kyle Sayers <k... cd1449d
Small fixes for release (#901) * fix device map * expose one gpu for finetune; update to use a better moodel and ... d918350
use smaller portion of dataset (#902) 644a500
Update example to not fail hessian inversion (#904) * update Signed-off-by: Dipika <[email protected]> * ... a173a0c
bump version (#907) Signed-off-by: Dipika <[email protected]> 93832a6
add default mappings (#906) Signed-off-by: Kyle Sayers <[email protected]> 86b4d56
[SparseAutoModelForCausalLM Deprecation] Feature change (#881) * src and tests updates * save model if output_dir... 3d60221
and 11 more ...

View on GitHub

YangWang92 closed an issue on microsoft/VPTQ

December 1, 2024 3:03pm

ops.gemm

If I have two residual codebooks and indice, how do I do quantization of the network layer with two residual codebooks?How is the ops.gemm method compatible with the input and calculation of multip...

YangWang92 closed an issue on microsoft/VPTQ

December 1, 2024 9:38am

Detailed code of the implementation of ops.gemm && ops.dequant

In vqliner.py,where can I see the detailed code of the implementationof ops.dequant(line 402) and ops.gemm(line:322) functions?Thanks and look forward to your answer.

Ecosyste.ms: Timeline

YangWang92

YangWang92 pushed 1 commit to main microsoft/VPTQ

December 7, 2024 3:16pm

YangWang92 closed a pull request on microsoft/VPTQ

December 7, 2024 3:16pm

Update vptq_example.ipynb

YangWang92 opened a pull request on microsoft/VPTQ

December 7, 2024 3:16pm

Update vptq_example.ipynb

YangWang92 created a branch on microsoft/VPTQ

December 7, 2024 3:15pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 7, 2024 3:13pm

YangWang92 pushed 1 commit to m300 VPTQ/hessian_collector

December 7, 2024 12:54pm

YangWang92 pushed 2 commits to m300 VPTQ/hessian_collector

December 7, 2024 12:18pm

YangWang92 forked HanGuo97/flute

December 6, 2024 12:41pm

YangWang92 starred lzd19981105/quip-sharp-qwenvl

December 6, 2024 5:44am

YangWang92 starred ag2ai/ag2

December 5, 2024 2:53pm

YangWang92 starred AXERA-TECH/ax650n_bsp_sdk

December 5, 2024 7:00am

YangWang92 starred WangXuan95/FPGA-FixedPoint

December 4, 2024 2:26pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 1:46pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 1:33pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 4, 2024 6:18am

YangWang92 starred linux-msm/dsp-binaries

December 4, 2024 3:13am

YangWang92 starred quic/fastrpc

December 4, 2024 3:10am

YangWang92 starred abdelfattah-lab/attamba

December 3, 2024 2:37pm

YangWang92 starred Tencent/HunyuanVideo

December 3, 2024 12:45pm

YangWang92 created a comment on an issue on microsoft/VPTQ

December 3, 2024 9:19am

YangWang92 created a comment on an issue on microsoft/VPTQ

December 3, 2024 9:17am

YangWang92 forked pytorch/ao

December 3, 2024 8:58am

YangWang92 starred ADaM-BJTU/O1-CODER

December 3, 2024 7:58am

YangWang92 starred aialt/awesome-mobile-agents

December 2, 2024 2:47pm

YangWang92 starred PallavAg/claude-computer-use-macos

December 2, 2024 5:43am

YangWang92 starred PrimeIntellect-ai/prime

December 2, 2024 2:20am

YangWang92 pushed 12 commits to main VPTQ/compressed-tensors

December 1, 2024 4:00pm

YangWang92 pushed 31 commits to main VPTQ/llm-compressor

December 1, 2024 3:21pm

YangWang92 closed an issue on microsoft/VPTQ

December 1, 2024 3:03pm

ops.gemm

YangWang92 closed an issue on microsoft/VPTQ

December 1, 2024 9:38am

Detailed code of the implementation of ops.gemm && ops.dequant