YangWang92 Events in 2024 - Ecosyste.ms: Timeline

YangWang92 created a comment on an issue on microsoft/VPTQ

December 1, 2024 9:37am

https://github.com/microsoft/VPTQ/tree/main/csrc Thanks!

View on GitHub

YangWang92 starred tenstorrent/tt-firmware

December 1, 2024 4:41am

YangWang92 starred MartinVerges/genset-control

December 1, 2024 2:30am

YangWang92 starred samuelsadok/dji_protocol

December 1, 2024 2:30am

YangWang92 created a comment on an issue on THUDM/GLM-Edge

November 30, 2024 2:09pm

> 这部分由于我无法拿到对应的来自高通的开源工具包，因此，我们也没办法将对应属于高通的工具包进行开源 It seems that running models on QNN doesn't require the "open-source" Qualcomm toolkit, right?

View on GitHub

YangWang92 opened an issue on THUDM/GLM-Edge

November 30, 2024 2:06pm

Reuqest: QNN Qualcomm inference example

### Feature request / 功能建议 Hi all, the README mentions testing on the Qualcomm 8 Elite (Gen4) platform with all models running on the NPU. Is there an early demo available for testing? Which p...

YangWang92 starred THUDM/GLM-Edge

November 30, 2024 2:01pm

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:59pm

I plan to write a blog later to explain the algorithm details in depth. The algorithm in the paper is written quite concisely, so feel free to ask more questions!

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:58pm

Indeed, our code has a somewhat research-oriented style, haha, so some parts might not be very clear. Let me quickly explain: 1. **`quant_data`**: This is used to compute the proxy error for eval...

View on GitHub

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:58pm

tow stage code

1.In the two-stage quantization, two similar functions, the init_centroids_indices&& init_res_centroids_indices function and the quantize_vector &&quantize_residual_vector function, are defined res...

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:48pm

Additionally, I speculate that a 70B model at 2-bit might achieve stronger performance on certain benchmarks. Although I can’t prove this yet, I plan to conduct a thorough analysis on this in the f...

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:45pm

Yes, I really appreciate your question—it’s thought-provoking, and I’m seriously reflecting on it while trying to pursue rigorous research. If you're interested, we could collaborate directly. You ...

View on GitHub

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:45pm

Question about the design motivation behind VPTQ

Thank you for your insightful and thought-provoking work. I have a question regarding the motivation behind low-bit quantization and its potential as a solution for enabling extremely low-bit quant...

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:41pm

Yes, in the paper, we included layer-wise fine-tuning. However, we recently found that running end-to-end fine-tuning performs better than layer-wise fine-tuning. I removed the code for layer-wise ...

View on GitHub

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:41pm

where are the layer-wise fine-tune codes in algorithm branch?

In section 4, you suggest to perform layer-wise fine-tune after perform vptq stage, but I cannot find any code in algorithm branch.

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:39pm

The llama 3-8b models in the paper have been fine-tuned, especially around ~2-bit, where even just a few hundred iterations of end-to-end fine-tuning significantly improved model accuracy. We plan ...

View on GitHub

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:39pm

Question about result reproduction

![Image](https://github.com/user-attachments/assets/68a478b6-e9c4-433d-89ea-3a08c8cbefdd)  I tried this config to quantize the model but didn't get as good res...

YangWang92 starred RfidResearchGroup/proxmark3

November 30, 2024 11:30am

YangWang92 starred huggingface/autotrain-advanced

November 30, 2024 11:04am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 8:00am

> Thank you for your quick response. I set `--vector_lens -1 12` because, in line 226 of `./vptq/quantizer.py`, it notes: > > if num_centroids == -1: # Do not quantize, keep original data > I ass...

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 7:28am

This configuration looks a bit odd. When we set `--npercent 1`, it extracts 1% of the outliers to build a separate lookup table. However, with `--vector_lens -1 12`, the vector length of the outlie...

View on GitHub

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 7:04am

> > Hi [@ShawnzzWu](https://github.com/ShawnzzWu) > > Would you mind sharing your quantized model so I can debug into it? > > Sorry, for information security reasons, I'm not allowed to share my f...

View on GitHub

YangWang92 starred NetX-lab/Yala

November 29, 2024 2:30am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 1:11am

> I've been trying to quantize and run the Meta-Llama-3.1-8B-Instruct-2.3bit model with group number set to 4, and successfully run the model when k1(centroids) is 4096 as in the paper. However, an...

View on GitHub

Ecosyste.ms: Timeline

YangWang92

YangWang92 created a comment on an issue on microsoft/VPTQ

December 1, 2024 9:37am

YangWang92 starred tenstorrent/tt-firmware

December 1, 2024 4:41am

YangWang92 starred MartinVerges/genset-control

December 1, 2024 2:30am

YangWang92 starred samuelsadok/dji_protocol

December 1, 2024 2:30am

YangWang92 created a comment on an issue on THUDM/GLM-Edge

November 30, 2024 2:09pm

YangWang92 opened an issue on THUDM/GLM-Edge

November 30, 2024 2:06pm

Reuqest: QNN Qualcomm inference example

YangWang92 starred THUDM/GLM-Edge

November 30, 2024 2:01pm

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:59pm

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:58pm

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:58pm

tow stage code

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:48pm

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:45pm

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:45pm

Question about the design motivation behind VPTQ

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:41pm

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:41pm

where are the layer-wise fine-tune codes in algorithm branch?

YangWang92 created a comment on an issue on microsoft/VPTQ

November 30, 2024 1:39pm

YangWang92 closed an issue on microsoft/VPTQ

November 30, 2024 1:39pm

Question about result reproduction

YangWang92 starred RfidResearchGroup/proxmark3

November 30, 2024 11:30am

YangWang92 starred huggingface/autotrain-advanced

November 30, 2024 11:04am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 8:00am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 7:28am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 7:04am

YangWang92 starred NetX-lab/Yala

November 29, 2024 2:30am

YangWang92 created a comment on an issue on microsoft/VPTQ

November 29, 2024 1:11am

YangWang92 starred HorizonRobotics/alf

November 28, 2024 3:11pm

YangWang92 starred int8/monte-carlo-tree-search

November 28, 2024 3:10pm

YangWang92 starred alexforencich/verilog-dsp

November 28, 2024 11:20am

YangWang92 starred mlabonne/llm-datasets

November 28, 2024 10:15am

YangWang92 starred showlab/ShowUI

November 28, 2024 9:18am

YangWang92 starred python-trio/trio

November 28, 2024 6:31am