Ecosyste.ms: Timeline

Browse the timeline of events for every public repo on GitHub. Data updated hourly from GH Archive.

scguang301

scguang301 opened an issue on THUDM/GLM-Edge
Can this be processed with Qualcomm's open-source ai-hub-models? Could you provide the quantization encoding files here?
### System Info None ### Who can help? None ### Information - [ ] The official example scripts - [ ] My own modified scripts and tasks ### Reproduction...
scguang301 created a comment on an issue on quic/ai-hub-models
I reviewed the code for qwen2_7b_instruct_quantized, but this code uses a model that has already been converted by QNN. I want to use the native Qwen2 model with SHALlamaAttention.

scguang301 starred THUDM/GLM-Edge
scguang301 starred Lightricks/LTX-Video
scguang301 starred antgroup/echomimic_v2
scguang301 starred OpenBuddy/OpenBuddy
scguang301 starred microsoft/T-MAC
scguang301 opened an issue on quic/ai-hub-models
[BUG] Auto BYOM Issue: <insert issue here>
**Describe the issue** I see there is a Qwen2 7B model. I am trying to use SHALlamaAttention with the Qwen2 0.5B model, but I am getting gibberish. How do I adapt it here? **To Reproduce** ...