https://github.com/microsoft/unilm/blob/master/Diff-Transformer/multihead_diffattn.py#L99
I noticed the line `attn_weights = torch.nan_to_num(attn_weights)`, which looks odd.
When I try to t...
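For context, a minimal sketch of one common way NaNs can appear in attention weights, which that line would guard against (this is an assumption about the intent, not taken from the repo): if every position in a row is masked to `-inf` before the softmax, the softmax of that row is all NaN, and `torch.nan_to_num` replaces those NaNs with zeros.

```python
import torch

# A fully masked attention row: every score is -inf before the softmax.
scores = torch.full((1, 4), float("-inf"))

# softmax over an all--inf row produces NaN in every entry.
attn_weights = torch.softmax(scores, dim=-1)
print(torch.isnan(attn_weights).all())  # tensor(True)

# nan_to_num replaces the NaNs with 0.0, so downstream matmuls stay finite.
attn_weights = torch.nan_to_num(attn_weights)
print(attn_weights)  # tensor([[0., 0., 0., 0.]])
```

Whether this is the actual failure mode the authors were guarding against, or something specific to their training setup, is what the question above is asking.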
Hi, thank you for raising this. I think the NaN values in your attention weights may be related to loading pretrained weights and then fine-tuning the model.
In our paper, all models were trained from ...
Thanks for the clarification!
I did not use the official code exactly, but my own implementation, just to see whether we can inherit the weights of a pretrained model.
Maybe I introduced some bugs in my code. Training is...
**Describe**
I'm trying to download the large version of pretrained VLMo, but the link seems to have expired. Could you please update the download link?
https://github.com/wenhui0924/vlmo_ckpts/releases/down...