sam-hey Events in 2025 - Ecosyste.ms: Timeline

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 11, 2025 7:48pm

> it's a bit surprising that ModelMeta.similarity_fn_name isn't being utilized. We would love to switch to that one and would encourage a PR for this. It was only recently added to...

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 11, 2025 7:48pm

sam-hey opened an issue on embeddings-benchmark/mteb

January 11, 2025 7:36pm

[v2] Getting password prompt for task and tests

Branch: v2.0.0

Steps to Reproduce:

```
import mteb
from mteb import MTEB
model = mteb.get_model("intfloat/multilingual-e5-small")
tasks = mteb.get_tasks(tasks=["NFCorpus"], languages=["en...
```

sam-hey pushed 1 commit to fix_contriever sam-hey/mteb

January 11, 2025 5:43pm

ref: rename EvaluationFunction to ScoringFunction Co-authored-by: Isaac Chung e4a692f

sam-hey pushed 1 commit to fix_contriever sam-hey/mteb

January 11, 2025 5:37pm

ref: rename EvaluationFunction to ScoringFunction Co-authored-by: Isaac Chung 6d04547

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 11, 2025 5:35pm

> What do you mean by `global similarity functions`? Consolidate all similarity functions into a single **new** file or retain them in `/mteb/evaluation/evaluators/utils.py` for better organizat...

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 11, 2025 5:35pm

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 9:00pm

I see your point, but I personally find it unintuitive to update the information in multiple places. In my opinion, there should be a single source of truth. Since ModelMeta already has the appropr...

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 10, 2025 9:00pm

sam-hey opened a pull request on embeddings-benchmark/mteb

January 10, 2025 8:33pm

add similarity_fn in ModelMeta

## Checklist

- [ ] Run tests locally to make sure nothing is broken using `make test`.
- [x] Run the formatter to format the code using `make lint`.

cc ...

sam-hey created a comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 7:59pm

<img width="560" alt="image" src="https://github.com/user-attachments/assets/a96935ed-c410-43fa-bdb3-dd5ac2a7c03a" /> Please take a look at the screenshot. The MTEB `ModelMeta` is already availa...

sam-hey created a comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 4:09pm

Just as an idea: I believe the `ModelMeta` object already has this information `similarity_fn_name="cosine"`. Why not add an Enum type to specify the evaluation function? This would make it much mo...
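
The Enum idea in the comment above could look roughly like the following. This is a minimal sketch, not mteb's actual implementation: the name `ScoringFunction` is borrowed from the commit messages in this timeline, but its members, the `score` helper, and the `"dot"` string value are assumptions made for illustration. Only `similarity_fn_name="cosine"` comes from the comment itself.

```python
import math
from enum import Enum


class ScoringFunction(str, Enum):
    """Hypothetical enum of supported scoring functions (members are assumed)."""

    COSINE = "cosine"
    DOT_PRODUCT = "dot"


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    norm = math.sqrt(dot(a, a)) * math.sqrt(dot(b, b))
    return dot(a, b) / norm if norm else 0.0


# Single lookup table, so the choice of function lives in one place.
_SCORERS = {
    ScoringFunction.COSINE: cosine,
    ScoringFunction.DOT_PRODUCT: dot,
}


def score(a, b, similarity_fn_name="cosine"):
    """Score two vectors using the function named in the model metadata.

    ScoringFunction(name) raises ValueError for an unknown name, so a typo
    in the metadata fails loudly instead of silently falling back to cosine.
    """
    return _SCORERS[ScoringFunction(similarity_fn_name)](a, b)
```

With this shape, a model whose metadata says `"dot"` keeps dot-product scoring, and the metadata remains the only place where the choice is recorded.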

sam-hey created a comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 3:44pm

@Samoed, if I’m not mistaken, the ability to use the `Dot product` seems to have been lost with this PR. Could you please confirm?

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 3:19pm

You are right, I hadn't considered this case. I've added a commit to address it

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 10, 2025 3:19pm

sam-hey pushed 1 commit to main sam-hey/mteb

January 10, 2025 3:18pm

fix: add similarity to SentenceTransformerWrapper d37d526

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 3:08pm

Yes, this is correct, but I changed the code in `DRESModel` to handle this: [https://github.com/sam-hey/mteb/blob/main/mteb/evaluation/evaluators/RetrievalEvaluator.py#L379](https://github.com/s...

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 10, 2025 3:08pm

sam-hey created a review comment on a pull request on embeddings-benchmark/mteb

January 10, 2025 2:48pm

ColBERT does not implement SentenceTransformerWrapper, so it is not required at this point. The object mentioned above is of type DRESModel, which can handle this functionality.

sam-hey created a review on a pull request on embeddings-benchmark/mteb

January 10, 2025 2:48pm

sam-hey pushed 0 commits to main sam-hey/mteb

January 10, 2025 2:33pm

sam-hey pushed 1 commit to main sam-hey/mteb

January 10, 2025 2:33pm

Revert "fix(#1594): fixes implementation of similarity()" This reverts commit 0ebec781b3c000dc36f1dbf9dd59500d6f9ff555. 32a790f

sam-hey pushed 4 commits to main sam-hey/mteb

January 10, 2025 2:30pm

fix: Fixed task_type aggregation on leaderboard (#1746) * Fixed task_type aggregation in leaderboard * Fixed an e... 23086af
1.28.2 Automatically generated by python-semantic-release 88143c7
fix: Fixed definition of zero-shot in ModelMeta (#1747) * Corrected zero_shot definition to be based on task names... 0c9b9d8
1.28.3 Automatically generated by python-semantic-release dfc76d1

sam-hey pushed 78 commits to main sam-hey/mteb

January 10, 2025 1:32pm

Add NanoBEIR Datasets (#1588) * add NanoClimateFeverRetrieval task, still requires some debugging * move task to co... 6731b94
Update tasks table 9de7f20
Feat: Evaluate missing languages (#1584) * init * fix tests * update mock retrieval * update tests * use subsets... 48cb97d
Add IBM Granite Embedding Models (#1613) * add IBM granite embedding models * lint formatting * add adapted_from a... ad05983
fix: disable co2_tracker for API models (#1614) 7c8e094
1.25.2 Automatically generated by python-semantic-release d8c015f
fix: set `use_instructions` to True in models using prompts (#1616) feat: set `use_instructions` to True in models u... 0c44482
1.25.3 Automatically generated by python-semantic-release 2024338
fix: override existing results (#1617) * fix override existing results * lint * fix tests * add tests with overwr... 272adb1
1.25.4 Automatically generated by python-semantic-release bd782d6
add MSMARCO eval split in MTEB English (classic) benchmark (#1620) * add MSMARCO eval split in MTEB English (classic... e1b74f2
fix: GermanDPR Dataset Causes Cross-Encoder Failure Due to Unexpected dict (#1621) Fixes #1609 748033e
fix: properly add mteb_model_meta to model object (#1623) 72a457e
1.25.5 Automatically generated by python-semantic-release d8dd96c
Feat: Add jasper (#1591) * init jasper * init jasper * add to overview * add to overview * remove some p... ef5a068
fix: Update results_to_dataframe to use BenchmarkResults class (#1628) 02ae4fa
1.25.6 Automatically generated by python-semantic-release e8e1a50
Speed up test_save_predictions (#1631) 1b06601
fix: Correction of discrepancies for gte-Qweb model (#1637) 2de61b1
1.25.7 Automatically generated by python-semantic-release eb643a7
and 58 more ...

sam-hey closed an issue on embeddings-benchmark/mteb

January 9, 2025 7:47pm

Discrepancy in Cross-Encoder Scores Between MTEB and SentenceTransformers

I conducted a comparison between the scores generated by MTEB for Cross-Encoders and those produced by the `sentence_transformers` library's `CrossEncoder` for the same model and dataset. Unfortuna...

sam-hey created a comment on an issue on embeddings-benchmark/mteb

January 9, 2025 7:36pm

Happy to hear that v2 is close😊 ! All files are available in the [repository](https://github.com/sam-hey/mteb-cross-discrepancy/tree/cross-discrepancy). Check out [`eval_mteb_results.py`](htt...

sam-hey created a comment on an issue on embeddings-benchmark/mteb

January 9, 2025 3:55pm

Sure @orionw, I hope you’re able to pinpoint the cause. Since I needed to modify some code, I forked the project and installed it from source. The last commit hash before my changes is: `ab8805c612...

sam-hey created a comment on an issue on embeddings-benchmark/mteb

January 9, 2025 8:59am

Hello @orionw , Thanks for reaching out so quickly! Unfortunately, the problem seems to be a bit more general. I updated the code to use the Cross-Encoder just as a reranker to match the origi...

sam-hey pushed 1 commit to cross-discrepancy sam-hey/mteb-cross-discrepancy

January 9, 2025 8:57am

use bm25 rerank new results d6b201c

sam-hey opened an issue on embeddings-benchmark/mteb

January 8, 2025 9:06pm

Discrepancy in Cross-Encoder Scores Between MTEB and SentenceTransformers

I conducted a comparison between the scores generated by MTEB for Cross-Encoders and those produced by the `sentence_transformers` library's `CrossEncoder` for the same model and dataset. Unfortuna...