First of all, thank you for sharing your excellent research with the community.
I have a question about the code used for evaluation on the M-BEIR dataset. It appears that the instruction for each query is chosen randomly during evaluation, so if the random seed is not fixed, the reported performance could vary depending on which instruction happens to be selected. Do you have any experimental results where a single fixed instruction was used for evaluation?
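To illustrate the concern, here is a minimal sketch of the sampling pattern in question. The instruction strings and the `pick_instruction` helper are hypothetical, not taken from the actual M-BEIR code; the point is only that an unseeded RNG can pick different instructions across runs, while seeding (or always taking a fixed index) makes the choice reproducible:

```python
import random

# Hypothetical instruction pool for one M-BEIR task (illustrative only;
# the real prompts come from the dataset's instruction files).
instructions = [
    "Retrieve an image that matches the caption.",
    "Find the image described by the following text.",
    "Identify the picture corresponding to this description.",
]

def pick_instruction(rng: random.Random) -> str:
    # Mirrors the pattern being asked about: one instruction
    # is sampled per query at evaluation time.
    return rng.choice(instructions)

# Unseeded RNGs may return different instructions on different runs,
# so evaluation numbers can drift from run to run.
run_a = pick_instruction(random.Random())
run_b = pick_instruction(random.Random())

# Fixing the seed (or hard-coding an index) makes evaluation reproducible.
fixed_a = pick_instruction(random.Random(42))
fixed_b = pick_instruction(random.Random(42))
assert fixed_a == fixed_b
```

With a fixed seed the same instruction is selected every run, which is one way to make the evaluation deterministic without changing the instruction pool itself.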