Change the repository type filter
All
Repositories list
25 repositories
LiveCodeBench
Publiclm-evaluation-harness-hf
Publicvllm
Public- run tests in https://swj0419.github.io/detect-pretrain.github.io/ for contamination
xformers
PublicLong-Context
PublicThis repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.streaming-javascript
Publicflake8-obey-import-goat
Publicxai-bench
Publicnotebooks
Public- Code to accompany NeurIPS paper https://arxiv.org/abs/2006.08564