Benchmarking Local LLMs on HumanEval: Setup and Methodology

2025-02-01

Setup, pipeline architecture, and methodology for evaluating 35+ local LLMs on the HumanEval benchmark via Ollama.