Benchmarking Local LLMs on HumanEval: Setup and Methodology
2025-02-01Setup, pipeline architecture, and methodology for evaluating 35+ local LLMs on the HumanEval benchmark via Ollama.
Setup, pipeline architecture, and methodology for evaluating 35+ local LLMs on the HumanEval benchmark via Ollama.