Methods
Replication guide
2026
Methods & Replication: Git-Based Evaluation of LLM Code Generation
Scientific-grade methodology document. Covers corpus construction (exact Jira REST API
endpoint), outcome-based tier labelling with indicator function notation, git benchmark
extraction algorithm, evaluation protocol with verbatim prompts, Jaccard scoring
definition, per-model per-tier statistics with standard deviations, 7 documented
limitations, and a shell-script replication guide. Designed for independent replication
without access to private data or proprietary models.