Current issue state, recent activity, and per-issue timelines from the indexed issue data.
| Date | Opened | Closed | Comments | Events | Open Backlog |
|---|---|---|---|---|---|
| 2026-04-09 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-10 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-11 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-12 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-13 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-14 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-15 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-16 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-17 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-18 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-19 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-20 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-21 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-22 | 0 | 0 | 0 | 0 | 0 |
Opened: 0
Closed: 0
Comments: 0
Events: 0
| Issue | State | Labels | Comments | Reactions | Updated |
|---|---|---|---|---|---|
# Opened by unknown 1 month ago | No labels | 0 | 0 | 1 month ago | |
#318 Question: is there room for long horizon "spec tension" tests around complex code tasks? Opened by onestardao 2 months ago | closed - completed | No labels | 1 | 0 | 1 month ago |
# Opened by unknown 2 months ago | No labels | 0 | 0 | 2 months ago | |
# Opened by unknown 4 months ago | No labels | 0 | 0 | 4 months ago | |
# Opened by unknown 5 months ago | No labels | 0 | 0 | 5 months ago | |
#304 Unable to execute the MultiPL-E task for C++ Opened by JohnneyQin 1 year ago | open | No labels | 1 | 0 | 5 months ago |
# Opened by unknown 7 months ago | No labels | 0 | 0 | 7 months ago | |
#315 Wrong Hugging Face link for Spider dataset in bigcode-evaluation-harness/docs/README.md line 434 Opened by 354246695 7 months ago | open | No labels | 1 | 0 | 7 months ago |
# Opened by unknown 8 months ago | No labels | 0 | 0 | 8 months ago | |
#311 Improve pass@1 Score on Humaneval Opened by showlibia 1 year ago | closed - completed | No labels | 0 | 1 | 8 months ago |
#313 Support configurability of FIM tokens on SantaCoder Opened by Jay-Roberts 10 months ago | open | No labels | 0 | 0 | 10 months ago |
#224 Multiple-E Go test file name suffix does not contain _test.go Opened by sagtanih 2 years ago | closed - completed | No labels | 0 | 1 | 10 months ago |
# Opened by unknown 10 months ago | No labels | 0 | 0 | 10 months ago | |
#240 Some questions about APPS Opened by virt9 2 years ago | closed - completed | No labels | 2 | 0 | 1 year ago |
# Opened by unknown 1 year ago | No labels | 0 | 0 | 1 year ago | |
#308 testing Humaneval of qwen-2.5-7B-coder-instruct Opened by zxiangx 1 year ago | open | No labels | 0 | 2 | 1 year ago |
#307 how to add new model? Opened by pengzhangzhi 1 year ago | open | No labels | 0 | 0 | 1 year ago |
#306 When executing languages such as JS and GO in Multiple-E, the generated results suddenly end Opened by fxnie 1 year ago | open | No labels | 0 | 0 | 1 year ago |
#266 What is `fine-tuning` in task submission? Opened by zhimin-z 2 years ago | open | No labels | 0 | 1 | 1 year ago |
# Opened by unknown 1 year ago | No labels | 0 | 0 | 1 year ago | |
#303 Code-Llama-7B-Python 4 Bit Error on HumanEval Opened by wilyub 1 year ago | open | No labels | 0 | 0 | 1 year ago |
#300 is there a benchmark page on the benchmark results evaluated using bigcode-evaluation-harness Opened by yxchng 1 year ago | open | No labels | 1 | 0 | 1 year ago |
# Opened by unknown 1 year ago | No labels | 0 | 0 | 1 year ago | |
#131 'HumanEval' object has no attribute 'dataset' Opened by dongguanting 3 years ago | closed - completed | No labels | 7 | 4 | 1 year ago |
# Opened by unknown 1 year ago | No labels | 0 | 0 | 1 year ago |