Repository Issue Activity (beta feature)

openai/evals

Current issue state, recent activity, and per-issue timelines from the indexed issue data.

Open Issues: 120
New in 7 Days: 0
Closed in 7 Days: 0
Average Open Age: 793 days
Stale 30+ Days: 119
Stale 90+ Days: 109
Last 14 days
Date        Opened  Closed  Comments  Events  Open Backlog
2026-04-09  0       0       0         0       0
2026-04-10  0       0       0         0       0
2026-04-11  0       0       0         0       0
2026-04-12  0       0       0         0       0
2026-04-13  0       0       0         0       0
2026-04-14  0       0       0         0       0
2026-04-15  0       0       0         0       0
2026-04-16  0       0       0         0       0
2026-04-17  0       0       0         0       0
2026-04-18  0       0       0         0       0
2026-04-19  0       0       0         0       0
2026-04-20  0       0       0         0       0
2026-04-21  0       0       0         0       0
2026-04-22  0       0       0         0       0
This week

Opened: 0

Closed: 0

Comments: 0

Events: 0

Top labels
bug (52)
Idea for Eval (15)
Issue explorer
Issue, State, Labels, Comments, Reactions, Updated

#1636 Feature Request: HarmActionsEval benchmark for evaluation of agent action safety

Opened by prane-eth 30 days ago
open
No labels
Comments: 0, Reactions: 0, Updated: 22 days ago

#

Opened by unknown 27 days ago
No labels
Comments: 0, Reactions: 0, Updated: 27 days ago

#

Opened by unknown 27 days ago
No labels
Comments: 0, Reactions: 0, Updated: 27 days ago

#1635 Proposal: narrow factual-overelaboration pairwise eval (human-reviewed, no custom code)

Opened by joaquinhuigomez 1 month ago
open
No labels
Comments: 0, Reactions: 0, Updated: 1 month ago

#

Opened by unknown 1 month ago
No labels
Comments: 0, Reactions: 0, Updated: 1 month ago

#1634 test

Opened by aartipswc-dot 1 month ago
open
bug
Comments: 0, Reactions: 0, Updated: 1 month ago

#

Opened by unknown 1 month ago
No labels
Comments: 0, Reactions: 0, Updated: 1 month ago

#1527 What is this

Opened by DXv-3 2 years ago
open
No labels
Comments: 2, Reactions: 0, Updated: 1 month ago

#1632 Measuring hallucination rates in production systems

Opened by terrywerk 1 month ago
open
No labels
Comments: 0, Reactions: 0, Updated: 1 month ago

#1629 Proposal: add WFGY 16-problem RAG failure map as a taxonomy for eval analysis

Opened by onestardao 2 months ago
open
No labels
Comments: 0, Reactions: 0, Updated: 2 months ago

#1628 Feature Request: Add Explainability / Visibility Mode to `HumanCliSolver`

Opened by AftabHussain 2 months ago
open
No labels
Comments: 0, Reactions: 0, Updated: 2 months ago

#1627 Add speciesist bias evaluation category

Opened by stuckvgn 2 months ago
open
No labels
Comments: 0, Reactions: 0, Updated: 2 months ago

#

Opened by unknown 2 months ago
No labels
Comments: 0, Reactions: 0, Updated: 2 months ago

#1576 Link miss in readme

Opened by yufansong 1 year ago
open
bug
Comments: 2, Reactions: 0, Updated: 2 months ago

#1493 `OpenAIChatCompletionFn` should `__init__` should accept `**kwargs`

Opened by ezraporter 2 years ago
open
bug
Comments: 3, Reactions: 0, Updated: 3 months ago

#

Opened by unknown 3 months ago
No labels
Comments: 0, Reactions: 0, Updated: 3 months ago

#

Opened by unknown 3 months ago
No labels
Comments: 0, Reactions: 0, Updated: 3 months ago

#1556 o1 release breaks token usage stats

Opened by lucapericlp 2 years ago
open
bug
Comments: 1, Reactions: 1, Updated: 3 months ago

#

Opened by unknown 3 months ago
No labels
Comments: 0, Reactions: 0, Updated: 3 months ago

#

Opened by unknown 3 months ago
No labels
Comments: 0, Reactions: 0, Updated: 3 months ago

#1608 Pull request template is obsolete

Opened by omonimus1 4 months ago
open
No labels
Comments: 1, Reactions: 0, Updated: 4 months ago

#

Opened by unknown 4 months ago
No labels
Comments: 0, Reactions: 0, Updated: 4 months ago

#1606 Update python version in GitHub workflows to python 3.12

Opened by omonimus1 4 months ago
open
No labels
Comments: 0, Reactions: 0, Updated: 4 months ago

#

Opened by unknown 5 months ago
No labels
Comments: 0, Reactions: 0, Updated: 5 months ago

#1604 “Proposal: Thermodynamic trip-wire metric for AI consciousness threshold”

Opened by bordode 6 months ago
open
No labels
Comments: 2, Reactions: 0, Updated: 6 months ago

1–25 of 332