Current issue state, recent activity, and per-issue timelines from the indexed issue data.
| Date | Opened | Closed | Comments | Events | Open Backlog |
|---|---|---|---|---|---|
| 2026-04-09 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-10 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-11 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-12 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-13 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-14 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-15 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-16 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-17 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-18 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-19 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-20 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-21 | 0 | 0 | 0 | 0 | 0 |
| 2026-04-22 | 0 | 0 | 0 | 0 | 0 |
Opened: 0
Closed: 0
Comments: 0
Events: 0
| Issue | State | Labels | Comments | Reactions | Updated |
|---|---|---|---|---|---|
#70 [Security Alert] Exposed API key(s) detected: AWS Access Key Opened by hdhdn 1 month ago | open | No labels | 0 | 0 | 1 month ago |
#68 During your processing, have you ever encountered the need to extract part of the code? How was it handled? Opened by cistinej 2 years ago | open | No labels | 0 | 0 | 2 years ago |
#65 Most CMake files missed when categorizing by extension Opened by mdewing 2 years ago | open | No labels | 0 | 0 | 2 years ago |
#62 百度云 连接 cloud cleaned database? Opened by willshion 2 years ago | open | No labels | 0 | 0 | 2 years ago |
#59 When I do pii_inference, cannot load bigcode/bigcode-encoder-pii-ner-v2 Opened by RuochenLowes 3 years ago | open | No labels | 0 | 0 | 3 years ago |
#55 Some file extensions excluded from the published dataset (Racket) Opened by flobbit1 3 years ago | open | No labels | 0 | 0 | 3 years ago |
#54 HuggingFace Need Data Access Approval Opened by heoun 3 years ago | open | No labels | 0 | 0 | 3 years ago |
#53 From GH Archive to bigcode/the-stack-github-issues Opened by yunzheng-r 3 years ago | open | No labels | 0 | 0 | 3 years ago |
#44 Question: File Counts and Dataset Size Opened by darien-schettler 3 years ago | open | No labels | 1 | 0 | 3 years ago |
#35 Deduplication also removes data < ngram_size Opened by cceyda 3 years ago | closed - completed | No labels | 3 | 0 | 3 years ago |
#13 Build StackerFlow datasets Opened by lvwerra 3 years ago | closed - completed | help wanted TF: StackOverflow | 0 | 0 | 3 years ago |
#33 Create text-code pairs from Jupyter Notebooks Opened by loubnabnl 3 years ago | closed - completed | No labels | 0 | 0 | 3 years ago |
#32 Define filters for git commits Opened by lvwerra 3 years ago | closed - completed | No labels | 1 | 0 | 3 years ago |
#31 Define filters for cleaning GitHub issues Opened by lvwerra 3 years ago | closed - completed | No labels | 1 | 0 | 3 years ago |
#30 Run language detection GitHub issues Opened by lvwerra 3 years ago | closed - completed | No labels | 5 | 0 | 3 years ago |
#28 NER models for PII Opened by loubnabnl 3 years ago | closed - completed | No labels | 0 | 2 | 3 years ago |
#27 Refactor PII Code Opened by loubnabnl 3 years ago | closed - completed | No labels | 0 | 0 | 3 years ago |
#16 Decontaminate pretraining dataset from evaluation benchmarks Opened by lvwerra 3 years ago | closed - completed | help wanted TF: Dataset Curation and Filtering | 0 | 0 | 3 years ago |
#15 Build dataset index Opened by lvwerra 3 years ago | closed - completed | help wanted TF: Dataset index | 0 | 0 | 3 years ago |
#12 Create dataset with GitHub metadata Opened by lvwerra 3 years ago | closed - completed | help wanted TF: Dataset Curation and Filtering | 0 | 0 | 3 years ago |
#3 Suggest datasets for Code Dataset Catalogue Opened by lvwerra 4 years ago | closed - completed | good first issue help wanted | 7 | 0 | 3 years ago |
#2 Which languages to include? Opened by lvwerra 4 years ago | closed - completed | question | 20 | 0 | 3 years ago |
#6 Parse code dataset into AST Opened by harm-devries 4 years ago | closed - completed | No labels | 3 | 0 | 3 years ago |
#19 Create dataset with git commits Opened by lvwerra 3 years ago | closed - completed | TF: Dataset Curation and Filtering | 0 | 0 | 3 years ago |
#34 Convert Jupyter Notebooks to scripts Opened by loubnabnl 3 years ago | closed - completed | No labels | 0 | 1 | 3 years ago |