ML Intern backlog prioritization report - 2026-05-04

# ML Intern Backlog Prioritization

Generated: 2026-05-04T13:18:56.926357+00:00
Model: `anthropic/claude-opus-4-6`

Sources: github_issue=24, github_pr=50, hf_discussion=14

## Summary

Critical security PRs (#96, #85, #83) and agent-quality fix (#88) are reviewed and ready to merge. Hosted Space has multiple reliability regressions (session zombies, Pro detection, onboarding CSS, blank pages) blocking paying users. Provider expansion (local models, Bedrock, Azure, Gemini) is the top feature theme with 6+ PRs. Cost guardrails and session durability are strategic priorities.

## Can Be Closed

No high-confidence resolved-in-main candidates found.

## Highest Impact Next

1. **Merge 3 security PRs: CVE patch, sandbox auth, SSRF fix** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Merge PR #96 (authlib CVE), #85 (sandbox bearer auth), #83 (SSRF origin validation) — all reviewed APPROVE/safe-to-merge, effort 1 each
   - Rationale: Unauthenticated sandbox RCE + HF_TOKEN exfil, SSRF token leak, and CRITICAL CVE — all exploitable today with reviewed fixes waiting
   - Next action: Rebase #85 to resolve dirty state; approve and merge all three in one batch
   - Sources: [github_pr#96](https://github.com/huggingface/ml-intern/pull/96), [github_pr#85](https://github.com/huggingface/ml-intern/pull/85), [github_pr#83](https://github.com/huggingface/ml-intern/pull/83)

2. **Merge thinking_blocks fix (PR #88)** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Merge PR #88 — reviewed 'SAFE TO MERGE, LOW RISK', 51 lines, 1 file. Fixes silent quality degradation on every Anthropic session.
   - Rationale: Extended thinking silently drops after first tool call on all Anthropic models — degrades agent quality for the entire session
   - Next action: Confirm rebase is clean and merge
   - Sources: [github_pr#88](https://github.com/huggingface/ml-intern/pull/88)

3. **Fix onboarding CSS overflow blocking Start button** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Change onboarding container overflow:hidden to overflow-y:auto — root cause fully documented with screenshots and confirmed fix
   - Rationale: Completely blocks first-time users on 1366x768 screens; users must open DevTools to use the product
   - Next action: Push 1-line CSS fix to main
   - Sources: [hf_discussion#19](https://huggingface.co/spaces/smolagents/ml-intern/discussions/19)

4. **Fix MCP tool stall (hub_repo_details hangs 5+ min)** (impact 5/5, effort 2/5, confidence 0.9)
   - Recommendation: Adopt PR #190's 30s timeout + running-state emission into a maintainer branch since PR was closed but issue #127 persists
   - Rationale: Maintainer-filed bug; agent hangs silently forcing Ctrl+C, losing all accumulated state
   - Next action: Cherry-pick timeout logic from PR #190 into maintainer fix; add integration test
   - Sources: [github_issue#127](https://github.com/huggingface/ml-intern/issues/127), [github_pr#190](https://github.com/huggingface/ml-intern/pull/190)

5. **Fix session zombie state and quota cleanup** (impact 5/5, effort 3/5, confidence 0.85)
   - Recommendation: Fix session lifecycle so stopped sessions free slots; merge PR #77 (race condition) after pending review completes
   - Rationale: Users permanently locked out after hitting session limit; all remaining sessions stuck in 'catching up' — completely blocks further use
   - Next action: Audit session cleanup on stop/delete; review PR #77 race condition fix; deploy together
   - Sources: [hf_discussion#17](https://huggingface.co/spaces/smolagents/ml-intern/discussions/17), [github_pr#77](https://github.com/huggingface/ml-intern/pull/77), [hf_discussion#20](https://huggingface.co/spaces/smolagents/ml-intern/discussions/20)

6. **Fix Pro plan detection and entitlement checks** (impact 4/5, effort 2/5, confidence 0.85)
   - Recommendation: Merge community PR fix from discussion #21; verify entitlement logic covers all subscription states
   - Rationale: Paying Pro users can't access paid features — 3 separate users confirmed; directly harms revenue trust
   - Next action: Review and merge community PR from discussion #21; add test for subscription edge cases
   - Sources: [hf_discussion#18](https://huggingface.co/spaces/smolagents/ml-intern/discussions/18), [hf_discussion#25](https://huggingface.co/spaces/smolagents/ml-intern/discussions/25)

7. **Merge Bedrock region prefix fix (PR #185)** (impact 4/5, effort 1/5, confidence 0.9)
   - Recommendation: Cherry-pick or merge PR #185 — 17-line targeted fix, verified with live Bedrock calls, blocks all non-US AWS users
   - Rationale: Hard-coded 'us.' prefix completely breaks research sub-agent for non-US Bedrock users
   - Next action: Approve and merge; CI failure confirmed unrelated to fix
   - Sources: [github_issue#184](https://github.com/huggingface/ml-intern/issues/184), [github_pr#185](https://github.com/huggingface/ml-intern/pull/185)

8. **Add LICENSE file to unblock adoption** (impact 4/5, effort 1/5, confidence 0.9)
   - Recommendation: Add Apache 2.0 LICENSE file — 5 thumbs-up, enterprise users explicitly blocked from testing
   - Rationale: No license = legal blocker for enterprises and external contributors; 5 reactions + multiple +1 comments
   - Next action: Add LICENSE file and README badge; use PR #178 as reference
   - Sources: [github_issue#41](https://github.com/huggingface/ml-intern/issues/41), [github_pr#178](https://github.com/huggingface/ml-intern/pull/178)

## Features

1. **Local model support (Ollama/vLLM/OpenAI-compat)** (impact 5/5, effort 3/5, confidence 0.75)
   - Recommendation: Review PR #166 (improved local model support with feature gate); use #44 Ollama work as reference. Highest-demand feature.
   - Next action: Maintainer review of PR #166 security model; decide on ENABLE_LOCAL_MODELS gate
   - Sources: [github_issue#94](https://github.com/huggingface/ml-intern/issues/94), [github_pr#166](https://github.com/huggingface/ml-intern/pull/166), [github_pr#44](https://github.com/huggingface/ml-intern/pull/44), [github_pr#68](https://github.com/huggingface/ml-intern/pull/68)

2. **Provider adapter refactor + Bedrock/Azure/Gemini** (impact 4/5, effort 3/5, confidence 0.75)
   - Recommendation: Merge PR #55 (adapter refactor, reviewed SAFE) first, then #80 (Azure, LGTM) and #66 (Bedrock). Gemini PRs #131/#95 need conflict resolution.
   - Next action: Rebase and merge PR #55; queue #80 and #66 immediately after
   - Sources: [github_pr#55](https://github.com/huggingface/ml-intern/pull/55), [github_pr#66](https://github.com/huggingface/ml-intern/pull/66), [github_pr#80](https://github.com/huggingface/ml-intern/pull/80), [github_pr#182](https://github.com/huggingface/ml-intern/pull/182), [github_pr#131](https://github.com/huggingface/ml-intern/pull/131), [github_pr#95](https://github.com/huggingface/ml-intern/pull/95)

3. **Image, file, and dataset attachments for web+CLI** (impact 4/5, effort 4/5, confidence 0.6)
   - Recommendation: Run automated review on PR #186; it addresses both #157 and #158 with CLI+web support
   - Next action: Trigger PR review; resolve dirty merge state
   - Sources: [github_issue#157](https://github.com/huggingface/ml-intern/issues/157), [github_issue#158](https://github.com/huggingface/ml-intern/issues/158), [github_pr#186](https://github.com/huggingface/ml-intern/pull/186)

4. **OpenRouter / custom OpenAI-compatible base URL** (impact 4/5, effort 1/5, confidence 0.8)
   - Recommendation: Implement OPENAI_BASE_URL passthrough — minimal change, follows ecosystem convention, unlocks 300+ models
   - Next action: Accept a small PR respecting OPENAI_BASE_URL env var
   - Sources: [github_issue#197](https://github.com/huggingface/ml-intern/issues/197), [github_pr#188](https://github.com/huggingface/ml-intern/pull/188)

5. **Cost guardrails: prompt caching, iteration cap, research concurrency** (impact 5/5, effort 4/5, confidence 0.85)
   - Recommendation: Sprint on P0 items: add cache_control markers, lower max_iterations 300→40, cap concurrent research subagents, default cheaper research model
   - Next action: Internal sprint; start with max_iterations cap and prompt caching
   - Sources: [github_issue#61](https://github.com/huggingface/ml-intern/issues/61)

6. **Expand pre-flight checks for hf_jobs approval** (impact 4/5, effort 2/5, confidence 0.7)
   - Recommendation: Add 4 static checks (timeout, hub_model_id, flash-attn, trackio) + credit pre-check to prevent wasted GPU spend
   - Next action: Accept PR for reliability_checks.py expansion; add test coverage
   - Sources: [github_issue#203](https://github.com/huggingface/ml-intern/issues/203), [github_issue#125](https://github.com/huggingface/ml-intern/issues/125)

7. **Notify user when agent needs input/approval** (impact 3/5, effort 1/5, confidence 0.7)
   - Recommendation: Review PR #106 (--notify-on-block CLI flag) after CI fix #107 lands; simpler and safer than PR #71
   - Next action: Merge PR #107 first, then review #106
   - Sources: [github_issue#65](https://github.com/huggingface/ml-intern/issues/65), [github_issue#103](https://github.com/huggingface/ml-intern/issues/103), [github_pr#106](https://github.com/huggingface/ml-intern/pull/106)

8. **Evaluation and benchmarking CLI** (impact 4/5, effort 4/5, confidence 0.5)
   - Recommendation: Request author fix Claude-flagged correctness issues in PR #98 before re-review
   - Next action: Comment on PR #98 with specific fix requests; set deadline
   - Sources: [github_issue#84](https://github.com/huggingface/ml-intern/issues/84), [github_pr#98](https://github.com/huggingface/ml-intern/pull/98)

9. **Background sessions on Mongo control plane** (impact 5/5, effort 5/5, confidence 0.7)
   - Recommendation: Fix P0 blockers (_enforce_gated_model_quota crash, reconnect 404) before merge; PR has 83 tests and thorough design
   - Next action: Author to fix remaining P0s; re-trigger automated review
   - Sources: [github_pr#206](https://github.com/huggingface/ml-intern/pull/206)

10. **Claude Code project-mode / plugin support** (impact 4/5, effort 4/5, confidence 0.5)
   - Recommendation: Strategic decision needed: is Claude Code a supported frontend? If yes, review PR #113 first; defer #114 until #113 lands
   - Next action: Maintainer decision on Claude Code scope
   - Sources: [github_pr#113](https://github.com/huggingface/ml-intern/pull/113), [github_pr#114](https://github.com/huggingface/ml-intern/pull/114), [github_issue#74](https://github.com/huggingface/ml-intern/issues/74)

11. **Opt-in LangFuse observability callback** (impact 3/5, effort 2/5, confidence 0.65)
   - Recommendation: Adopt env-gated design from PR #198 (mandatory LANGFUSE_HOST); implement as litellm callback
   - Next action: Reopen or reimplement based on PR #198 approach
   - Sources: [github_issue#196](https://github.com/huggingface/ml-intern/issues/196), [github_pr#198](https://github.com/huggingface/ml-intern/pull/198)

12. **Frontend UX: example prompts, sidebar, copy/regenerate** (impact 3/5, effort 3/5, confidence 0.8)
   - Recommendation: Fix isProcessing stuck-true bug and accessibility issues in PR #38, then merge — frontend-only, good screenshots
   - Next action: Author to fix regenerate error handling; rebase and merge
   - Sources: [github_pr#38](https://github.com/huggingface/ml-intern/pull/38), [hf_discussion#28](https://huggingface.co/spaces/smolagents/ml-intern/discussions/28)

## Fixes

1. **CVE-2026-27962 authlib upgrade** (impact 5/5, effort 1/5, confidence 0.9)
   - Recommendation: Fix title mismatch (says 1.6.9, is 1.7.0), vet joserfc dep, merge urgently
   - Sources: [github_pr#96](https://github.com/huggingface/ml-intern/pull/96)

2. **Sandbox API unauthenticated RCE** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Rebase and merge — reviewed APPROVE, adds bearer-token auth to all sandbox endpoints
   - Sources: [github_pr#85](https://github.com/huggingface/ml-intern/pull/85)

3. **SSRF in fetch_hf_docs leaks HF token** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Merge — reviewed safe-to-merge, origin validation with good test coverage
   - Sources: [github_pr#83](https://github.com/huggingface/ml-intern/pull/83)

4. **Extended thinking drops after first tool call** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: Merge — 51 lines, 1 file, 'SAFE TO MERGE, LOW RISK'
   - Sources: [github_pr#88](https://github.com/huggingface/ml-intern/pull/88)

5. **Start session button hidden by CSS overflow** (impact 5/5, effort 1/5, confidence 0.95)
   - Recommendation: 1-line CSS fix: overflow:hidden → overflow-y:auto on onboarding container
   - Sources: [hf_discussion#19](https://huggingface.co/spaces/smolagents/ml-intern/discussions/19)

6. **Concurrent plan overwrites + CORS rejection on Spaces** (impact 4/5, effort 1/5, confidence 0.8)
   - Recommendation: Cherry-pick per-session plan isolation and CORS SPACE_HOST fix from closed PR #205
   - Sources: [github_pr#205](https://github.com/huggingface/ml-intern/pull/205)

7. **Read tool crashes on string offset/limit** (impact 4/5, effort 1/5, confidence 0.9)
   - Recommendation: Merge after also fixing bash_handler timeout (same bug); reviewed 'ready to merge'
   - Sources: [github_pr#110](https://github.com/huggingface/ml-intern/pull/110)

8. **HF Pro subscription not detected after upgrade** (impact 4/5, effort 2/5, confidence 0.85)
   - Recommendation: Review community PR fix from discussion #21; test all subscription states
   - Sources: [hf_discussion#18](https://huggingface.co/spaces/smolagents/ml-intern/discussions/18), [hf_discussion#25](https://huggingface.co/spaces/smolagents/ml-intern/discussions/25)

9. **Stopped sessions don't free quota; catch-up stuck** (impact 5/5, effort 3/5, confidence 0.85)
   - Recommendation: Fix session cleanup lifecycle; merge PR #77 race condition fix after review
   - Sources: [hf_discussion#17](https://huggingface.co/spaces/smolagents/ml-intern/discussions/17), [hf_discussion#20](https://huggingface.co/spaces/smolagents/ml-intern/discussions/20), [github_pr#77](https://github.com/huggingface/ml-intern/pull/77)

10. **Chat messages spontaneously disappear mid-session** (impact 4/5, effort 3/5, confidence 0.65)
   - Recommendation: Audit chat state persistence for race conditions or storage limits
   - Sources: [hf_discussion#26](https://huggingface.co/spaces/smolagents/ml-intern/discussions/26), [hf_discussion#29](https://huggingface.co/spaces/smolagents/ml-intern/discussions/29)

11. **Dotenv ignored when shell has stale env var** (impact 3/5, effort 1/5, confidence 0.8)
   - Recommendation: Confirm maintainer intent (ambiguous close comment); merge if acceptable — reviewed LGTM
   - Sources: [github_pr#120](https://github.com/huggingface/ml-intern/pull/120)

12. **401 Client Error on sandbox creation** (impact 4/5, effort 3/5, confidence 0.55)
   - Recommendation: Investigate regression; merge PR #219 for better logging; add sandbox-creation alerting
   - Sources: [github_issue#214](https://github.com/huggingface/ml-intern/issues/214), [hf_discussion#33](https://huggingface.co/spaces/smolagents/ml-intern/discussions/33)

## Other / Watchlist

1. **Add LICENSE file (Apache 2.0)** (impact 4/5, effort 1/5)
   - Recommendation: Add immediately — blocks enterprise adoption; use PR #178 as reference
   - Sources: [github_issue#41](https://github.com/huggingface/ml-intern/issues/41), [github_pr#178](https://github.com/huggingface/ml-intern/pull/178)

2. **Add CONTRIBUTING.md** (impact 2/5, effort 1/5)
   - Recommendation: Merge PR #130 — docs-only, 156 additions, no code changes
   - Sources: [github_pr#130](https://github.com/huggingface/ml-intern/pull/130)

3. **Update README model example + env var annotations** (impact 2/5, effort 1/5)
   - Recommendation: Quick-merge both — trivial doc fixes, zero risk
   - Sources: [github_pr#220](https://github.com/huggingface/ml-intern/pull/220), [github_pr#109](https://github.com/huggingface/ml-intern/pull/109)

4. **Fix CI review workflow for fork PRs** (impact 3/5, effort 1/5)
   - Recommendation: Rebase and merge — blocks automated review for all external contributors
   - Sources: [github_pr#107](https://github.com/huggingface/ml-intern/pull/107)

5. **Restructure system prompt tone** (impact 3/5, effort 2/5)
   - Recommendation: Re-add 3 dropped behavioral rules flagged in review before merging
   - Sources: [github_pr#33](https://github.com/huggingface/ml-intern/pull/33)

6. **Use huggingface_hub.get_token() as fallback** (impact 3/5, effort 1/5)
   - Recommendation: Review and merge — fixes real onboarding friction (issue #23)
   - Sources: [github_pr#34](https://github.com/huggingface/ml-intern/pull/34)

## Clusters

- **Security vulnerabilities**
  - Summary: CVE in authlib, unauthenticated sandbox RCE, and SSRF token leak — all have reviewed PRs ready to merge
  - Sources: [github_pr#96](https://github.com/huggingface/ml-intern/pull/96), [github_pr#85](https://github.com/huggingface/ml-intern/pull/85), [github_pr#83](https://github.com/huggingface/ml-intern/pull/83)

- **Session reliability & state persistence**
  - Summary: Sessions get stuck as zombies, catch-up broken, messages disappear, race conditions on creation — users lose work and get locked out
  - Sources: [hf_discussion#17](https://huggingface.co/spaces/smolagents/ml-intern/discussions/17), [hf_discussion#20](https://huggingface.co/spaces/smolagents/ml-intern/discussions/20), [hf_discussion#26](https://huggingface.co/spaces/smolagents/ml-intern/discussions/26), [hf_discussion#29](https://huggingface.co/spaces/smolagents/ml-intern/discussions/29), [github_pr#77](https://github.com/huggingface/ml-intern/pull/77), [github_pr#205](https://github.com/huggingface/ml-intern/pull/205)

- **Pro/payment detection failures**
  - Summary: Pro subscribers can't access paid features; 402 errors silently swallowed on training jobs
  - Sources: [hf_discussion#18](https://huggingface.co/spaces/smolagents/ml-intern/discussions/18), [hf_discussion#25](https://huggingface.co/spaces/smolagents/ml-intern/discussions/25), [github_issue#125](https://github.com/huggingface/ml-intern/issues/125)

- **Provider expansion (local, Azure, Bedrock, Gemini)**
  - Summary: Users locked to Anthropic/OpenAI APIs; demand for local models, Bedrock, Azure, Gemini, OpenRouter across 10+ PRs and issues
  - Sources: [github_issue#94](https://github.com/huggingface/ml-intern/issues/94), [github_pr#166](https://github.com/huggingface/ml-intern/pull/166), [github_pr#44](https://github.com/huggingface/ml-intern/pull/44), [github_pr#55](https://github.com/huggingface/ml-intern/pull/55), [github_pr#60](https://github.com/huggingface/ml-intern/pull/60), [github_pr#66](https://github.com/huggingface/ml-intern/pull/66), [github_pr#80](https://github.com/huggingface/ml-intern/pull/80), [github_pr#131](https://github.com/huggingface/ml-intern/pull/131), [github_pr#95](https://github.com/huggingface/ml-intern/pull/95), [github_pr#182](https://github.com/huggingface/ml-intern/pull/182), [github_issue#197](https://github.com/huggingface/ml-intern/issues/197), [github_pr#188](https://github.com/huggingface/ml-intern/pull/188), [github_pr#68](https://github.com/huggingface/ml-intern/pull/68)

- **File & data uploads**
  - Summary: Users can't provide local files, images, or datasets to the agent — blocks real ML workflows
  - Sources: [github_issue#157](https://github.com/huggingface/ml-intern/issues/157), [github_issue#158](https://github.com/huggingface/ml-intern/issues/158), [github_pr#186](https://github.com/huggingface/ml-intern/pull/186), [github_issue#187](https://github.com/huggingface/ml-intern/issues/187)

- **Notifications when agent needs input**
  - Summary: Users multitasking miss approval prompts; two PRs compete — #106 is simpler and safer
  - Sources: [github_issue#65](https://github.com/huggingface/ml-intern/issues/65), [github_issue#103](https://github.com/huggingface/ml-intern/issues/103), [github_pr#71](https://github.com/huggingface/ml-intern/pull/71), [github_pr#106](https://github.com/huggingface/ml-intern/pull/106)

- **Agent stalls & cost runaway**
  - Summary: Agent hangs silently on MCP calls; research subagents run to 100k+ tokens with no cost cap or caching
  - Sources: [github_issue#127](https://github.com/huggingface/ml-intern/issues/127), [github_pr#190](https://github.com/huggingface/ml-intern/pull/190), [github_issue#61](https://github.com/huggingface/ml-intern/issues/61)

- **Bedrock region & sub-agent routing**
  - Summary: Hard-coded 'us.' prefix breaks research sub-agent for all non-US AWS regions; 17-line fix ready
  - Sources: [github_issue#184](https://github.com/huggingface/ml-intern/issues/184), [github_pr#185](https://github.com/huggingface/ml-intern/pull/185)

- **Sandbox creation & auth errors**
  - Summary: 401/429 errors on sandbox creation; model default regression blocked all users; logging gaps hide root cause
  - Sources: [github_issue#214](https://github.com/huggingface/ml-intern/issues/214), [hf_discussion#33](https://huggingface.co/spaces/smolagents/ml-intern/discussions/33), [hf_discussion#31](https://huggingface.co/spaces/smolagents/ml-intern/discussions/31), [github_pr#219](https://github.com/huggingface/ml-intern/pull/219)

- **Claude Code integration**
  - Summary: Claude Code project-mode and plugin support to use ml-intern tools within Claude Max subscriptions
  - Sources: [github_pr#113](https://github.com/huggingface/ml-intern/pull/113), [github_pr#114](https://github.com/huggingface/ml-intern/pull/114), [github_issue#74](https://github.com/huggingface/ml-intern/issues/74)

- **Documentation & i18n**
  - Summary: Missing LICENSE, no CONTRIBUTING.md, outdated README, translation requests
  - Sources: [github_issue#41](https://github.com/huggingface/ml-intern/issues/41), [github_pr#178](https://github.com/huggingface/ml-intern/pull/178), [github_pr#130](https://github.com/huggingface/ml-intern/pull/130), [github_pr#220](https://github.com/huggingface/ml-intern/pull/220), [github_pr#109](https://github.com/huggingface/ml-intern/pull/109), [github_pr#149](https://github.com/huggingface/ml-intern/pull/149), [github_issue#147](https://github.com/huggingface/ml-intern/issues/147)

- **Observability & evaluation**
  - Summary: No LLM trace export, no tool-level telemetry, no benchmark evaluation pipeline
  - Sources: [github_issue#196](https://github.com/huggingface/ml-intern/issues/196), [github_pr#198](https://github.com/huggingface/ml-intern/pull/198), [github_issue#155](https://github.com/huggingface/ml-intern/issues/155), [github_issue#84](https://github.com/huggingface/ml-intern/issues/84), [github_pr#98](https://github.com/huggingface/ml-intern/pull/98)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ML Intern backlog prioritization report - 2026-05-04 #223

ML Intern Backlog Prioritization

Summary

Can Be Closed

Highest Impact Next

Features

Fixes

Other / Watchlist

Clusters

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

ML Intern backlog prioritization report - 2026-05-04 #223

Description

ML Intern Backlog Prioritization

Summary

Can Be Closed

Highest Impact Next

Features

Fixes

Other / Watchlist

Clusters

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions