feat: resource management and zero-waste scoring update#391
feat: resource management and zero-waste scoring update#391thegeekybeng wants to merge 3 commits intosantifer:mainfrom
Conversation
- Introduced early processing gate to eliminate roles scoring below 4.0, saving tokens - Disassociated Compensation score from main equation, expanding total potential job matches - Relegated comp scoring to post-match evaluation for user ROI decision - Implemented smart negative reinforcement logging: rejected roles (mismatches, duplicates, expired) are routed to a distinct format subset and actively blacklisted under learned_negative configuration to permanently prevent future token waste - Minor fixes to update-system directory matching logic for user paths
📝 WalkthroughWalkthroughThis pull request updates documentation and configuration for a job pipeline system. It adds an "Early Fit Gate" step to the pipeline with revised scoring logic, updates scoring thresholds, expands job board search targets in configuration, and adds a git ignore rule for PDF reports. Changes
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~20 minutes 🚥 Pre-merge checks | ✅ 2 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (2 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
thegeekybeng
left a comment
There was a problem hiding this comment.
suggested improved for overall zero-waste in tokens, and more importantly efforts by the compute and us to read through what is essentially not a role worth chasing.
There was a problem hiding this comment.
Actionable comments posted: 4
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
modes/_shared.md (1)
45-99: 🛠️ Refactor suggestion | 🟠 MajorRemove the duplicated Block G section.
Lines 45-71 repeat the same “Posting Legitimacy (Block G)” content already present on Lines 73-99. Keep a single source of truth to avoid prompt bloat and future drift.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@modes/_shared.md` around lines 45 - 99, The document contains a duplicated "Posting Legitimacy (Block G)" section; remove one of the copies so only a single instance of the Block G heading and its associated content remains. Locate the duplicate by searching for the heading "Posting Legitimacy (Block G)" (and the following table and "Ethical framing (MANDATORY):" bullet list) and delete the earlier or later block so the file retains one canonical Block G section to avoid prompt bloat and drift.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@modes/pipeline.md`:
- Around line 96-103: The PDF generation step currently uses the early gate pass
as the trigger, but must instead use the final full A-G evaluation score; update
the pipeline logic where "Full Auto-Pipeline (Gate Passed Only)" and the
"Generate PDF" step are implemented so that after running the "full A-G
evaluation" (the function/process performing the complete evaluation), you
recalculate and read the final score and only generate the PDF if that final
score >= 4.0; ensure the report save and tracker registration still occur in the
same sequence (save report to reports/{###}-{company-slug}-{YYYY-MM-DD}.md then
generate PDF conditionally based on final score, then write TSV to
batch/tracker-additions/) and reference the final evaluation result
variable/return value from the full A-G evaluation routine when deciding to
create the PDF.
- Around line 9-17: Parallel agents can collide on REPORT_NUM allocation and
concurrently run Playwright (browser_navigate/browser_snapshot), causing
filename races and violating the single-Playwright rule; change the pipeline so
REPORT_NUM allocation and Playwright extraction are performed serially by a
single coordinator/extractor task (or guarded by a distributed mutex) before
spawning background agents with run_in_background to do the remaining work
(early fit gate, A-G evaluation, report generation). Specifically, move the
logic that computes the next REPORT_NUM and all calls to Playwright
(browser_navigate, browser_snapshot) into a dedicated synchronous function/agent
(the "extractor") that returns the extracted data and reserved REPORT_NUM, and
have parallel agents consume that result instead of performing REPORT_NUM
allocation or Playwright themselves.
- Around line 58-83: Change the automatic update behavior that appends to
portals.yml::title_filter.learned_negative so it only adds patterns when the
rejection is truly about title/archetype/hard-blockers (title patterns,
archetype mismatch, specific-certification/onsite-only blockers) and NOT when
the gate reason is geography, seniority, expired posting, or company-specific
policy; implement this in the logic that handles low scores (the pipeline-gate
path that writes scan-history.tsv and appends learned_negative) by checking the
derived signals before mutating portals.yml, skip adding/creating
learned_negative for geography/seniority/expired/company reasons, and if adding,
ensure learned_negative exists under the negative list before appending the
structured entry (pattern, reason, date, source).
In `@templates/portals.example.yml`:
- Around line 281-385: The file contains a duplicated block of portal entries
(starting with "Remotive — AI/DevRel/SA/FDE" and including entries like
"WeWorkRemotely — AI & DX", "Working Nomads — AI/DevRel/SA", through "HN Who's
Hiring — Remote AI") which repeats the same enabled: true search queries from
the earlier block; remove the duplicate block so each portal (e.g., names
"Remotive — AI/DevRel/SA/FDE", "ai-jobs.net — DevRel/SA/FDE", "Manfred — Tech
Senior España", "DevRelX Jobs", "Himalayas — Remote AI/Dev-tools", "HN Who's
Hiring — Remote AI") appears only once and ensure no duplicate enabled: true
entries remain.
---
Outside diff comments:
In `@modes/_shared.md`:
- Around line 45-99: The document contains a duplicated "Posting Legitimacy
(Block G)" section; remove one of the copies so only a single instance of the
Block G heading and its associated content remains. Locate the duplicate by
searching for the heading "Posting Legitimacy (Block G)" (and the following
table and "Ethical framing (MANDATORY):" bullet list) and delete the earlier or
later block so the file retains one canonical Block G section to avoid prompt
bloat and drift.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: ASSERTIVE
Plan: Pro
Run ID: b90aacea-fb9a-4c07-bf1f-79b61c0d9749
📒 Files selected for processing (4)
.gitignoremodes/_shared.mdmodes/pipeline.mdtemplates/portals.example.yml
Netflix Software Engineer Intern (Summer 2026) discovered in parallel scan-v108 session (2026-04-21). URL was genuinely absent from all prior scan history across v1-v107. Evaluated at 4.2/5: strong CV match on Finch production backend (35K-line, blue/green CI/CD), TLS/HMAC/OAuth security engineering, and 5-LLM pipeline — maps to Netflix backend infra, security, and ML/AI tracks. Comp $40-110/hr, FAANG-adjacent brand. Report: santifer#391 (reports/391-netflix-swe-intern-summer-2026-2026-04-21.md) https://claude.ai/code/session_01Y4JdPrE19PAzP6JXJ2aKkW
…ne entries Reports created for evaluations previously tracked in applications.md but missing report files: santifer#388 ByteDance Security Assurance (4.3/5), santifer#389 Verkada Security SWE New Grad (4.1/5), santifer#390 Rockstar Security Automation (4.0/5 — CLOSED deadline Apr 5), santifer#391 Netflix SWE Intern (4.2/5), santifer#392 TikTok Product Security Intern (4.0/5). v144 scan: 0 new qualifying ≥4.0. Found 2 genuinely new URLs: TikTok PGC-LLM Applications and AI Agents (~3.8/5) and DoorDash SWE Intern (~3.0/5) — both added to pipeline below threshold. Crusoe Product Security Applied AI Intern flagged as possible re-open (appears active on Simplify/ LinkedIn/Climate Draft but Ashby 503 — needs manual browser verify). Restored modes/_profile.md from template (was missing). career-ops update v1.1.0→v1.3.0 available (user to apply when ready). https://claude.ai/code/session_01VqxRjn4z1kgcFEsVwX1UAD
Problem
Previously, a lot of tokens and time were wasted running extensive evaluations against roles that fundamentally failed to align with basic profile parameters or blocked on strict requirements. Additionally, bundling the Compensation match directly into the core scoring equation prematurely filtered out otherwise potentially highly-resonant jobs.
Improvements & Changes
[~]formatting section and the system extracts exactly why they failed. It then actively appends those signals to thelearned_negativeconfiguration array to automatically prevent future token waste on similar redundant/bad roles online.update-system.mjs.Summary by CodeRabbit
New Features
Documentation
Chores