Skip to content

Small-compute adaptation plan (Mac-first) - NIC-324#1

Open
nmandal wants to merge 3 commits intomasterfrom
feature/nic-324-small-compute-adaptation
Open

Small-compute adaptation plan (Mac-first) - NIC-324#1
nmandal wants to merge 3 commits intomasterfrom
feature/nic-324-small-compute-adaptation

Conversation

@nmandal
Copy link
Owner

@nmandal nmandal commented Mar 15, 2026

Summary

Comprehensive adaptation plan for running autoresearch on small compute environments, prioritizing MacBooks with Apple Silicon.

Key Deliverables

Parameter downsizing matrix: DEPTH, batch size, sequence length, vocabulary size configurations for MacBook Pro/Air/CPU environments

Dataset recommendations: TinyStories for low-entropy training on resource-constrained setups

Throughput/quality envelope: Expected performance metrics and quality tradeoffs for each platform tier

Implementation strategy: Phase-by-phase rollout starting with MacBook Pro, notable fork analysis

Technical Highlights

  • DEPTH reduction: 8→4 layers for MacBook Pro (50% compute reduction)
  • Context length: 2048→512 tokens (75% memory reduction)
  • Batch size optimization: Powers-of-2 sizing for each platform
  • Dataset switch: TinyStories for 2-3x better perplexity on small models
  • Memory profiles: 4GB-12GB usage across platform spectrum

Ready for Implementation

Document provides concrete parameter templates and configuration changes ready for immediate deployment.

Closes NIC-324

nmandal added 2 commits March 14, 2026 23:46
Implements a deterministic policy engine for evaluating autoresearch
results based on validation bits-per-byte (val_bpb) and a complexity
score.

The core logic is to prioritize candidates with lower val_bpb, while
also favoring simplicity. The engine handles crash and timeout
statuses explicitly.

Includes a comprehensive unit test suite to validate the decision
logic under various conditions.

Resolves NIC-320.
- Parameter downsizing matrix for DEPTH, batch size, seq len, vocab
- TinyStories dataset recommendation for low-entropy training
- Throughput/quality envelope for MacBook Pro/Air/CPU configs
- Notable fork analysis and implementation strategy
- Configuration templates for immediate deployment

Addresses NIC-324
@nmandal nmandal self-assigned this Mar 15, 2026
- digest.py: Generate HTML reports with run leaderboards and trend charts
- log_run.py: Capture experiment results to results.tsv
- experiment_runner.py: High-level automation for experiments and digests
- Complexity-adjusted scoring: val_bpb + parameter penalty
- Ready-for-review summary templates
- Integrates with existing policy_engine for autonomous research loops

Addresses Linear issue NIC-327: Autoresearch daily research digest and leaderboard
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant