Our review
Orchestrates a full research-to-code pipeline from problem specification through theory, design, implementation, testing, benchmarking, to final report.
Strengths
- Systematic coverage of all research stages, from theory to production code.
- Built-in feedback loops for optimization and error correction across stages.
- Consistency checks ensure no contradictions between stages.
- Supports automatic, interactive, and aggressive optimization modes.
Limitations
- Requires a precise initial specification and checkpoint approvals at each stage.
- Pipeline complexity may be overkill for simple coding tasks.
- Aggressive optimization mode can significantly extend execution time.
Best for end-to-end research projects that require rigorous implementation, benchmarking, and documentation.
Avoid when a quick, iterative coding approach suffices or when the problem is well-understood and doesn't need full pipeline orchestration.
Security analysis
CautionThe skill is an orchestrator that uses Bash, Write, and network tools for legitimate research workflows. While no destructive or obfuscated actions are explicitly instructed, the combination of these powerful tools warrants caution due to potential misuse in a running agent environment.
- •Allows Bash execution, which can run arbitrary system commands.
- •Allows WebFetch and WebSearch, enabling network access and potential data exfiltration if misused.
- •Writes and edits files, which could overwrite important data.
Examples
I want to develop a new caching algorithm. Use the research-to-code pipeline to go from problem definition to a complete implementation with benchmarks and a report.Design an efficient algorithm for graph shortest paths. Activate aggressive optimization mode and challenge the solution iteratively.Implement a concurrent hash map in Rust. Run the research-to-code pipeline in automatic mode, including benchmarking and a final consistency check.name: research_to_code_pipeline description: Orchestrates the full research pipeline from problem specification through theory, design, implementation, benchmarking, to final report. Tracks dependencies and feedback loops. Use for end-to-end research projects. allowed-tools: Read, Write, Edit, Bash, Grep, Glob, WebFetch, WebSearch
Research-to-Code Pipeline Orchestrator
Purpose
Coordinate the complete research workflow, ensuring each stage builds on previous results and maintains consistency.
Pipeline Stages
Stage 1: Problem Definition
Skills: problem_specification Output: Formal spec with types, constraints, invariants Checkpoint: Spec reviewed and approved
Stage 2: Theoretical Analysis
Skills: algorithmic_analysis, comparative_complexity Output: Complexity analysis, design comparison table Checkpoint: Theory sound, alternatives evaluated
Stage 2.5: Optimization Loop (Optional)
Skills: iterative_optimizer Output: Improved algorithm or confirmation of optimality Checkpoint: User satisfied or proven optimal Mode: User-enabled via "aggressive optimization" or "can we do better?" Loop: Repeats Stage 2 with challenges until user stops
Stage 3: Design Selection
Skills: systems_design_patterns, microarchitectural_modeling, language_comparative_runtime Output: Design decision with justification Checkpoint: Design choice documented and justified
Stage 4: Implementation
Skills: multi_language_codegen, safety_invariants Output: Working code in Java, C++, Rust Checkpoint: Code compiles, passes basic tests
Stage 5: Testing
Skills: unit_test_generation Output: Comprehensive test suites Checkpoint: All tests pass, coverage ≥90%
Stage 6: Benchmarking
Skills: test_data_design (FIRST), benchmark_design, performance_interpretation Output: Test data catalog, performance measurements, and analysis Checkpoint: Comprehensive test data covers edge cases; results match theoretical predictions (within ±20%)
Stage 7: Reporting
Skills: reporting_visualization, technical_exposition, pedagogical_reflection, temporal_style_adapter Output: Complete technical report Checkpoint: Document ready for review
Stage 8: Consistency Check
Skills: self_consistency_checker Output: Verification report Checkpoint: No contradictions detected
Orchestration Logic
Sequential Dependencies
problem_specification
↓
algorithmic_analysis + comparative_complexity
↓
[OPTIONAL LOOP: iterative_optimizer]
↓ (if improvements found)
↑ (re-run analysis with new approach)
↓ (user stops or proven optimal)
systems_design_patterns + microarchitectural_modeling
↓
multi_language_codegen + safety_invariants
↓
unit_test_generation
↓
benchmark_design → performance_interpretation
↓
reporting_visualization + technical_exposition
↓
pedagogical_reflection
↓
self_consistency_checker
Feedback Loops
- Optimization loop (Stage 2.5): iterative_optimizer challenges solution → re-run Stage 2 with improvements
- Benchmarks contradict theory: Revisit analysis or modeling
- Tests fail: Revisit implementation or specification
- Consistency check fails: Revisit affected stages
Execution Strategy
Automatic Mode
- Execute stages sequentially
- Use skill_context_cache to pass results forward
- Halt on checkpoint failures
- Generate final report automatically
- Optimization loop disabled by default (enables on "aggressive optimization" keyword)
Interactive Mode
- Execute stage, present results
- User approves or requests revision
- Continue to next stage
- User can jump back to any stage
- Optimization loop: After Stage 2, ask "Optimize further or continue?" If optimize → Stage 2.5
Aggressive Optimization Mode
- After Stage 2, automatically activate iterative_optimizer
- Challenge current solution with 3-5 improvements
- User selects challenge or stops
- If challenge selected: re-run Stage 2 with new approach
- Loop until user satisfied or proven optimal
- Trigger: User says "aggressive optimization", "find optimal", "can we do better?"
Parallel Opportunities
- Implement Java, C++, Rust in parallel (after design)
- Run benchmarks in parallel (after all implementations)
- Generate sections of report in parallel (after benchmarks)
Context Handoff
Use skill_context_cache to pass:
- Spec → Analysis: Types, constraints, invariants
- Analysis → Design: Complexity, operations profile
- Design → Implementation: Data structures, algorithms
- Implementation → Testing: Code structure, edge cases
- Testing → Benchmarking: Validated implementations
- Benchmarking → Reporting: Measurements, analysis
Pipeline Invocation
Full Pipeline:
"Research optimal k-way merge, implement in all languages, benchmark, and write CS500 report"
Partial Pipeline:
"Implement heap-based merge from existing spec"
→ Starts at Stage 4
Resume Pipeline:
"Benchmarks showed unexpected results; re-analyze and update report"
→ Loops back to Stage 2, continues to Stage 7
Optimization Loop:
"Research k-way merge with aggressive optimization"
→ Runs Stage 1-2
→ Stage 2.5: "O(nk) found. Can we do better?" → Challenge with heap O(n log k)
→ Loop: User approves → re-run Stage 2 with heap
→ Stage 2.5: "O(n log k) achieved. Can we do better?" → Challenge with constants
→ Loop: User stops → Continue to Stage 3
Quality Gates
Each stage has acceptance criteria:
| Stage | Gate | Criteria | |-------|------|----------| | 1 | Spec Complete | All invariants documented | | 2 | Theory Sound | Complexity proven, alternatives compared | | 2.5 | Optimization Complete | User satisfied or proven optimal (optional) | | 3 | Design Justified | Trade-offs documented | | 4 | Code Correct | Compiles, no warnings | | 5 | Tests Pass | 100% pass, ≥90% coverage | | 6 | Results Plausible | Within 2× of predictions | | 7 | Report Complete | All sections present | | 8 | Consistency OK | No contradictions |
Error Handling
Checkpoint Failure
- Document issue
- Identify root cause
- Determine which stage to revisit
- Re-execute from that stage
Unexpected Results
- Treat as scientific finding, not error
- Investigate cause (measurement issue vs model issue)
- Update model or document limitation
- Continue pipeline
Cross-Skill Integration
Coordinates: All 19 other skills (including iterative_optimizer) Uses: skill_context_cache for state management Feeds into: Final deliverable (complete research artifact) Optional loop: iterative_optimizer challenges solutions after Stage 2
Next.js App Router Expert
Development
A skill that turns Claude into a Next.js App Router expert.
README Generator
Development
Creates professional and comprehensive README.md files for your projects.
API Documentation Writer
Development
Generates comprehensive API documentation in OpenAPI/Swagger format.