Notre avis
Orchestre un pipeline de recherche complet allant de la spécification du problème à la rédaction du rapport, en passant par la conception, l'implémentation, les tests et le benchmarking.
Points forts
- Couverture systématique de toutes les étapes de la recherche, de la théorie au code final.
- Intègre des boucles de rétroaction pour l'optimisation et la correction d'erreurs.
- Vérifications de cohérence entre les étapes pour garantir la fiabilité des résultats.
- Modes automatique et interactif pour s'adapter au niveau d'implication souhaité.
Limites
- Nécessite une spécification initiale précise et une validation à chaque étape.
- La complexité du pipeline peut être excessive pour des projets simples.
- Le mode d'optimisation agressive peut prolonger le temps d'exécution.
Idéal pour des projets de recherche complets nécessitant une implémentation robuste, des benchmarks rigoureux et un rapport détaillé.
Déconseillé pour des tâches de codage simples ou quand une approche itérative rapide est préférée à un pipeline structuré.
Analyse de sécurité
PrudenceThe skill is an orchestrator that uses Bash, Write, and network tools for legitimate research workflows. While no destructive or obfuscated actions are explicitly instructed, the combination of these powerful tools warrants caution due to potential misuse in a running agent environment.
- •Allows Bash execution, which can run arbitrary system commands.
- •Allows WebFetch and WebSearch, enabling network access and potential data exfiltration if misused.
- •Writes and edits files, which could overwrite important data.
Exemples
I want to develop a new caching algorithm. Use the research-to-code pipeline to go from problem definition to a complete implementation with benchmarks and a report.Design an efficient algorithm for graph shortest paths. Activate aggressive optimization mode and challenge the solution iteratively.Implement a concurrent hash map in Rust. Run the research-to-code pipeline in automatic mode, including benchmarking and a final consistency check.name: research_to_code_pipeline description: Orchestrates the full research pipeline from problem specification through theory, design, implementation, benchmarking, to final report. Tracks dependencies and feedback loops. Use for end-to-end research projects. allowed-tools: Read, Write, Edit, Bash, Grep, Glob, WebFetch, WebSearch
Research-to-Code Pipeline Orchestrator
Purpose
Coordinate the complete research workflow, ensuring each stage builds on previous results and maintains consistency.
Pipeline Stages
Stage 1: Problem Definition
Skills: problem_specification Output: Formal spec with types, constraints, invariants Checkpoint: Spec reviewed and approved
Stage 2: Theoretical Analysis
Skills: algorithmic_analysis, comparative_complexity Output: Complexity analysis, design comparison table Checkpoint: Theory sound, alternatives evaluated
Stage 2.5: Optimization Loop (Optional)
Skills: iterative_optimizer Output: Improved algorithm or confirmation of optimality Checkpoint: User satisfied or proven optimal Mode: User-enabled via "aggressive optimization" or "can we do better?" Loop: Repeats Stage 2 with challenges until user stops
Stage 3: Design Selection
Skills: systems_design_patterns, microarchitectural_modeling, language_comparative_runtime Output: Design decision with justification Checkpoint: Design choice documented and justified
Stage 4: Implementation
Skills: multi_language_codegen, safety_invariants Output: Working code in Java, C++, Rust Checkpoint: Code compiles, passes basic tests
Stage 5: Testing
Skills: unit_test_generation Output: Comprehensive test suites Checkpoint: All tests pass, coverage ≥90%
Stage 6: Benchmarking
Skills: test_data_design (FIRST), benchmark_design, performance_interpretation Output: Test data catalog, performance measurements, and analysis Checkpoint: Comprehensive test data covers edge cases; results match theoretical predictions (within ±20%)
Stage 7: Reporting
Skills: reporting_visualization, technical_exposition, pedagogical_reflection, temporal_style_adapter Output: Complete technical report Checkpoint: Document ready for review
Stage 8: Consistency Check
Skills: self_consistency_checker Output: Verification report Checkpoint: No contradictions detected
Orchestration Logic
Sequential Dependencies
problem_specification
↓
algorithmic_analysis + comparative_complexity
↓
[OPTIONAL LOOP: iterative_optimizer]
↓ (if improvements found)
↑ (re-run analysis with new approach)
↓ (user stops or proven optimal)
systems_design_patterns + microarchitectural_modeling
↓
multi_language_codegen + safety_invariants
↓
unit_test_generation
↓
benchmark_design → performance_interpretation
↓
reporting_visualization + technical_exposition
↓
pedagogical_reflection
↓
self_consistency_checker
Feedback Loops
- Optimization loop (Stage 2.5): iterative_optimizer challenges solution → re-run Stage 2 with improvements
- Benchmarks contradict theory: Revisit analysis or modeling
- Tests fail: Revisit implementation or specification
- Consistency check fails: Revisit affected stages
Execution Strategy
Automatic Mode
- Execute stages sequentially
- Use skill_context_cache to pass results forward
- Halt on checkpoint failures
- Generate final report automatically
- Optimization loop disabled by default (enables on "aggressive optimization" keyword)
Interactive Mode
- Execute stage, present results
- User approves or requests revision
- Continue to next stage
- User can jump back to any stage
- Optimization loop: After Stage 2, ask "Optimize further or continue?" If optimize → Stage 2.5
Aggressive Optimization Mode
- After Stage 2, automatically activate iterative_optimizer
- Challenge current solution with 3-5 improvements
- User selects challenge or stops
- If challenge selected: re-run Stage 2 with new approach
- Loop until user satisfied or proven optimal
- Trigger: User says "aggressive optimization", "find optimal", "can we do better?"
Parallel Opportunities
- Implement Java, C++, Rust in parallel (after design)
- Run benchmarks in parallel (after all implementations)
- Generate sections of report in parallel (after benchmarks)
Context Handoff
Use skill_context_cache to pass:
- Spec → Analysis: Types, constraints, invariants
- Analysis → Design: Complexity, operations profile
- Design → Implementation: Data structures, algorithms
- Implementation → Testing: Code structure, edge cases
- Testing → Benchmarking: Validated implementations
- Benchmarking → Reporting: Measurements, analysis
Pipeline Invocation
Full Pipeline:
"Research optimal k-way merge, implement in all languages, benchmark, and write CS500 report"
Partial Pipeline:
"Implement heap-based merge from existing spec"
→ Starts at Stage 4
Resume Pipeline:
"Benchmarks showed unexpected results; re-analyze and update report"
→ Loops back to Stage 2, continues to Stage 7
Optimization Loop:
"Research k-way merge with aggressive optimization"
→ Runs Stage 1-2
→ Stage 2.5: "O(nk) found. Can we do better?" → Challenge with heap O(n log k)
→ Loop: User approves → re-run Stage 2 with heap
→ Stage 2.5: "O(n log k) achieved. Can we do better?" → Challenge with constants
→ Loop: User stops → Continue to Stage 3
Quality Gates
Each stage has acceptance criteria:
| Stage | Gate | Criteria | |-------|------|----------| | 1 | Spec Complete | All invariants documented | | 2 | Theory Sound | Complexity proven, alternatives compared | | 2.5 | Optimization Complete | User satisfied or proven optimal (optional) | | 3 | Design Justified | Trade-offs documented | | 4 | Code Correct | Compiles, no warnings | | 5 | Tests Pass | 100% pass, ≥90% coverage | | 6 | Results Plausible | Within 2× of predictions | | 7 | Report Complete | All sections present | | 8 | Consistency OK | No contradictions |
Error Handling
Checkpoint Failure
- Document issue
- Identify root cause
- Determine which stage to revisit
- Re-execute from that stage
Unexpected Results
- Treat as scientific finding, not error
- Investigate cause (measurement issue vs model issue)
- Update model or document limitation
- Continue pipeline
Cross-Skill Integration
Coordinates: All 19 other skills (including iterative_optimizer) Uses: skill_context_cache for state management Feeds into: Final deliverable (complete research artifact) Optional loop: iterative_optimizer challenges solutions after Stage 2
Expert Next.js App Router
Developpement
Un skill qui transforme Claude en expert Next.js App Router.
Générateur de README
Developpement
Crée des README.md professionnels et complets pour vos projets.
Rédacteur de Documentation API
Developpement
Génère de la documentation API complète au format OpenAPI/Swagger.