ruvnet / worker-benchmarks
Install for your project team
Run this command in your project directory to install the skill for your entire team:
mkdir -p .claude/skills/worker-benchmarks && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/1052" && unzip -o skill.zip -d .claude/skills/worker-benchmarks && rm skill.zip
Project Skills
This skill will be saved in .claude/skills/worker-benchmarks/ and checked into git. All team members will have access to it automatically.
Important: Please verify the skill by reviewing its instructions before using it.
Run comprehensive worker system benchmarks and performance analysis
0 views
0 installs
Skill Content
---
name: worker-benchmarks
description: Run comprehensive worker system benchmarks and performance analysis
version: 1.0.0
invocable: true
author: agentic-flow
capabilities:
- performance_testing
- metrics_collection
- optimization_recommendations
---
# Worker Benchmarks Skill
Run comprehensive performance benchmarks for the agentic-flow worker system.
## Quick Start
```bash
# Run full benchmark suite
npx agentic-flow workers benchmark
# Run specific benchmark
npx agentic-flow workers benchmark --type trigger-detection
npx agentic-flow workers benchmark --type registry
npx agentic-flow workers benchmark --type agent-selection
npx agentic-flow workers benchmark --type concurrent
```
## Benchmark Types
### 1. Trigger Detection (`trigger-detection`)
Tests keyword detection speed across 12 worker triggers.
- **Target**: p95 < 5ms
- **Iterations**: 1000
- **Metrics**: latency, throughput, histogram
### 2. Worker Registry (`registry`)
Tests CRUD operations on worker entries.
- **Target**: p95 < 10ms
- **Iterations**: 500 creates, gets, updates
- **Metrics**: per-operation latency breakdown
### 3. Agent Selection (`agent-selection`)
Tests performance-based agent selection.
- **Target**: p95 < 1ms
- **Iterations**: 1000
- **Metrics**: selection confidence, agent scores
### 4. Model Cache (`cache`)
Tests model caching performance.
- **Target**: p95 < 0.5ms
- **Metrics**: hit rate, cache size, eviction stats
### 5. Concurrent Workers (`concurrent`)
Tests parallel worker creation and updates.
- **Target**: < 1000ms for 10 workers
- **Metrics**: per-worker latency, memory usage
### 6. Memory Key Generation (`memory-keys`)
Tests memory pattern key generation.
- **Target**: p95 < 0.1ms
- **Iterations**: 5000
- **Metrics**: unique patterns, throughput
## Output Format
```
═══════════════════════════════════════════════════════════
📈 BENCHMARK RESULTS
═══════════════════════════════════════════════════════════
✅ Trigger Detection
Operation: detect
Count: 1,000
Avg: 0.045ms | p95: 0.120ms (target: 5ms)
Throughput: 22,222 ops$s
Memory Δ: 0.12MB
✅ Worker Registry
Operation: crud
Count: 1,500
Avg: 1.234ms | p95: 3.456ms (target: 10ms)
Throughput: 810 ops$s
Memory Δ: 2.34MB
───────────────────────────────────────────────────────────
📊 SUMMARY
───────────────────────────────────────────────────────────
Total Tests: 6
Passed: 6 | Failed: 0
Avg Latency: 0.567ms
Total Duration: 2345ms
Peak Memory: 8.90MB
═══════════════════════════════════════════════════════════
```
## Integration with Settings
Benchmark thresholds are configured in `.claude$settings.json`:
```json
{
"performance": {
"benchmarkThresholds": {
"triggerDetection": { "p95Ms": 5 },
"workerRegistry": { "p95Ms": 10 },
"agentSelection": { "p95Ms": 1 },
"memoryKeyGeneration": { "p95Ms": 0.1 },
"concurrentWorkers": { "totalMs": 1000 }
}
}
}
```
## Programmatic Usage
```typescript
import { workerBenchmarks, runBenchmarks } from 'agentic-flow$workers$worker-benchmarks';
// Run full suite
const suite = await runBenchmarks();
console.log(suite.summary);
// Run individual benchmarks
const triggerResult = await workerBenchmarks.benchmarkTriggerDetection(1000);
const registryResult = await workerBenchmarks.benchmarkRegistryOperations(500);
```
## Performance Optimization Tips
1. **Model Cache**: Enable with `CLAUDE_FLOW_MODEL_CACHE_MB=512`
2. **Parallel Workers**: Enable with `CLAUDE_FLOW_WORKER_PARALLEL=true`
3. **Warning Suppression**: Enable with `CLAUDE_FLOW_SUPPRESS_WARNINGS=true`
4. **SQLite WAL Mode**: Automatic for better concurrent performance