worker-benchmarks

Run comprehensive worker system benchmarks and performance analysis

INSTALLATION
npx skills add https://github.com/ruvnet/ruflo --skill worker-benchmarks
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

1. Trigger Detection (trigger-detection)

Tests keyword detection speed across 12 worker triggers.

  • Target: p95 < 5ms
  • Iterations: 1000
  • Metrics: latency, throughput, histogram
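The p95 target and throughput metric above can be illustrated with a minimal sketch. It assumes the nearest-rank percentile definition and sequential operations; the source does not say how the benchmark actually computes either figure, and the names `percentile`, `summarize`, and `LatencyStats` are hypothetical, not part of the worker-benchmarks API.

```typescript
// Hypothetical helpers for illustration; not the benchmark's own code.
interface LatencyStats {
  count: number;
  avgMs: number;
  p95Ms: number;
  opsPerSec: number;
}

// Nearest-rank percentile: the smallest sample such that at least
// p percent of all samples are less than or equal to it.
function percentile(samplesMs: number[], p: number): number {
  if (samplesMs.length === 0) throw new Error("no samples");
  const sorted = [...samplesMs].sort((a, b) => a - b);
  const rank = Math.ceil((p * sorted.length) / 100);
  return sorted[Math.max(rank, 1) - 1];
}

function summarize(samplesMs: number[]): LatencyStats {
  const totalMs = samplesMs.reduce((sum, ms) => sum + ms, 0);
  return {
    count: samplesMs.length,
    avgMs: totalMs / samplesMs.length,
    p95Ms: percentile(samplesMs, 95),
    // Assumes operations ran one after another: ops/s = count / total seconds.
    opsPerSec: totalMs > 0 ? (samplesMs.length / totalMs) * 1000 : Infinity,
  };
}

// 20 synthetic samples: 1ms, 2ms, ..., 20ms.
const samples = Array.from({ length: 20 }, (_, i) => i + 1);
const stats = summarize(samples);

// Compare against the trigger-detection target (p95 < 5ms).
const meetsTarget = stats.p95Ms < 5;
```

With these synthetic samples the p95 lands on the 19th of 20 sorted values, so the target check fails; real trigger-detection latencies would be far smaller.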

2. Worker Registry (registry)

Tests CRUD operations on worker entries.

  • Target: p95 < 10ms
  • Iterations: 500 each of create, get, and update (1,500 operations total)
  • Metrics: per-operation latency breakdown

3. Agent Selection (agent-selection)

Tests performance-based agent selection.

  • Target: p95 < 1ms
  • Iterations: 1000
  • Metrics: selection confidence, agent scores

4. Model Cache (cache)

Tests model caching performance.

  • Target: p95 < 0.5ms
  • Metrics: hit rate, cache size, eviction stats
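To make the hit rate, cache size, and eviction metrics concrete, here is a small sketch of a cache that tracks them. The source does not say which eviction policy the model cache uses; least-recently-used (LRU) is assumed here purely for illustration, and the `LruCache` class and its fields are hypothetical.

```typescript
// Hypothetical LRU cache with counters; not the actual model cache.
class LruCache<V> {
  private entries = new Map<string, V>();
  private capacity: number;
  hits = 0;
  misses = 0;
  evictions = 0;

  constructor(capacity: number) {
    if (capacity < 1) throw new Error("capacity must be at least 1");
    this.capacity = capacity;
  }

  get(key: string): V | undefined {
    if (!this.entries.has(key)) {
      this.misses++;
      return undefined;
    }
    const value = this.entries.get(key) as V;
    // Re-insert to mark as most recently used (Map keeps insertion order).
    this.entries.delete(key);
    this.entries.set(key, value);
    this.hits++;
    return value;
  }

  set(key: string, value: V): void {
    if (this.entries.has(key)) {
      this.entries.delete(key);
    } else if (this.entries.size >= this.capacity) {
      // Evict the least recently used entry: the first key in the Map.
      const oldest = this.entries.keys().next().value as string;
      this.entries.delete(oldest);
      this.evictions++;
    }
    this.entries.set(key, value);
  }

  get size(): number {
    return this.entries.size;
  }

  get hitRate(): number {
    const lookups = this.hits + this.misses;
    return lookups === 0 ? 0 : this.hits / lookups;
  }
}

const cache = new LruCache<string>(2);
cache.set("model-a", "A");
cache.set("model-b", "B");
cache.get("model-a");                     // hit; model-b is now least recently used
cache.set("model-c", "C");                // cache is full, so model-b is evicted
const evicted = cache.get("model-b");     // miss
```

After this sequence the cache has recorded one hit, one miss, and one eviction, for a hit rate of 0.5.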

5. Concurrent Workers (concurrent)

Tests parallel worker creation and updates.

  • Target: < 1000ms for 10 workers
  • Metrics: per-worker latency, memory usage

6. Memory Key Generation (memory-keys)

Tests memory pattern key generation.

  • Target: p95 < 0.1ms
  • Iterations: 5000
  • Metrics: unique patterns, throughput

Output Format

═══════════════════════════════════════════════════════════
📈 BENCHMARK RESULTS
═══════════════════════════════════════════════════════════

✅ Trigger Detection
   Operation: detect
   Count: 1,000
   Avg: 0.045ms | p95: 0.120ms (target: 5ms)
   Throughput: 22,222 ops/s
   Memory Δ: 0.12MB

✅ Worker Registry
   Operation: crud
   Count: 1,500
   Avg: 1.234ms | p95: 3.456ms (target: 10ms)
   Throughput: 810 ops/s
   Memory Δ: 2.34MB

───────────────────────────────────────────────────────────
📊 SUMMARY
───────────────────────────────────────────────────────────
Total Tests: 6
Passed: 6 | Failed: 0
Avg Latency: 0.567ms
Total Duration: 2345ms
Peak Memory: 8.90MB
═══════════════════════════════════════════════════════════

Integration with Settings

Benchmark thresholds are configured in .claude/settings.json:

{
  "performance": {
    "benchmarkThresholds": {
      "triggerDetection": { "p95Ms": 5 },
      "workerRegistry": { "p95Ms": 10 },
      "agentSelection": { "p95Ms": 1 },
      "memoryKeyGeneration": { "p95Ms": 0.1 },
      "concurrentWorkers": { "totalMs": 1000 }
    }
  }
}
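
As a rough sketch of how such thresholds might be applied, the snippet below compares measured values against the configured limits and lists any violations. The `Measured` shape and the `findViolations` function are assumptions made for illustration; the source does not document the real result type or how the suite evaluates thresholds. A value fails when it is not strictly below its limit, matching the "p95 < N ms" targets listed earlier.

```typescript
// Mirrors the benchmarkThresholds block in .claude/settings.json.
type Threshold = { p95Ms?: number; totalMs?: number };
type Thresholds = Record<string, Threshold>;

// Hypothetical result shape; the real benchmark result type may differ.
type Measured = { p95Ms?: number; totalMs?: number };

const thresholds: Thresholds = {
  triggerDetection: { p95Ms: 5 },
  workerRegistry: { p95Ms: 10 },
  agentSelection: { p95Ms: 1 },
  memoryKeyGeneration: { p95Ms: 0.1 },
  concurrentWorkers: { totalMs: 1000 },
};

function findViolations(
  measured: Record<string, Measured>,
  limits: Thresholds,
): string[] {
  const found: string[] = [];
  for (const [benchmark, limit] of Object.entries(limits)) {
    const result = measured[benchmark];
    if (result === undefined) continue; // benchmark was not run
    for (const metric of ["p95Ms", "totalMs"] as const) {
      const max = limit[metric];
      const actual = result[metric];
      // Targets are strict upper bounds, so equality counts as a failure.
      if (max !== undefined && actual !== undefined && actual >= max) {
        found.push(`${benchmark}.${metric}: ${actual} >= ${max}`);
      }
    }
  }
  return found;
}

const violations = findViolations(
  {
    triggerDetection: { p95Ms: 0.12 },     // p95 from the sample output
    workerRegistry: { p95Ms: 3.456 },      // p95 from the sample output
    concurrentWorkers: { totalMs: 1200 },  // made-up failing value
  },
  thresholds,
);
```

Here only the invented `concurrentWorkers` value exceeds its limit, so `violations` holds a single entry; benchmarks missing from the measured results are skipped rather than treated as failures.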

Programmatic Usage

import { workerBenchmarks, runBenchmarks } from 'agentic-flow/workers/worker-benchmarks';

// Run full suite
const suite = await runBenchmarks();
console.log(suite.summary);

// Run individual benchmarks
const triggerResult = await workerBenchmarks.benchmarkTriggerDetection(1000);
const registryResult = await workerBenchmarks.benchmarkRegistryOperations(500);

Performance Optimization Tips

  • Model Cache: Enable with CLAUDE_FLOW_MODEL_CACHE_MB=512
  • Parallel Workers: Enable with CLAUDE_FLOW_WORKER_PARALLEL=true
  • Warning Suppression: Enable with CLAUDE_FLOW_SUPPRESS_WARNINGS=true
  • SQLite WAL Mode: Automatic for better concurrent performance