DevamShah
diff --git a/‎README.md‎
Lines changed: 98 additions & 13 deletions b/‎README.md‎
Lines changed: 98 additions & 13 deletions
diff --git a/‎apps/cli/src/commands/start.ts‎
Lines changed: 2 additions & 0 deletions b/‎apps/cli/src/commands/start.ts‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎apps/cli/src/docker.ts‎
Lines changed: 10 additions & 0 deletions b/‎apps/cli/src/docker.ts‎
Lines changed: 10 additions & 0 deletions
diff --git a/‎apps/cli/src/index.ts‎
Lines changed: 19 additions & 0 deletions b/‎apps/cli/src/index.ts‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎apps/worker/package.json‎
Lines changed: 4 additions & 2 deletions b/‎apps/worker/package.json‎
Lines changed: 4 additions & 2 deletions
diff --git a/‎apps/worker/src/__tests__/sarif-output-provider.test.ts‎
Lines changed: 145 additions & 0 deletions b/‎apps/worker/src/__tests__/sarif-output-provider.test.ts‎
Lines changed: 145 additions & 0 deletions
diff --git a/‎apps/worker/src/services/index.ts‎
Lines changed: 1 addition & 0 deletions b/‎apps/worker/src/services/index.ts‎
Lines changed: 1 addition & 0 deletions
@@ -1,26 +1,111 @@
->[!NOTE]
-> **[📢 New: Shannon is now available via `npx @keygraph/shannon`. →](https://github.com/KeygraphHQ/shannon/discussions/249)**
-
 <div align="center">
 
-<img src="./assets/github-banner.png" alt="Shannon — AI Pentester for Web Applications and APIs" width="100%">
+# Vedha — Autonomous AI Pentester
+
+**A friendly fork of [Shannon by Keygraph](https://github.com/KeygraphHQ/shannon),
+hardened and extended for production-side use.**
 
-# Shannon — AI Pentester by Keygraph
+[![Upstream: Shannon](https://img.shields.io/badge/upstream-KeygraphHQ%2Fshannon-blue?logo=github)](https://github.com/KeygraphHQ/shannon)
+[![License: AGPL-3.0](https://img.shields.io/badge/license-AGPL--3.0-blue.svg)](LICENSE)
 
-<a href="https://trendshift.io/repositories/15604" target="_blank"><img src="https://trendshift.io/api/badge/repositories/15604" alt="KeygraphHQ%2Fshannon | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>
+</div>
 
-Shannon is an autonomous, white-box AI pentester for web applications and APIs. <br />
-It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities before they reach production.
+## Credit & lineage
 
----
+Vedha is built on top of **[Shannon](https://github.com/KeygraphHQ/shannon)**,
+the open-source autonomous AI pentester developed by
+**[Keygraph](https://keygraph.io)**. The full pipeline architecture
+(Temporal workflows, the five-domain agent topology, browser-driven
+exploitation, the Wolfi container image, the report assembler) is
+Shannon's. The license is AGPL-3.0, inherited from upstream.
+
+If you're evaluating an AI pentester for the first time, **start with
+upstream Shannon** — it's the canonical project. Vedha exists for
+three reasons:
+
+1. To carry **security hardening patches** that haven't been upstreamed
+   yet (see "What Vedha adds" below).
+2. To act as a **testbed for additions** — like SARIF output — that
+   we're proposing back to upstream via PR.
+3. To let me run a customised build inside the Archeon agent stack
+   without forking the upstream's release cadence.
+
+Where this README still says "Shannon," that's because the upstream
+docs accurately describe behaviour Vedha inherits unchanged. Wherever
+behaviour differs, Vedha's section takes precedence.
+
+## What Vedha adds over upstream Shannon
+
+Three categories of changes layered on top of Shannon Lite:
+
+### 1. Security hardening (8 issues)
+
+| ID | What | File |
+|---|---|---|
+| **S-1** | `sanitizePromptValue()` neutralises `{{...}}` placeholder syntax and `@include(...)` directives in every user-controlled prompt interpolation site (config description, focus/avoid rule descriptions, credentials, auth-context). Prevents prompt injection by anyone who can write a Shannon config. | `apps/worker/src/services/prompt-manager.ts` |
+| **S-2** | Credentials (username / password / TOTP secret) are sanitised before reaching the prompt template. | same |
+| **S-3** | Rule descriptions in `config.avoid` and `config.focus` are sanitised before interpolation. | same |
+| **S-4** | `SHANNON_HOST_UID` / `SHANNON_HOST_GID` are validated as numeric and within `1..2_000_000` before they reach `groupadd`/`useradd`. Rejects 0 (root), negatives, and non-numeric input that would otherwise feed `userdel ; rm -rf /`-style payloads into a privileged command. | `entrypoint.sh` |
+| **S-5** | Container temp dirs (`/app`, `/tmp/.cache`, `/tmp/.config`, `/tmp/.npm`) drop from `chmod 777` to `chmod 770`. | `Dockerfile` |
+| **S-6** | URL is parsed once up front with a try/catch and an `http`/`https` scheme allowlist, instead of crashing mid-setup with a raw `TypeError` on a malformed input. | `apps/cli/src/index.ts` |
+| **S-7** | The `session.json` polling loop now distinguishes `ENOENT` (the expected steady-state) and `SyntaxError` (worker mid-write) from real I/O errors (`EACCES`, `EIO`, `ENOTDIR`), so a permissions issue surfaces with a real diagnostic instead of an indefinite spinner. | same |
+| **S-8** | Splash falls back to plain ASCII when the terminal doesn't advertise UTF-8, instead of emitting `?`/mojibake on raw cmd.exe / locale-less SSH / some CI log streams. | `apps/cli/src/splash.ts` |
+
+These patches were originally written for Vedha and have also been
+proposed back to upstream Shannon as
+[KeygraphHQ/shannon#322](https://github.com/KeygraphHQ/shannon/pull/322).
+
+### 2. SARIF 2.1.0 report output
 
-<a href="https://discord.gg/9ZqQPuhJB7"><img src="./assets/discord.png" height="40" alt="Join Discord"></a>
-<a href="https://keygraph.io/"><img src="./assets/Keygraph_Button.png" height="40" alt="Visit Keygraph.io"></a>
+A new `--report-format sarif` flag emits a SARIF 2.1.0 file alongside
+the markdown report, so findings can be ingested by:
+
+- **GitHub Code Scanning** (auto-uploaded by `github/codeql-action/upload-sarif`)
+- **GitLab CI** security dashboards
+- **Defect Dojo**, **SonarQube**, and any other SARIF-aware scanner UI
+
+```bash
+./vedha start --url https://example.com --repo my-repo --report-format sarif
+# writes:
+#   <repo>/.shannon/deliverables/comprehensive_security_assessment_report.md
+#   <repo>/.shannon/deliverables/comprehensive_security_assessment_report.sarif
+```
+
+The tool driver advertises five rules tagged with their CWE IDs:
+`vedha.injection` (CWE-74), `vedha.xss` (CWE-79),
+`vedha.auth` (CWE-287), `vedha.ssrf` (CWE-918),
+`vedha.authz` (CWE-285). Default behaviour (`md`) is unchanged —
+SARIF is opt-in.
+
+### 3. Branding & integration
+
+- CLI rebranded to `./vedha` / `npx @archeon/vedha` (Shannon's `./shannon` invocation works too via the legacy entrypoint).
+- State directory at `~/.vedha/` instead of `~/.shannon/`.
+- Logo and ASCII splash refreshed.
+
+## Versioning & sync policy
+
+Vedha tracks Shannon mainline at coarse cadence — typically a couple of
+releases behind. When upstream ships a meaningful change (a CVE fix, a
+new vulnerability domain, a workflow refactor), Vedha syncs and tests
+before tagging.
+
+| Vedha version | Based on Shannon | What's new in Vedha |
+|---|---|---|
+| **v1.0.0** | pre-v1.1.0 main | Initial fork; 8 security fixes (S-1..S-8) |
+| **v1.1.0** *(this release)* | pre-v1.1.0 main | + SARIF 2.1.0 output (`--report-format sarif`) |
+
+For all upstream features not listed under "What Vedha adds," refer to
+[Shannon's documentation](https://github.com/KeygraphHQ/shannon) —
+Vedha inherits them unchanged.
 
 ---
-</div>
 
-## What is Shannon?
+> The remainder of this README is Shannon's documentation, lightly
+> edited for Vedha. Behaviour described below applies to Vedha
+> identically unless explicitly noted.
+
+## What is Shannon? (inherited)
 
 Shannon is an AI pentester developed by [Keygraph](https://keygraph.io). It performs white-box security testing of web applications and their underlying APIs by combining source code analysis with live exploitation.
 
 
@@ -24,6 +24,7 @@ export interface StartArgs {
   output?: string;
   pipelineTesting: boolean;
   router: boolean;
+  reportFormat: 'md' | 'sarif';
   version: string;
 }
 
@@ -125,6 +126,7 @@ export async function start(args: StartArgs): Promise<void> {
     taskQueue,
     containerName,
     envFlags: buildEnvFlags(),
+    reportFormat: args.reportFormat,
     ...(config && { config }),
     ...(hasCredentials && { credentials: credentialsPath }),
     ...(promptsDir && { promptsDir }),
 
@@ -196,6 +196,7 @@ export interface WorkerOptions {
   outputDir?: string;
   workspace: string;
   pipelineTesting?: boolean;
+  reportFormat?: 'md' | 'sarif';
 }
 
 /**
@@ -244,6 +245,15 @@ export function spawnWorker(opts: WorkerOptions): ChildProcess {
   // Environment
   args.push(...opts.envFlags);
 
+  // Forward Vedha-specific runtime flags as env. Done as env (rather than
+  // CLI args) because the worker reads them inside the activity to gate
+  // optional output emission, and env survives Temporal serialisation
+  // without needing pipeline-input plumbing.
+  if (opts.reportFormat && opts.reportFormat !== 'md') {
+    args.push('-e', `VEDHA_REPORT_FORMAT=${opts.reportFormat}`);
+  }
+  args.push('-e', `VEDHA_VERSION=${opts.version}`);
+
   // Container settings
   args.push('--shm-size', '2gb', '--security-opt', 'seccomp=unconfined');
 
 
@@ -70,6 +70,10 @@ Options for 'start':
   -w, --workspace <name>    Named workspace (auto-resumes if exists)
       --pipeline-testing    Use minimal prompts for fast testing
       --router              Route requests through claude-code-router
+      --report-format <fmt> Report output format: 'md' (default) or 'sarif'
+                            'sarif' emits a SARIF 2.1.0 file alongside the
+                            markdown report for ingestion by GitHub Code
+                            Scanning, GitLab, Defect Dojo, etc.
 
 Examples:
   ${prefix} start -u https://example.com -r ${mode === 'local' ? 'my-repo' : './my-repo'}
@@ -87,6 +91,8 @@ Monitor workflows at http://localhost:8233
 `);
 }
 
+type ReportFormat = 'md' | 'sarif';
+
 interface ParsedStartArgs {
   url: string;
   repo: string;
@@ -95,6 +101,7 @@ interface ParsedStartArgs {
   output?: string;
   pipelineTesting: boolean;
   router: boolean;
+  reportFormat: ReportFormat;
 }
 
 function parseStartArgs(argv: string[]): ParsedStartArgs {
@@ -105,6 +112,7 @@ function parseStartArgs(argv: string[]): ParsedStartArgs {
   let output: string | undefined;
   let pipelineTesting = false;
   let router = false;
+  let reportFormat: ReportFormat = 'md';
 
   for (let i = 0; i < argv.length; i++) {
     const arg = argv[i];
@@ -152,6 +160,16 @@ function parseStartArgs(argv: string[]): ParsedStartArgs {
       case '--router':
         router = true;
         break;
+      case '--report-format':
+        if (next && !next.startsWith('-')) {
+          if (next !== 'md' && next !== 'sarif') {
+            console.error(`ERROR: --report-format must be 'md' or 'sarif', got '${next}'`);
+            process.exit(1);
+          }
+          reportFormat = next;
+          i++;
+        }
+        break;
       default:
         console.error(`Unknown option: ${arg}`);
         console.error(`Run "${getMode() === 'local' ? './vedha' : 'npx @archeon/vedha'} help" for usage`);
@@ -182,6 +200,7 @@ function parseStartArgs(argv: string[]): ParsedStartArgs {
     repo,
     pipelineTesting,
     router,
+    reportFormat,
     ...(config && { config }),
     ...(workspace && { workspace }),
     ...(output && { output }),
 
@@ -16,7 +16,8 @@
   "scripts": {
     "build": "tsc",
     "check": "tsc --noEmit",
-    "clean": "rm -rf dist"
+    "clean": "rm -rf dist",
+    "test": "vitest run"
   },
   "dependencies": {
     "@anthropic-ai/claude-agent-sdk": "catalog:",
@@ -32,6 +33,7 @@
     "zx": "^8.0.0"
   },
   "devDependencies": {
-    "@types/js-yaml": "^4.0.9"
+    "@types/js-yaml": "^4.0.9",
+    "vitest": "^4.1.2"
   }
 }
@@ -0,0 +1,145 @@
+/**
+ * Behavioural tests for SarifReportOutputProvider.
+ *
+ * Covers the contract that consumers (GitHub Code Scanning, GitLab,
+ * Defect Dojo) actually depend on:
+ *   - SARIF 2.1.0 envelope with the expected top-level fields
+ *   - Tool driver advertises the five built-in vulnerability rules
+ *   - One result per non-empty evidence file, ruleId matching the rule
+ *   - Empty / missing evidence files do not produce results
+ *   - Result messages are truncated rather than dropping out at limit
+ */
+
+import fs from 'node:fs/promises';
+import os from 'node:os';
+import path from 'node:path';
+import { afterEach, describe, expect, it } from 'vitest';
+import { SarifReportOutputProvider } from '../services/sarif-output-provider.js';
+import type { ActivityLogger } from '../types/activity-logger.js';
+
+const noopLogger: ActivityLogger = {
+  info: () => undefined,
+  warn: () => undefined,
+  error: () => undefined,
+};
+
+async function setupRepoWithDeliverables(
+  evidence: Record<string, string>,
+): Promise<{ repoPath: string; cleanup: () => Promise<void> }> {
+  const repoPath = await fs.mkdtemp(path.join(os.tmpdir(), 'vedha-sarif-test-'));
+  const deliverablesPath = path.join(repoPath, '.shannon', 'deliverables');
+  await fs.mkdir(deliverablesPath, { recursive: true });
+  for (const [name, body] of Object.entries(evidence)) {
+    await fs.writeFile(path.join(deliverablesPath, name), body, 'utf8');
+  }
+  return {
+    repoPath,
+    cleanup: () => fs.rm(repoPath, { recursive: true, force: true }),
+  };
+}
+
+function makeInput(repoPath: string): { repoPath: string } {
+  return { repoPath };
+}
+
+describe('SarifReportOutputProvider', () => {
+  let cleanup: (() => Promise<void>) | null = null;
+
+  afterEach(async () => {
+    if (cleanup) {
+      await cleanup();
+      cleanup = null;
+    }
+  });
+
+  it('emits a valid SARIF 2.1.0 envelope when at least one finding exists', async () => {
+    const setup = await setupRepoWithDeliverables({
+      'injection_exploitation_evidence.md': '## SQL injection in /api/users\n\nProof: `' + "1' OR '1'='1" + '`',
+    });
+    cleanup = setup.cleanup;
+
+    const provider = new SarifReportOutputProvider('1.1.0');
+    const result = await provider.generate(makeInput(setup.repoPath), noopLogger);
+
+    expect(result.outputPath).toBeDefined();
+    const sarif = JSON.parse(await fs.readFile(result.outputPath as string, 'utf8'));
+    expect(sarif.version).toBe('2.1.0');
+    expect(sarif.$schema).toMatch(/sarif-schema-2\.1\.0/);
+    expect(sarif.runs).toHaveLength(1);
+    expect(sarif.runs[0].tool.driver.name).toBe('Vedha');
+    expect(sarif.runs[0].tool.driver.version).toBe('1.1.0');
+    expect(sarif.runs[0].tool.driver.rules).toHaveLength(5);
+    expect(sarif.runs[0].results).toHaveLength(1);
+    expect(sarif.runs[0].results[0].ruleId).toBe('vedha.injection');
+    expect(sarif.runs[0].results[0].message.text).toMatch(/SQL injection/);
+  });
+
+  it('emits one result per non-empty evidence file', async () => {
+    const setup = await setupRepoWithDeliverables({
+      'injection_exploitation_evidence.md': 'finding',
+      'xss_exploitation_evidence.md': 'finding',
+      'authz_exploitation_evidence.md': 'finding',
+    });
+    cleanup = setup.cleanup;
+
+    const provider = new SarifReportOutputProvider();
+    const result = await provider.generate(makeInput(setup.repoPath), noopLogger);
+    const sarif = JSON.parse(await fs.readFile(result.outputPath as string, 'utf8'));
+
+    expect(sarif.runs[0].results.map((r: { ruleId: string }) => r.ruleId).sort()).toEqual([
+      'vedha.authz',
+      'vedha.injection',
+      'vedha.xss',
+    ]);
+  });
+
+  it('skips empty and missing evidence files', async () => {
+    const setup = await setupRepoWithDeliverables({
+      'injection_exploitation_evidence.md': '',
+      'xss_exploitation_evidence.md': '   \n  \t\n',
+      // auth/ssrf/authz: not written at all
+    });
+    cleanup = setup.cleanup;
+
+    const provider = new SarifReportOutputProvider();
+    const result = await provider.generate(makeInput(setup.repoPath), noopLogger);
+    const sarif = JSON.parse(await fs.readFile(result.outputPath as string, 'utf8'));
+
+    expect(sarif.runs[0].results).toHaveLength(0);
+    // Even with zero results, the envelope must be valid.
+    expect(sarif.version).toBe('2.1.0');
+    expect(sarif.runs[0].tool.driver.rules).toHaveLength(5);
+  });
+
+  it('truncates oversized evidence rather than dropping it', async () => {
+    const huge = 'A'.repeat(64 * 1024); // 64 KiB, well above the 16 KiB limit
+    const setup = await setupRepoWithDeliverables({
+      'auth_exploitation_evidence.md': huge,
+    });
+    cleanup = setup.cleanup;
+
+    const provider = new SarifReportOutputProvider();
+    const result = await provider.generate(makeInput(setup.repoPath), noopLogger);
+    const sarif = JSON.parse(await fs.readFile(result.outputPath as string, 'utf8'));
+    const messageText = sarif.runs[0].results[0].message.text as string;
+
+    expect(sarif.runs[0].results).toHaveLength(1);
+    expect(messageText.length).toBeLessThan(huge.length);
+    expect(messageText).toMatch(/\[truncated\]$/);
+  });
+
+  it('writes the SARIF file alongside the markdown report', async () => {
+    const setup = await setupRepoWithDeliverables({
+      'ssrf_exploitation_evidence.md': 'finding',
+    });
+    cleanup = setup.cleanup;
+
+    const provider = new SarifReportOutputProvider();
+    const result = await provider.generate(makeInput(setup.repoPath), noopLogger);
+
+    expect(result.outputPath).toBe(
+      path.join(setup.repoPath, '.shannon', 'deliverables', 'comprehensive_security_assessment_report.sarif'),
+    );
+    await expect(fs.access(result.outputPath as string)).resolves.toBeUndefined();
+  });
+});
@@ -20,3 +20,4 @@ export { Container, getContainer, getOrCreateContainer, removeContainer } from '
 export { ExploitationCheckerService } from './exploitation-checker.js';
 export { loadPrompt } from './prompt-manager.js';
 export { assembleFinalReport, injectModelIntoReport } from './reporting.js';
+export { SarifReportOutputProvider } from './sarif-output-provider.js';