Schema Metadata Quality Audit - December 29, 2025 #8130
Replies: 2 comments 1 reply
-
|
/plan |
Beta Was this translation helpful? Give feedback.
-
Plan Created ✅I've created a structured implementation plan to address the schema metadata quality findings from this audit: Parent Tracking Issue: Improve schema metadata and documentation quality
5 Sub-Issues Created:
Total Estimated Effort: ~5-7 hours These focused, actionable tasks can be assigned to GitHub Copilot agents for implementation. Each sub-issue has clear objectives, files to modify, implementation guidance, and acceptance criteria.
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Executed Strategy-008 (Schema Completeness & Internal Reference Analysis) focusing on schema metadata quality, internal consistency, and documentation completeness across all three schema files.
Key Results:
This is a regression check confirming that schema quality has not degraded since November 2025. All identified issues remain from previous audits, indicating stable but unimproved metadata quality.
Critical Issues
1. Missing Top-Level Metadata (All 3 Schemas)
Priority: HIGH | Category: Schema Metadata / IDE Integration
All three schemas are missing critical top-level metadata:
$idfield for schema identificationtitlefield for human-readable namedescriptionfield for schema purposeAffected Files:
pkg/parser/schemas/main_workflow_schema.jsonpkg/parser/schemas/included_file_schema.jsonpkg/parser/schemas/mcp_config_schema.jsonImpact:
Recommendation: Add metadata to all schemas:
{ "$schema": "(redacted) "$id": "https://github.com/githubnext/gh-aw/schemas/main_workflow_schema.json", "title": "GitHub Agentic Workflow Schema", "description": "JSON Schema for validating agentic workflow frontmatter configuration", "version": "1.0.0", ... }2. applyTo Field Missing Description
Priority: MEDIUM | Category: Documentation Completeness
The
applyTofield in included_file_schema.json is the only root-level property without a description.Location:
pkg/parser/schemas/included_file_schema.json-applyTopropertyRecommendation: Add description explaining that
applyTofilters which jobs use the included file.Moderate Issues
3. Major Fields Missing Examples
Priority: MEDIUM | Category: Discoverability / Learning
Critical workflow configuration fields lack inline examples in schema.
Missing Examples:
on- Trigger events (most important field)engine- AI engine configurationtools- Tool configurationsafe-outputs- Safe output configurationmcp-servers- MCP server configurationsteps- Workflow stepsCurrent Coverage: Root properties with examples: 19 of 39 (49%)
Impact: Steeper learning curve, reduced IDE autocomplete usefulness, users rely on external documentation
4. Implicit vs Explicit Default Values
Priority: LOW | Category: Schema Quality
6 fields describe default values in description text but lack explicit
defaultfield:name,timeout-minutes,timeout_minutes,concurrency,roles,github-tokenOnly 2 fields have explicit
default:engine: "copilot",strict: true5. Single-Value Enums Should Use const
Priority: LOW | Category: Schema Style
10 instances use
enum: ["value"]instead ofconst: "value"(JSON Schema best practice).6. Zero Format Constraints
Priority: LOW | Category: Validation Strength
No
formatconstraints used despite opportunities for stronger validation (email, uri, date-time, etc.).Pattern constraints: 16 ✓ | Format constraints: 0 ✗
7. 66 Nested Properties Missing Descriptions
Priority: LOW | Category: Documentation Completeness
Deep nested fields lack descriptions (many are structural artifacts from oneOf patterns).
8. MCP Schema: Zero Examples
Priority: MEDIUM | Category: Usability
mcp_config_schema.jsonhas zero example fields despite being complex configuration.Positive Findings
✓ All Root-Level Descriptions Complete (Main Schema): 39 of 39 properties (100%)
✓ All $defs Are Used (No Dead Code):
engine_config: 2 referencesgithub_token: 31 referenceshttp_mcp_tool: 1 referencestdio_mcp_tool: 1 reference✓ Excellent additionalProperties Discipline: 167 constraints prevent typos and invalid fields
✓ Good $comment Usage: 28 fields document implementation notes and compiler behavior
✓ Pattern Constraints Present: 16 pattern constraints validate string formats
✓ Array Constraints Present: 15 minItems + 1 uniqueItems constraints
Key Insights
Recommendations
Immediate Actions (Priority 1)
Add Top-Level Metadata to All 3 Schemas
$id,title,descriptionfieldsAdd applyTo Description
Short-Term Actions (Priority 2)
Add Examples to Major Fields
on,engine,tools,steps,safe-outputs,mcp-serversAdd Examples to MCP Schema
Long-Term Actions (Priority 3)
Strategy Performance
Comparison with Last Run (2025-11-18):
Analysis Date: December 29, 2025
Strategy: strategy-008 (Schema Completeness & Internal Reference Analysis)
Files Analyzed: main_workflow_schema.json, included_file_schema.json, mcp_config_schema.json
Status: Regression check PASS - Quality stable, improvements needed
Beta Was this translation helpful? Give feedback.
All reactions