YAML and RAML - sgml/signature GitHub Wiki
YAML is a text based format allowing programmers to store structured data in a hierarchy. YAML is designed to be human and machine readable with a minimum of overhead. The YAML specification can be found at yaml.org. There is also a reference card
Comments start with # and go till newline, comments must be separated from other tokens by whitespace. Whitespace isn't free, indentation must be spaces, not tabs. YAML will consider that lines prefixed with more spaces than the parent key are contained inside it. Moreover, all lines must be prefixed with the same amount of spaces to belong to the same map.
YAML has sequences and mappings as collection types, both can be represented in flow and block style.
An sequence of scalar strings in YAML looks like:
[ one, two, three ]   # flow style
# or block style
- one
- two
- three
A mapping consists of key/value pairs:
index: 4  # block style
name: nali
# or 
{ index: 4, name: nali }   # flow style
# or nested (equivalent of { level: { one: { two: fun } } }):
level:
  one:
    two: fun
Splitting text strings over multiple lines
- Without quotes:
   You can just
   split a long piece of text like this.
- With quotes:
    "[But be careful:
     if you \"need\" punctuation, put double quotes around it. You can ev\
     en split without spaces by using backslashes."
- Or single quotes:
    'This works
     but isn''t as flexible'
- If you want to keep those new line characters: | 
    Then do
    it this way with 
    a pipe (|) character. (This string has three \n characters)
- Or you can have just the one final new line: >
    This string has
    just one \n character, at the very end.
- Block indicators:
    Look up >-, >+, |- and |+ for fine tuning.
Basic YAML types
integer: 25
string: "25"
float: 25.0
boolean: true
null type: null
- Aliases, &, are like XML DTD IDs
- Anchors, *, are like XML DTD IDRefs
default_env: &default_env
  os: "PlayStation OS"
  hardware_version: "PS5"
  gpu: "RDNA 2"
test_suite_1:
  environment: *default_env
  test_case: "Load Time Test"
svg:
  width: 200
  height: 200
  xmlns: "http://www.w3.org/2000/svg"
  elements:
    - &circle
      circle:
        id: myCircle
        cx: 50
        cy: 50
        r: 40
        stroke: black
        fill: blue
    - use:
        href: *circle
        x: 100
        fill: red
YAML supports three styles of escape notation:
- 
Entity Escapes a. space: " " b. colon: ":" c. ampersand: "&" 
- 
Unicode Escapes a. space: "\u0020" b. single quote: "\u0027" c. double quote: "\u0022" 
- 
Quoted Escapes a. double quote in single quote: 'Is "I always lie" a true statement?' b. nested double quote: " She said, "I quit" " c. nested single quote: ' He was speechless: '' ' 
I've been producing and consuming YAML lately. If you have objects serialized to text, you need some form of introspection to reconstruct them with a generic parser. The YAML parser does not know or care about the code in those other modules. It just assumes that the names in the input correspond to classes currently loaded by whomever called the parser. It relies on the language's introspection system to convert the strings in the YAML into calls to constructors.
https://yamllint.readthedocs.io/en/stable/rules.html
http://yaml-online-parser.appspot.com/
https://www.npmjs.com/package/yaml-template
https://github.com/mikestead/hx-yaml/blob/master/src/yaml/Parser.hx
+pyyaml "yaml lists" +dump
python yaml recipes activestate
yaml +python +ruby +dump
import yaml
# Function to check if a key contains any of the specified words
def contains_keywords(key, keywords):
    return any(keyword in key for keyword in keywords)
# Load the YAML file
with open('your_file.yaml', 'r') as file:
    data = yaml.safe_load(file)
# Keywords to check for
keywords = ["foo", "bar", "une"]
# Function to recursively check keys in the YAML data
def check_keys(data, keywords):
    if isinstance(data, dict):
        for key, value in data.items():
            if contains_keywords(key, keywords):
                print(f"Key '{key}' contains one of the keywords.")
            check_keys(value, keywords)
    elif isinstance(data, list):
        for item in data:
            check_keys(item, keywords)
# Check the keys in the loaded YAML data
check_keys(data, keywords)
To call Python code from custom directives in YAML, you can use a combination of YAML for configuration and Python for processing. Here's a high-level approach:
- Define the Custom Directive in YAML: Specify the directive and its parameters in YAML.
- Process the YAML with Python: Use a Python script to read the YAML file, interpret the custom directives, and execute the corresponding Python code.
%YAML 1.2
%TAG !mytag! http://example.com/tags/
---
!mytag!execute:
  script: "my_script.py"
  args:
    - "arg1"
    - "arg2"
### Step 2: Process the Directive in Python
Read the YAML file, check for the custom directive, and execute the specified script with the provided arguments.
import yaml
def execute_script(script, args): # Execute the Python script with the provided arguments exec(f"import {script}\n{script}.main(*{args})")
with open('config.yaml', 'r') as file: data = yaml.safe_load(file)
if 'execute' in data: execute_script(data['execute']['script'], data['execute']['args'])
### !include directive
Use a YAML parser which supports the [`!include` directive](https://pypi.org/project/pyyaml-include/) to use it for transclusion
#### Example
included_foo: !include foo.yaml included_bar: !include bar.yaml
key1: value1 key2: value2
key3: value3 key4: value4
## YAML Validation
yaml_validation_workflow: parsing_and_enforcement: description: "Ensure required YAML keys are present using structured parsing." tools: - ruamel.yaml: "Safely loads and verifies structured YAML configurations." - PyYAML: "Standard Python library for parsing and processing YAML files." example: python: script: | from ruamel.yaml import YAML yaml = YAML() required_keys = {"region", "instance_type"}
      with open("config.yaml", "r") as file:
          data = yaml.load(file)
      
      missing_keys = required_keys - data.keys()
      if missing_keys:
          raise ValueError(f"Missing keys: {missing_keys}")
default_value_injection: description: "Prevent execution failures by injecting default values." tools: - Python dict.get(): "Ensures missing keys get assigned fallback values." - Dynaconf: "Manages dynamic configuration defaults for YAML-based workflows." example: python: script: | config = { "region": data.get("region", "us-east-1"), "instance_type": data.get("instance_type", "t3.micro") }
schema-free validation: description: "Validate YAML structure without explicit schemas." tools: - yamale: "Lightweight YAML schema validator ensuring mandatory fields." - Pykwalify: "Schema-free YAML validation framework using flexible rulesets." example: python: command: "yamale validate config.yaml"
structured introspection: description: "Convert YAML to JSON for structured validation." tools: - jsonschema: "Enforces structured type validation for required keys." - Cerberus: "Flexible validation for YAML-based configurations using JSON rules." example: python: script: | import jsonschema
      schema = {
          "type": "object",
          "properties": {
              "region": {"type": "string"},
              "instance_type": {"type": "string"}
          },
          "required": ["region", "instance_type"]
      }
      
      jsonschema.validate(instance=data, schema=schema)
change_detection: description: "Detect missing key/value pairs by comparing YAML revisions." tools: - Python difflib: "Compares old vs. new YAML configurations to identify omitted keys." - DeepDiff: "Detects structural differences between YAML files." example: python: script: | from deepdiff import DeepDiff
      with open("reference.yaml") as ref_file, open("config.yaml") as new_file:
          ref_data = yaml.load(ref_file)
          new_data = yaml.load(new_file)
      
      diff = DeepDiff(ref_data, new_data)
      print(diff)
observability_and_logging: description: "Enable structured logging to catch missing keys before execution." tools: - Python logging module: "Outputs warnings if YAML keys are missing." - structlog: "Provides structured log formatting for real-time debugging." example: python: script: | import logging logging.basicConfig(level=logging.WARNING)
      if "region" not in data:
          logging.warning("Missing key: region")
python modules:
- "https://yamale.readthedocs.io/en/stable/"
- "https://pykwalify.readthedocs.io/en/latest/"
- "https://python-jsonschema.readthedocs.io/en/latest/"
- "https://python-deepdiff.readthedocs.io/en/latest/"
## References
https://www.rfc-editor.org/rfc/rfc9512.html
https://www.yaml.info/learn/document.html#directive
https://ref.coddy.tech/yaml/yaml-directives
https://docs.stacktape.com/configuration/directives/
https://metacpan.org/pod/Text::FrontMatter::YAML
https://pypi.org/project/python-frontmatter/
https://www.earthdata.nasa.gov/s3fs-public/imported/YAML%201.2%20Spec.pdf
https://builtin.com/software-engineering-perspectives/yaml-vs-json
https://www.snaplogic.com/blog/json-vs-yaml-whats-the-difference-and-which-one-is-right-for-your-enterprise
https://docs.ansible.com/ansible/latest/reference_appendices/YAMLSyntax.html
https://gist.github.com/oconnor663/9aeb4ed56394cb013a20
http://sweetohm.net/article/introduction-yaml.en.html
https://rtfm.co.ua/en/what-is-yaml-its-overview-basic-data-types-yaml-vs-json-and-pyyaml/
http://www.chrisrolle.com/en/blog/yaml-list
https://confit.readthedocs.io/en/latest/
https://itnext.io/what-is-yaml-its-overview-basic-data-types-yaml-vs-json-and-pyyaml-92059d1a24fa
https://www.redhat.com/sysadmin/yaml-tips
https://rollout.io/blog/yaml-tutorial-everything-you-need-get-started/
https://hackersandslackers.com/simplify-your-python-projects-configuration/
https://www.serendipidata.com/posts/safe-api-design-and-pyyaml
https://yaml.readthedocs.io/en/latest/api.html
http://docs.racket-lang.org/yaml/
https://news.ycombinator.com/item?id=17358103
https://www.infoq.com/articles/tosca-docker-orchestration/
https://garywoodfine.com/serverless-aws-lambda-dependency-injection/
## RAML
* https://www.sitepoint.com/raml-restful-api-modeling-language/
* https://www.genuitec.com/understanding-raml/
* https://web.archive.org/web/20141126115053/http://blogs.mulesoft.org/designing-restful-api-longevity/
* https://raml.org
* https://www.github.com/topics/raml
## CVE-2025-20282: Declarative Misconfiguration vs Imperative Safeguards
### Overview
- **CVE ID**: CVE-2025-20282
- **Affected Products**: Cisco Identity Services Engine (ISE) and ISE Passive Identity Connector (PIC)
- **Vulnerability Type**: Unauthenticated file upload → root-level code execution
- **Root Cause**: Declarative API configuration allowed unsafe file uploads without runtime validation
### Why Declarative Code Was the Culprit
- File upload paths were defined declaratively (e.g. static mappings, access scopes)
- No imperative logic to validate file names, paths, or execution flags
- Assumed trust in input structure and MIME types
- Lack of runtime checks allowed attackers to upload and execute arbitrary files
### How Imperative Code Could Have Prevented It
| Imperative Safeguard                     | Declarative Failure                    |
|------------------------------------------|----------------------------------------|
| `if (!isSafePath(filePath)) rejectUpload()` | Static path mapping without validation |
| Conditional logic for user roles         | Hardcoded access scopes                |
| Runtime checks for file type and flags   | Declarative MIME/type mapping          |
| Logging and rejection of suspicious input| No dynamic rejection logic             |
### Lessons for Infrastructure Architects
- Declarative systems simplify configuration but **must be paired with imperative guards** when:
  - Inputs come from untrusted sources
  - Execution paths depend on runtime context
  - Security boundaries rely on naming or structure
### Recommendation
Use imperative logic to:
- Validate inputs dynamically
- Enforce runtime constraints
- Log and reject unsafe operations
- Avoid assumptions based on static configuration
> Even minimal imperative checks can prevent critical exploits in declarative systems.