Data Contract Workshop

Bring data producers and consumers together to define data contracts in a facilitated workshop.

Goal

A defined and agreed upon data contract between data producers and consumers.

Participants

Recommendation: keep the group small (not more than 5 people)

Settings

Guidelines for the Data Contract Specification

  1. Info (get the context)
  2. Examples (example-driven facilitation)
  3. Model (you will spend most of your time here)
    • Use the Data Contract CLI to test the model against the previously created examples:
      datacontract test --examples datacontract.yaml
  4. Quality
  5. Terms
  6. Servers (if already applicable)
    • Start with a “local” server with actual, real data you downloaded
    • Use the Data Contract CLI to test the model against the actual data on a specific server:
      datacontract test datacontract.yaml
    • Switch to the actual remote server, if applicable

Tooling (Data Contract Specification)

Guidelines for ODCS

We recommend to use the Excel template for workshops as it is easier to work with in such a setting as it comes with a nice visualization.

  1. Fundamentals (get the context)
  2. Schema (you will spend most of your time here)
    • Fill in the schemas (tables) and their properties (columns) along with their name and logicalType as a start in the schema part.
    • After that, add information like description, classification, …
    • Use tags or customProperties add additional metadata where there is no direct support by ODCS
  3. Quality
    • Add quality checks at the schema or the property level. Start with quality checks of type text first to capture the requirements.
    • OPTIONAL Conver the text-based requirements into automated sql-based quality checks
  4. SLAs
    • Add SLAs that the data provider guarantees towards all data consumers.
  5. Team & Support
    • Add the team members so that the data consumer knows who is part of the team that owns the data protected by the data contracts.
    • Add a support channel so (potential) data consumers know how to get support and reach the data owners.
  6. Servers (if already applicable)

Tooling (ODCS)