Warning

🚧 Work in Progress: This page is currently under construction. Content may be incomplete or subject to change. To contribute, see the contribution guide.

Data Naming Standards

Mandatory conventions for naming resources on the data platform.


BigQuery datasets

{layer}_{domain}[_{subdomain}]

Examples:
  raw_investments
  stage_finance_accounts_payable
  gold_corporate_headcount
  gold_investments_fundraising

Layers: raw · stage · gold
Domains: investments · finance · operations · corporate · risk
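
The pattern above can be checked mechanically. The following is a minimal sketch, not an official validator: the function name `parse_dataset_name` is hypothetical, and the rule that a subdomain is lowercase snake_case is an assumption taken from the examples rather than something this page states.

```python
import re
from typing import Optional

# Allowed values copied from the Layers and Domains lists on this page.
LAYERS = ("raw", "stage", "gold")
DOMAINS = ("investments", "finance", "operations", "corporate", "risk")

_layers = "|".join(LAYERS)
_domains = "|".join(DOMAINS)

# {layer}_{domain}[_{subdomain}]
# Assumption: the optional subdomain is lowercase snake_case.
DATASET_RE = re.compile(
    rf"^(?P<layer>{_layers})"
    rf"_(?P<domain>{_domains})"
    r"(?:_(?P<subdomain>[a-z][a-z0-9]*(?:_[a-z0-9]+)*))?$"
)


def parse_dataset_name(name: str) -> Optional[dict]:
    """Return the name's parts, or None if it does not follow the pattern."""
    match = DATASET_RE.match(name)
    return match.groupdict() if match else None


print(parse_dataset_name("stage_finance_accounts_payable"))
print(parse_dataset_name("silver_finance"))  # unknown layer
```

A check like this could run in CI or in a provisioning script before a dataset is created, so that a non-conforming name is rejected early.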


BigQuery tables

{entity}[_{qualifier}]

Examples:
  funds
  daily_shares
  capital_calls
  portfolio_companies
  active_employees

Rules:
  • Use snake_case, always in English
  • Avoid non-obvious abbreviations
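
The snake_case rule lends itself to an automated check. The sketch below is illustrative only: `is_valid_table_name` is a hypothetical helper, and the exact regular expression (lowercase letters and digits, single underscores, starting with a letter) is one reasonable reading of "snake_case". The rules about English and abbreviations still need human review.

```python
import re

# Assumed definition of snake_case: lowercase words made of letters and
# digits, joined by single underscores, starting with a letter.
SNAKE_CASE_RE = re.compile(r"^[a-z][a-z0-9]*(?:_[a-z0-9]+)*$")


def is_valid_table_name(name: str) -> bool:
    """Check only the snake_case rule for a table name."""
    return SNAKE_CASE_RE.fullmatch(name) is not None


for candidate in ("daily_shares", "DailyShares", "daily__shares"):
    print(candidate, is_valid_table_name(candidate))
```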

Columns

| Column type     | Convention             | Example                           |
|-----------------|------------------------|-----------------------------------|
| Primary key     | {entity}_id            | fund_id, employee_id              |
| Foreign key     | {referenced_entity}_id | fund_id in a shares table         |
| Dates           | {context}_date         | reference_date, fundraising_date  |
| Timestamps      | {context}_at           | created_at, updated_at            |
| Boolean flags   | is_{condition}         | is_active, is_closed              |
| Monetary values | {context}_amount       | share_amount, investment_amount   |
| Load metadata   | _load_date, _source    | Technical columns with _ prefix   |
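
The column conventions above are all prefix or suffix rules, so a linter can infer which convention a column name is using. This is a rough sketch under that assumption; `classify_column` and its return labels are hypothetical. A name alone cannot distinguish a primary key from a foreign key, so both are reported as "key".

```python
# Suffix rules taken from the conventions in this section.
SUFFIX_KINDS = (
    ("_id", "key"),
    ("_date", "date"),
    ("_at", "timestamp"),
    ("_amount", "monetary value"),
)


def classify_column(name: str) -> str:
    """Guess which naming convention a column name follows."""
    # Technical columns are checked first, so _load_date is treated as
    # load metadata rather than as a business date.
    if name.startswith("_"):
        return "load metadata"
    if name.startswith("is_"):
        return "boolean flag"
    for suffix, kind in SUFFIX_KINDS:
        if name.endswith(suffix):
            return kind
    return "unclassified"


for column in ("fund_id", "created_at", "_load_date", "name"):
    print(column, "->", classify_column(column))
```

Comparing the inferred kind with the column's actual data type (for example, flagging a `*_date` column stored as a string) would be a natural next step, but it depends on schema access that is outside this sketch.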

Airflow DAGs

{domain}_{subdomain}_{layer}_{action}

Examples:
  investments_fundraising_raw_ingest
  finance_accounts_payable_stage_transform
  corporate_headcount_gold_load
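
Because a subdomain may itself contain underscores (as in accounts_payable), splitting a DAG ID on "_" is not enough to recover its parts. The sketch below shows one possible approach, anchoring on the known layer names. `parse_dag_id` is a hypothetical helper, the domain and layer lists are copied from the BigQuery datasets section, and treating the action as free-form lowercase text is an assumption, since this page does not list the allowed actions.

```python
import re
from typing import Optional

# Values copied from the BigQuery datasets section of this page.
LAYERS = ("raw", "stage", "gold")
DOMAINS = ("investments", "finance", "operations", "corporate", "risk")

_layers = "|".join(LAYERS)
_domains = "|".join(DOMAINS)

# {domain}_{subdomain}_{layer}_{action}
# The subdomain is matched lazily, so the first layer token that appears
# after it ends the subdomain. A subdomain containing a layer name as a
# whole word would therefore be split incorrectly.
DAG_ID_RE = re.compile(
    rf"^(?P<domain>{_domains})"
    r"_(?P<subdomain>[a-z0-9]+(?:_[a-z0-9]+)*?)"
    rf"_(?P<layer>{_layers})"
    r"_(?P<action>[a-z0-9]+(?:_[a-z0-9]+)*)$"
)


def parse_dag_id(dag_id: str) -> Optional[dict]:
    """Return the DAG ID's parts, or None if it does not follow the pattern."""
    match = DAG_ID_RE.match(dag_id)
    return match.groupdict() if match else None


print(parse_dag_id("finance_accounts_payable_stage_transform"))
print(parse_dag_id("finance_stage_transform"))  # missing subdomain
```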