Warning

🚧 Work in Progress: This page is currently under construction. Content may be incomplete or subject to change. To contribute, see the contribution guide.

Data Classification

Sensitivity classification model applied to the data lake.


Classification levels

LevelDescriptionExamplesControls
PublicData that can be shared externallyPublic fund informationNone additional
InternalInternal use onlyOperational metrics, management reportsAuthenticated access (Entra ID)
ConfidentialRestricted to specific teamsInvestment strategies, portco dataSpecific AD group
RestrictedMinimal access, justified needSSN, individual LP financial dataAD group + access log

How to classify a new dataset

  1. Does it contain personal data? → if yes, at minimum Restricted
  2. Does it contain strategies or competitive information? → Confidential
  3. Document the classification in the Data Catalog
  4. Configure appropriate access groups in Entra ID