Warning
🚧 Work in Progress: This page is currently under construction. Content may be incomplete or subject to change. To contribute, see the contribution guide.
Data Classification
Sensitivity classification model applied to the data lake.
Classification levels
| Level | Description | Examples | Controls |
|---|---|---|---|
| Public | Data that can be shared externally | Public fund information | None additional |
| Internal | Internal use only | Operational metrics, management reports | Authenticated access (Entra ID) |
| Confidential | Restricted to specific teams | Investment strategies, portco data | Specific AD group |
| Restricted | Minimal access, justified need | SSN, individual LP financial data | AD group + access log |
How to classify a new dataset
- Does it contain personal data? → if yes, at minimum Restricted
- Does it contain strategies or competitive information? → Confidential
- Document the classification in the Data Catalog
- Configure appropriate access groups in Entra ID