Share via


Custom metadata (preview) in Unified Catalog

Custom metadata plays a crucial role in data governance by providing you with the flexibility to add user-defined context to describe and organize your data. In Microsoft Purview Unified Catalog, you can define and apply organization specific business attributes to your data assets and business concepts that foster better organization, accessibility, and utilization of your data. The benefits of using custom metadata include:

  • Enhanced data discovery: Custom metadata simplifies the search and retrieval of data by allowing users to apply specific tags, or attributes, that make datasets easier to locate.

  • Help with compliance: Organizations can use custom attributes to track regulatory requirements or flag sensitive data to help ensure adherence to legal and industry standards.

  • Streamlined communication: Custom metadata facilitates better collaboration among teams by providing clear descriptions and classifications of data assets.

  • Optimized data utilization: By categorizing data assets effectively, organizations can prioritize the use of high-value data for analytics and decision-making.

Custom metadata usage scenarios

Here are some examples of how you can use custom metadata to help achieve better data governance and compliance:

Example 1: Healthcare industry

A hospital's data catalog includes patient records, medical research, and operational data. By implementing custom metadata, administrators add attributes such as "Patient Age Group," "Disease Category," and "Research Priority Level" to the datasets. This enables the hospital to segment data effectively for clinical studies, prioritize urgent cases, and comply with healthcare regulations like HIPAA.

Example 2: Financial services

A financial institution uses custom metadata to govern its risk assessment data. Attributes like "Risk Rating," "Investment Sector," and "Regulatory Compliance Status" help ensure that analysts can quickly identify high-risk investments, provide insights into sector-specific trends, and maintain legal compliance with financial regulations.

Example 3: Academic institutions

Universities use custom metadata to manage their research databases. Attributes like "Research Field," "Funding Source," and "Publication Status" enable academic departments to track progress, allocate resources effectively, and collaborate across institutions.

Example 4: AI and compliance

Organizations can include AI and compliance related attributes to define various data product types. AI-related attributes might cover aspects such as algorithm transparency, explainability, and model performance metrics, while compliance attributes might address regulatory requirements, data privacy, and ethical considerations. By incorporating these factors, organizations can ensure their data products are robust, compliant with laws and standards, and aligned with responsible AI practices.

Example 5: Tracking translations

Language can be a pivotal attribute to describe and discover glossary terms in different languages. By incorporating multilingual capabilities into metadata attributes, organizations can ensure seamless communication and understanding across diverse linguistic groups. This approach not only facilitates global collaboration but also enhances accessibility, enabling users to interact with data in their preferred language while maintaining consistency and clarity in terminology usage.

Types of custom metadata: attributes

There are currently two types of custom metadata:

  1. Business concept attributes: Attributes created and applied to business concepts (governance domains, data products, critical data elements, glossary terms, and objectives and key results).
  2. Data asset attributes: Attributes created and applied to data assets.

You can find, create, and manage your custom metadata at Unified Catalog > Catalog management > Custom metadata (preview).

Each type of attribute is listed on its tab on the Custom metadata (preview) page. The attribute list on each tab is organized by attribute group, and each attribute is nested within its group.

Get started