Issue with Unity Catalog Lineage Not Appearing in Microsoft Purview

Sooriya E 10 Reputation points
2025-08-01T08:26:33.62+00:00

Hi Team,

We have data in bronze, silver and export container of ADLS gen2. We are orchestrating this data transfer using ADF. Source to bronze happens through a copy path job in ADF. The bronze to silver and silver to export transformations each occur in a databricks notebook. We have mounted the silver data in databricks unity catalog and we are able to see the notebook transformation lineage in unity catalog. We are also integrating with Microsoft Purview. Purview currently scans both ADLS and Databricks Unity catalog. It is also connected to the ADF pipeline. 

 

The problem we are facing is that we are not able to collect the lineage information from Unity catalog into Microsoft Purview. we have followed online documentation and enabled the Purview scan with Lineage extraction on. And we have enabled system tables in Unity catalog and assigned both the Purview MSI as well as the users to select permissions on the system.access column lineage and table lineage tables. We also tried providing data reader role both to purview and the user initiating the scan on both the system and the data catalog in Databricks UC. 

 

Please guide us on how we can see lineage in Purview

Thanks,
Sooriya E

Microsoft Security | Microsoft Purview
{count} votes

2 answers

Sort by: Most helpful
  1. Venkat Reddy Navari 5,330 Reputation points Microsoft External Staff Moderator
    2025-08-01T16:26:21.4933333+00:00

    Hi Sooriya E Thanks for the detailed info, from what you’ve described, it sounds like you’re pretty close to having everything configured correctly.

    Here are a few key things to double-check that usually cause this kind of issue with Unity Catalog lineage not showing up in Microsoft Purview:

    System Tables and Runtime Version Make sure your Databricks workspace is running on Databricks Runtime 13.2 or later because that’s when system tables and lineage capture fully kicked in. Also confirm the system tables like system.access.table_lineage and system.access.column_lineage are enabled.

    Permissions for Purview the Purview Managed Identity (or service principal) that runs the scan needs at least SELECT permissions on those system tables, plus USE CATALOG permission on the Unity Catalog. Without those, Purview can’t read the lineage metadata.

    Unity Catalog–Managed Tables Only Lineage only works for tables registered inside Unity Catalog if you’re writing data to mounted ADLS paths or unmanaged files, lineage won’t be tracked. Make sure your silver and export datasets are registered Delta tables within Unity Catalog.

    Purview Scan Settings Verify that your Purview scan is running a full scan with lineage extraction enabled for your Databricks source. Sometimes lineage doesn’t appear until after a full scan completes.

    Validate Lineage Inside Databricks Try running a query like this in Databricks to see if lineage records actually exist for your tables:

    
    SELECT * FROM system.access.table_lineage WHERE target_table_full_name = 'catalog.schema.table_name';
    

    If you don’t get any results here, the problem is on the Databricks side (lineage not being captured) rather than Purview.

    Finally:

    • Transformations run using Pandas or Python UDFs might not generate lineage.
    • Make sure there aren’t any network or firewall rules blocking Purview’s access to your Databricks workspace.
    • If your transformations are complex or you need more robust lineage, you could consider using the Purview ADB Lineage Solution Accelerator which uses OpenLineage standards for richer tracking.

    Hope this helps. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.


  2. Sooriya E 10 Reputation points
    2025-08-06T09:53:20.8+00:00

    Hi Venkat,

    We have enabled System.access.table_lineage and column_lineage. I am able to view those within the lineage tab of Unity catalog in Databricks. I am also able to query the system.access.table_lineage and column_lineage tables in the sql editor of Databricks. The Purview managed identity has been given data reader permission so it can select, browse and use catalog permissions are included in it. Can we please schedule a call so that we can resolve this ASAP.

    Thanks,

    Sooriya E

     


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.