Databricks and Select Star provide a powerful solution for automated data discovery on your Data Lake. Select Star integrates with your Databricks Delta Lake to uncover deep contextual insights in your data relationships, helping you democratize data access across teams. The platform also leverages query logs to provide insight into how data is joined.
Select Star provides automated, column-level lineage for all your data in Databricks data lake.You can explore upstream and downstream dependencies to better understand the impact of making changes to your data, all the way down to your dashboards and reports.
Popularity and usage metrics help users sift through the noise in large-scale data lake houses by surfacing the most current and relevant tables and columns based on query history and access. See which tables and columns are the most accessed and joined to understand how to better interpret the data
Select Star shows you the most popular joins run on your data lakehouse environment by leveraging the metadata in the SQL logs. Connect your Databricks environment to Select Star and see full ER-Diagrams and full join histories.