Figure 1 – The 5-step process to data quality and observability
At its heart is a data catalog and business glossary that documents the data assets of an organisation. Data can be classified and profiled, and business ownership can be assigned to the various data assets. Policies can be defined and enforced, so for example, workflows can be defined to check the authorisation of changes to a particular business hierarchy. Collibra has visual tools to assist users in navigating their data landscape, including a comprehensive data lineage solution that tracks the flow of data and can be used for impact analysis. This is the company’s own intellectual property, so they no longer rely, as they once did, on a partnership with a third-party data lineage specialist. There is also a “knowledge graph” to help visualise the connections between data assets as well as between data and other governance assets such as policies.
There is a search capability to allow data assets to be discovered, and this also has a natural language interface. Collibra has a reputation for having an interface that is digestible by business users, and this will go further in the next version with a comprehensive set of changes, including the ability to tailor the interface to specific data assets as well as different business roles. Collibra extends its capability to cover data access and security and overlaps to a degree with special security governance products.
Collibra’s platform leverages artificial intelligence in several ways, some of which appeared in their Q2 2024 release. This included the generation of content for data asset description as well as suggesting possible description data duplication. The vendor (correctly) believes that although generative AI can be very productive, it still needs human review for generated content, as AI content can sometimes be overly imaginative (the so-called “hallucination” problem inherent in large language models). Collibra can be used to manage AI models just like other data assets and has a comprehensive AI Governance application so customers can benefit from a unified data and AI governance solution. Earlier this year, Collibra introduced AI governance, another key pillar of their platform, that enables organizations to deliver trusted AI safely and effectively.
Collibra has extensive partnerships, so for example can exchange metadata with tools such as Microsoft Purview and SAP. Interestingly, there is what seems to be a quite productive partnership with SAP and Collibra provides their joint customers end-to-end data governance across the entire data landscape including both SAP and non-SAP data. Additional bridges exist for products like Databricks and Snowflake, with over a hundred connectors provided. Collibra aims to meet users where they already are and will extend this approach with integrations to productivity tools like Slack and Teams also, enabling for example alerts in Collibra to be visible in Slack. This is the company’s own intellectual property, so they no longer rely, as they once did, on a partnership with a third-party data lineage specialist.
Customer Quotes
“This is probably the easiest integration I’ve ever seen happen between two tools.”
Marla Dans, Head of Data Management and Governance, Chicago Trading Company
“Using the Collibra Data Quality platform we provide users assurance that insights have been created in roc solid, 100% clean data, ensuring credibility and the ability for our data stewards to act on degradation in the KPI.”
Jim Williamson, VP of Data Analytics North America. Element Fleet Management