The core component of TopBraid EDG is the knowledge graph, which is used to represent the data and metadata assets you hold within a particular knowledge domain. It also captures the relationships between those assets, and allows you to make inferences based on these relationships. In the same way, you can apply business rules that take advantage of this relationship structure. Multiple graphs (and thus multiple domains) can be connected together, and they can self-compose. You can also connect to a public knowledge graph and use its information to enrich your own.
Fig 01 - An example LineageGram in TopBraid EDG
The knowledge contained in your graph (or graphs) is exposed via the platform itself, allowing you to view, edit and govern your knowledge assets. The relationships between your assets are particularly emphasised: for example, asset views provide hyperlinks to all related assets. Governance assets, such as personalised dashboards, metrics, policies and so on are also included as part of these knowledge assets. Workflows are supported, and several workflow templates come pre-installed. Visualisations are available in the form of the LineageGram, an interactive lineage view (shown in Figure 1) that includes physical and logical data flows, and the NeighborGram, a graphical view of the relationships between your business terms and data assets (Figure 2). Search access is provided to all assets, as is a full audit trail, and the product boasts a number of features that promote ease of use and collaboration, including a personalisable, drag and drop, panel-based UI, an asset basket, the ability to create and share asset lists, commenting and tasking.
Fig 02 - An example NeighborGram in TopBraid EDG
Assets are organised into asset collections, over 300 of which may be provided out of the box depending on the packages you have purchased. You can customise the structure of your collections via ontologies, and, with the Data Graphs add-on module, create your own types of asset collection. The provided ontology editing and management capabilities even allow you to manually govern the knowledge graphs that underpin the platform itself, offering extreme flexibility and customisation. Role management and role-based views are provided, and can be configured separately for individual assets, groups of asset collections (‘governance areas’), and for the platform as a whole.
The product also provides substantial automation capabilities, including scheduled importing and automated cataloguing; automatic discovery of information and connections (such as dependencies) present in your data; the automated extraction of business rules implicit in your data using machine learning; and the use of business rules to infer connections between your business terms and your data assets. Data quality can be checked automatically using your business rules, and the product will suggest fixes for each failed check. Moreover, the Tagger and AutoClassifier add-on module will automatically tag and classify your data using machine learning. When the module is properly trained, this includes unstructured data (documents, for example), thus enabling integrated governance of both structured and unstructured information.
Finally, TopBraid EDG provides API support across a wide variety of integration points. This includes SPARQL and GraphQL endpoints, with the latter in particular enabling complete retrieval of model structure, write access via mutation, and support for introspective querying. Moreover, TopBraid EDG will automatically generate your GraphQL schemas, and also leverages SHACL in conjunction with GraphQL to create what TopQuadrant describes as “semantic GraphQL”. In terms of format support, the product will import data in a range of formats, including spreadsheets, RDF, XML/XSD, JSON, RDBMS and SQL DLL, and various system specific outputs. Likewise, it can export assets in several formats, including HTML, RDF, JSON, XML, and SharePoint.