Mixed messages for Master Data Management

Written By:
Published:
Content Copyright © 2006 Bloor. All Rights Reserved.

I recently attended IBM’s first European user conference on MDM (master data management) in Barcelona. The event was certainly interesting but I felt that the results were mixed. For example, some users clearly felt that the conference was useful, while others were disappointed. One investment bank that I talked to felt that the content was insufficiently technical and, in particular, that it did not address either scalability issues or the complex, heterogeneous environments which most companies have to manage. Still, I suppose you can’t please everybody all of the time.

Another criticism that I heard from several attendees was that the Gartner analyst who was supposed to set a framework for the conference was speaking at the end of the first day rather than the beginning. For myself, I had rather more serious reservations about this particular presentation as I, a) disagree with his segmentation of the market, and b) did not feel that a sufficient distinction was made between MDM on the one hand and instances of MDM such as CDI (customer data integration), GSM (global supplier management) and PIM (product information management) on the other.

I will start with this second issue. To me, CDI, PIM and the like are subsets of master data management. MDM is, essentially, non domain-specific. There are two reasons why this is important. The first is that if vendors, users, pundits et al continue to treat CDI, GSM and similar applications as separate and distinct then we will end up with exactly the same siloed applications that have been the bane of IT for the last two decades.

Suppose, for example, that you want to implement a master data management solution for SLAs (service level agreements). That would involve a customer, a supplier, various services, a contract and metrics. In other words, such an application would span multiple master data domains, some of which are about people (customers, suppliers) and some of which are about things (services, the contract itself, metrics). If we make the assumption that existing CDI solutions can be easily extended to support suppliers and that PIM solutions can encompass contracts and services (there seems to be a consensus that PIM and CDI require significantly different capabilities) then what we don’t want to have to do is to build an application that has to span two different systems, putting us straight back into the EAI (enterprise application integration) stuff that we’d rather avoid.

So, given that ‘people’ and ‘things’ are distinct, what is needed is a common set of services that support as much as possible of these in a single platform, with only a very specific and minimised set of differentiated capabilities on top of the platform that relate to CDI, PIM or whatever. This platform then becomes the MDM base upon which application instances are implemented.

As far as how you break down the market is concerned, I felt that the model presented was too simplistic. In our view, there are three approaches that you might take: a registry, which is where you have a reference mapping to the various participating applications; a repository, where you extend the registry by including a mapping to the elements that make up the “best record”; and a hub, where the best record is actually held within the hub. This contrasts with the model that was presented, which only differentiated between hubs (albeit with sub-classes) and registries. A more detailed discussion of our approach is included in Harriet Fryman’s recently published MDM Report.

In general, the conference spent very little time discussing registries but the importance of registries and repositories is that they are much less disruptive (and less expensive) to implement. For example, Purisma claims a typical implementation time of just 4–6 weeks compared to the months, or even years, that you might take over a hub. The scalability issues referred to above also go away, though there may be performance issues related to using a federated as opposed to a persistent model. It is our belief that what users would really like is a way to start with a registry (or repository) and the ability to migrate, progressively, to more complex implementations. To be fair, IBM did offer sessions on precisely this theme, which is supported by the company’s products, though you might not realise that from the company’s marketing. This is a subject I will return to in the next article in this series.