November 4, 2010
The Ramifications of Trusted Data
Topic: Data Integration in support of Business Intelligence
The term trusted data has been bandied about a lot lately, and everyone seems to have a different definition. It's an important concept, so in this column I will define the term and consider the ramifications for business intelligence and other data- driven business processes.
For some, trusted data is an emotional matter. In the context of business intelligence (BI), they want to feel confident that data presented in reports, cubes, dashboards, and other BI products is in the best condition possible because data in good condition makes their jobs easier and their actions more accurate, timely, effective, and compliant. People who consume data through operational applications have similar concerns.
Emotions aside, the condition of trusted data is easily quantified. Namely, condition is a technical measurement of data's completeness, quality, age, schema, profile, and documentation. The assumption is that trusted data should come from carefully selected sources, be transformed in accordance with data's intended use, and be delivered in formats and time frames that are appropriate to specific consumers of reports and other manifestations of data.
Hence, the trustworthiness of data is quantified mostly by the technical properties that define its condition. You still need to be mindful of the emotional impact that data's condition has on the perceptions of people who consume the data for BI.
Why We Need Trusted Data
A lack of trusted data leads to a number of poor practices. For example, if BI data isn't trustworthy because it's not in good condition, then poor decisions are based on poor data. Whether data is in good condition or not, the mere perception that the data's not trusted can lead users to ignore supplied BI data and instead build their own BI data stores, such as rogue data marts, spreadsheets, and personal productivity databases.
Less-than-trustworthy data creates problems in operational processes, too. When data is (or is perceived to be) of poor quality or incomplete, users and managers base tactical and operational decisions on guesses.
If you can turn these problems around, then trusted data has benefits. Whether in BI or operations, people will use data they trust, which, in turn, leads to greater consistency, compliance, and accuracy in business processes based on the data.
How to Get Trusted Data
Achieving trustworthiness for data is a multi-step process:
The process and its best practices involve a mix of:
A Final Word
Giving business and technical users data they trust is key to good business intelligence. If users don't have confidence in the data of a data warehouse and other BI data stores, they may argue over the data's accuracy, refuse to use the reports and analyses fed from the data, or build their own data stores. Non-trusted data likewise hinders operational excellence, as people side-step prescribed processes and misuse applications. All these paths are non-productive and lead to faulty decision making and operational actions.
Stay tuned. In an upcoming Experts in BI newsletter, I'll drill down into the Six Cs of Trusted Data, namely how data integration and related techniques build trust by making data complete, current, consistent, clean, compliant, and collaborative.
To learn more, replay Philip Russom's recent TDWI Webinar on Trusted Data for BI online.
Philip Russom is senior manager of TDWI Research. Philip can be reached at email@example.com .
Copyright 2010. TDWI. All rights reserved.