View online: tdwi.org/flashpoint
|
|||||
September 1, 2011 |
ANNOUNCEMENTS Submissions for the next Business Intelligence Journal are due December 2. Submission guidelines New TDWI Checklist Reports: Data Discovery and Unified Information Access and Integrating Data Governance into Your Operational Processes CONTENTS
|
||||
Data Modeling: Agile or Fragile Len Silverston |
|||||
Topic:
Data Modeling How can we create truly “agile” data models instead of fragile ones? What does “agile” mean, anyway? According to the Agile Manifesto, there are 12 principles for agile software development, with a focus on delivering quickly and providing adaptability for changing requirements. Merriam-Webster's definition of agile is “marked by ready ability to move with quick easy grace.” If something agile is characterized by speed and adaptability, and something fragile is easily broken, we must develop data models quickly, with flexibility, and in a way that prevents breakdown over time. Developing Data Models Quickly While template data models save time, we cannot completely circumvent data modeling; we must understand information requirements and skillfully apply and customize template data models when appropriate, and/or consciously choose alternative model structures. One of the greatest pitfalls in data modeling is thinking that there is only one correct way to model something. Dr. Graeme Simsion points out in his book, Data Modeling Theory and Practice, that the same information requirements can be modeled in many different ways. Because of this, we can model more quickly if we are already aware of ways to model common constructs such as categorizations, hierarchies, roles, statuses, contact information, and so on. We refer to these common constructs and options for modeling as Universal Patterns and distinguish them from reusable or Universal Data Models, which are models that are put together (often using Universal Patterns). Adaptability and Flexibility For example, say we need to quickly provide a customer contact data model for an agile effort. We develop a data model that simply shows the e-mail address, phone number, and postal address for a customer entity (or class), which seems relatively straightforward. Then we realize that we are maintaining numerous additional entities (e.g., supplier, partner, employee) with many attributes for each type of contact information (such as phone number or e-mail address). We find that we have added attributes haphazardly, such as additional contact numbers for Facebook, Twitter, Skype, secondary e-mail, and so on. Furthermore, each piece of contact information may have other associated data, such as the purpose of the e-mail or a do-not-solicit indicator. Eventually, we end up with a fragile data model that we continually patch where there are many different semantics, rules, and edits. How can we create more adaptability and flexibility? The more a model is generalized, the more adaptable and flexible it is. In the contact data model example, we could generalize the model and define entities for postal address, e-mail address, and phone number that flexibly maintain all different types of information--instead of storing contact information as separate attributes. We could also further generalize the models and maintain a “contact mechanism” entity that stores all types of contact information, thus providing flexibility to add any new types of contact mechanisms--even ones that are not yet invented! Why not model with the highest level of generalization? There are trade-offs; the more generalized the model, the more abstract and difficult it is to understand. Thus, it is important to find a good middle path when evaluating the level of generalization. Conclusion Len Silverston is a best-selling author, consultant, speaker, and expert in data management, data modeling, and human dynamics. He is the author of The Data Model Resource Book series and is a winner of the prestigious DAMA International Professional Achievement Award. He is a TDWI instructor and will be co-teaching Mastering BI with Best-Practice Architectures and Data Models: From Hub and Spoke to Agile Development with Claudia Imhoff on November 1, 2011, at the TDWI World Conference in Orlando. References Silverston, Len [2001]. The Data Model Resource Book, Revised Edition, Volume 1: A Library of Universal Data Models for All Enterprises, Wiley Computer Publishing. Silverston, Len [2001]. The Data Model Resource Book, Revised Edition, Volume 2: A Library of Universal Data Models for Specific Industries, Wiley Computer Publishing. Silverston, Len and Paul Agnew [2009]. The Data Model Resource Book, Volume 3: Universal Patterns for Data Modeling, Wiley Computer Publishing. Simsion, Graeme [2007]. Data Modeling Theory and Practice, Technics Publications, LLC.
Staffing Operational Data Integration Practices To get a handle on the groups staffing OpDI today, TDWI Research asked: “For DI outside data warehousing, who does the work most often?” (See Figure 7.) Responses show that OpDI is performed by a range of consultants and organizational units. Source: Operational Data Integration: A New Frontier for Data Management (TDWI Best Practices Report, Q2 2009). Click here to access the report.
Mistake:
Prematurely Launching a Council We see it all the time: An earnest visionary perceives the need for data governance. She decides that it’s time for formal consensus and enlists an executive to support her effort to convene a “council” of data stakeholders. An e-mail invitation goes out for the council meeting. It’s a lunchtime event--“Please make your selection from the attached menu”--and nearly every invitee shows up. The visionary begins discussing how data is an asset and that the company needs to begin managing it as such. Heads nod. The visionary goes on to explain that the newly formed council should meet regularly and discuss the company’s prevailing data issues and address any open problems. The follow-up meeting is scheduled. Fewer people show up to the next meeting. Someone complains that the company has never really defined the term “customer.” Someone else pipes up about bad data on the billing system. A sidebar conversation starts on CRM consolidation. The next meeting never happens. In this all-too-common example, data governance isn’t overtly cancelled. It simply fizzles like a damp firecracker. The problem? See Mistake Two. Well-meaning people who see the need will gravitate toward the “who” conversation (Who should be on the council? Who will sponsor it?) before understanding the “what” and the “how.” Until a core team of stakeholders deliberately designs a data governance framework (including guiding principles, decision rights, and the appropriate governing bodies), no sanctioned, cross-functional council will have either the clarity or the mission to affect change. There is no free lunch. Source: Ten Mistakes to Avoid When Launching a Data Governance Program (Q1 2008). Click here to access the publication. |
|
||||
EDUCATION & RESEARCH TDWI Seminar: TDWI Seminar: TDWI World Conference: |
WEBINARS Lifecycle Stages for Data Governance: Plan Ahead for Success and Sustainability Business Intelligence Solutions for SAP Operationalizing Information Governance for Business Process Success |
MARKETPLACE TDWI Solutions Gateway TDWI White Paper Library TDWI White Paper Library |
MANAGE YOUR TDWI PREMIUM MEMBERSHIP Renew your Premium Membership by: [-ENDDATE-] Renew | FAQ | Edit Your Profile | Contact Us
|
||
Copyright 2011. TDWI. All rights reserved. |