Page 218 - Data Architecture
P. 218
Chapter 6.1: Introduction to Data Vault 2.0
What Is Data Vault 1.0?
Data Vault 1.0 (DV1) is highly focused on the data vault modeling components and
relational database technology. A DV1 data model attaches surrogate sequence keys as
its primary key selection for each of the entity types. Unfortunately, surrogate sequences
exhibit the following problems:
• Introduce dependencies on the ETL/ELT loading paradigm
• Contain an upper bound/upper limit, when reached can cause issues
• Are meaningless numbers (mean absolutely nothing to the business)
• Cause performance problems (due to dependencies) on load of big data sets
• Reduce parallelism (again due to dependencies) of loading processes
• Cannot be utilized as MPP partition keys for data placement, to do so would potentially cause hot
spots in the MPP platform
• Cannot be reliably rebuilt or reassigned (reattached to old values) during recovery loads
• Are disparate across multiple source applications that are housing the same data sets
218