Testing Data Vault-based Data Warehouse < 4K — FHD >
Run the same load twice. The system should recognize the data exists and result in zero new inserts.
Test the performance of PIT tables; they should significantly speed up the joining of multiple Satellites.
You should be able to trace any record in a Star Schema back to its specific Hub and Source system. Testing Data Vault-Based Data Warehouse
Validate that business rules applied in the Business Vault match the requirements.
Since Data Vault is pattern-based, the first layer of testing ensures the "plumbing" follows the methodology. Run the same load twice
Intentionally feed "bad" data into the pipeline to ensure it is caught by error Satellites rather than crashing the load. Success Metrics (KPIs)
Run row counts and sum-checks between the Source Landing Zone and the Raw Vault. You should be able to trace any record
Verify that the hashing algorithm (MD5 or SHA-1) produces identical keys for the same business keys across different source systems. 4. Point-in-Time (PIT) & Bridge Table Validation