Start with the measurements again - one at a time. Use the ECCD approach (extraction, cleaning, reconciliation, delivery).
Be sure that in each dimension there is a BusinessKey that uniquely identifies the βobjectβ that describes the dimension string β for example, email for a person.
With the dimensions loaded, prepare the key search pipeline. In general, for each dimension table, you can prepare a key lookup table (BusinessKey, PrimaryKey). Some designers prefer to look directly at the size table, but key searches can often be easily cached into memory, which leads to faster loading of facts.
Use ECCD for these facts. The ECC part takes place in the intermediate area, you can select (auxiliary) tables or flat files for each ECC step, as you prefer.
When delivering fact tables, replace each BusinessKey in the fact line with the corresponding PrimaryKey, which you will get from the key lookup table. After all BusinessKeys are replaced with the corresponding PrimaryKeys, insert a row in the fact table.
Do not waste time using the ETL tool. You can download the Pentaho Kettle (Community Edition) for free β it has everything you need to achieve this.
source share