Tag Archives: pdi

Graph Databases: Loading Data with Neo4j

Graph databases are becoming more popular as a way of storing and analysing large connected datasets. Neo4j is a popular graph DBMS because of its powerful query language, Cypher, its growing community, and its excellent supporting tools. A new paradigm comes with a new set of challenges. In this case we are focused on the […]

Date Dimension Revisited: Bank Holidays

Everyone familiar with data warehousing knows about the date dimension. It’s one of the first steps in the creation of an ETL and exists in almost every data warehouse. Despite how ubiquitous it is, many still fall into some common pitfalls, such as: weeks shouldn’t be children of months; missing or wrong week of year […]