Category: Data Warehouse

  • Open source tools for Data Lake

    A Data Lake is also known as a Modern Data Warehouse (MDW): a place where we consolidate data streams and process them later. The MDW space is changing very fast, and plenty of open source tools can be used in the architecture. The diagram below shows a few of these open source tools.

  • Process Diagram for Data Warehouse Project

  • How to design Data Staging layer?

    Overview: A data warehousing solution usually takes a two- to three-layer approach. One of these layers is called the “staging layer”. Data from various sources is staged here temporarily until it is processed and transformed into the data warehouse. Data in this layer can be relational in nature. Different data flows of data staging: Pull…