Data warehousing

Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

Clean data is made accessible using data engineering. It comprises building an efficient data architecture, streamlining data processing, and maintaining large-scale data systems using Shell (CLI), SQL, and Python/Scala. You will learn all the concepts of data warehousing and ETL.

What Will You Learn?

  • In this course, you will learn about
  • Evolution of data warehousing
  • Data warehouse architecture
  • What is ETL and what is the framework of ETL
  • How to perform Data extraction
  • How to perform Data transformation
  • Data warehouse modelling
  • BI concepts in data warehousing
  • Relational data modelling
  • Identifying facts and Dimensions
  • Physical design considerations
  • Dimension Hierarchies
  • In the end, you will be able to
  • You will be able to understand all data warehousing concepts and design a data warehouse that meets business needs

Course Content

Data warehousing

  • Data Warehouse – Introduction, Evolution of Data Warehousing, Data marts Vs Data Warehouses, Operational Data Stores, Warehouse components, Data Warehouse Architecture, Data Staging
    00:00
  • Data Warehouse – ETL, ETL – Definition, Framework of ETL, Data Extraction –Overview, Extraction Methods, Data Transformation Activities, Aggregation Techniques, Data Loading, Types of Load, Incremental Load, CDC, ETL Process Flow
    00:00
  • Data Warehouse Modeling: Schemas, Data Aggregation, Data Explosion, Dimensional modeling, Star and Snowflake Schema, Transactions and Snapshot Schemas, Staging Area Modeling Views of Bill Inmon and Ralph Kimbal, Data Warehouse – Design factors, Accessing Data Warehouse DW, Modeling techniques Requirement, Modeling & Analysis Modeling, Tool selection
    00:00
  • Business Intelligence Concepts: Meta Data, Accessing Data Warehouse, OLAP, Types of OLAP, OLTP vs. OLAP, BI Tools, Dashboards, Scorecards, Security, B. Dimensional Modeling Concepts
    00:00
  • Introduction to Data Model: History of Database, Database Fundamentals, DBMS & RDBMS, Codd’s rules, Data Modeling Terms and Concepts, Types of Data Model, ER Modeling, Entities and Attributes, Entity Relationships
    00:00
  • Relational Data Modeling: Keys and Referential, Integrity Relational, Database Design, Normalization, Dimensional Model Design Life Cycle
    00:00
  • Identify Business process requirements: Create and Study the enterprise business process list, Identify business process, Identify high level entities and measures for conformance, Identify data sources, Select requirements gathering approach, Requirements gathering, Requirements analysis
    00:00
  • Identify the grain: Fact table, Granularity, Multiple, Separate grains, Fact table types, Check grain atomicity, High level dimensions and facts from grain, Final output of the identify the grain phase
    00:00
  • Identify the Dimensions: Dimensions, Degenerate dimensions, Conformed dimensions, Dimensional attributes and hierarchies, Date and time granularity, Slowly changing dimensions, Fast changing dimensions, Cases for snow flaking, Cases for snow flaking
    00:00
  • Identify the Facts: Facts, Conformed facts, Fact types, Additive, Semi additive, Non additive, Factless fact table, Composite key design, Fact table sizing and growth
    00:00
  • Physical Design Considerations: Aggregations, Aggregate navigation, Indexing, Partitioning
    00:00
  • Metadata management: Identifying the meta data, Type of Dimensions, Conformed Dimensions, Time Dimensions, Role playing Dimensions, Parent-Child Dimensions, Junk Dimension, Fact Dimension
    00:00
  • Slowly Changing Dimension Techniques: Type 1, 2 and 3, Attribute Types, Surrogate Key
    00:00
  • Dimension Hierarchies: Attributes, Levels, Members, Balanced Hierarchy, Unbalanced Hierarchy, Ragged Hierarchy
    00:00