# Chp 4 Data Warehousing and Online Analytical Processing (跳過、未完成) ###### tags: `Data Mining 心得` ## Basic Concepts ### Def >A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile **collection of data** in support of **management’s decision making process** >[name=William H. Inmon] ### Warehouse Features * Subject-oriented 1. organized around major subjects 2. focuses on the modeling and analysis of data for decision makers. 3. exclude useless data * Integrated * Time-variant provide information from an historic perspective * Nonvolatile >[name=Book] semantically consistent data store that serves as a physical implementation of a decision support data model ### Differences between Operational Database Systems and Data Warehouses * **online transaction processing (OLTP**) systems. The major task of online operational database systems is to perform online transaction and query processing * **online analytical processing (OLAP)** systems organize and present data in various formats in order to accommodate the diverse needs of different users. ![](https://i.imgur.com/1JvD5FI.png) ### Multitiered Architecture usually 3-tier ![](https://i.imgur.com/YeSBYUY.png =80%x) ### Models: Enterprise Warehouse, Data Mart, and Virtual Warehouse #### Enterprise Warehouse ### Functions #### Data extraction ### Metadata Repository ## Modeling: Data Cube and OLAP ### Data Cube: A Multidimensional Data Model ### Stars, Snowflakes, and Fact Constellations: Schemas for Multidimensional Data Models ### Dimensions: The Role of Concept Hierarchies ### Measures: Their Categorization and Computation ### Typical OLAP Operations ### A Starnet Query Model for Querying Multidimensional Databases ## Data Warehouse Design and Usage ### A Business Analysis Framework for Data Warehouse Design ### Data Warehouse Design Process ### Data Warehouse Usage for Information Processing ### From Online Analytical Processing to Multidimensional Data Mining ## Data Warehouse Implementation ### Efficient Data Cube Computation: An Overview ### Indexing OLAP Data: Bitmap Index and Join Index ### Efficient Processing of OLAP Queries ### OLAP Server Architectures: ROLAP versus MOLAP versus HOLAP ## Data Generalization by Attribute-Oriented Induction ### Attribute-Oriented Induction for Data Characterization ### Efficient Implementation of Attribute-Oriented Induction ### Attribute-Oriented Induction for Class Comparisons