关键字:数据集成;ETL;数据仓库;NHibernate实体映射
The Design of Heteromerous Data Integration Based on Web Service
Abstract
Nowadays application of information technologies in enterprises has gone through several phases. No matter from languages to deployment platforms or from communication protocol to data format and schema, systems developed in different phases are diverse from each other. So connecting those isolated data islands, realizing information sharing and communication are the required and express way to achieve strategic goal in enterprise.
The foundation of application integration is data integration, which decides whether the integration will be successful. Data integration extracts data from isomerous, conflicted and isolated sources, transforms data and then loads it into the destination. It has three steps: Integration Analysis, Data Analysis, and Data Transference.
The design uses Hunan University Student MIS and Dormitory MIS as samples, analyzes data resources and their connection, produces the common data model and designs data warehouse. The Extract-Transform-Load (ETL) process is applied into data integration and an ETL tool is customized; it has following features: record and display the importing status, import and export status file, read status files and continue what have not been finished during last importing. The whole integration process must make full use of source applications and be extensible to potential requirements in future, so here DW uses NHibernate OR-Mapping to get rid of the confliction of data fields between DW and UI layer and provides data management to resource applications by web services, finally this data consumption process reaches the goal to update data in real time and achieve data consistence in all sources.
Key Words: Data Integration; ETL; Data Warehouse; NHibernate OR-Mapping
The Table of contents
1. Introduction 1
1.1 Thesis background 1
1.2 The research actualities between the domestic and abroad 2
1.3 The course points 3
1.4 The dissertation structure and research contents 3
2. Integration analysis and data analysis 5
2.1 Integration analysis 5
2.1.1 Overview 5
2.1.2 Integration requirement analysis 5
2.1.3 Integration design 7
2.2 Data analysis 8
2.2.1 Overview 8
2.2.2 Database design of source applications 9
2.2.3 Design of data warehouse 11
3. ETL 13
3.1 Overview 13
3.1.1 Concepts of ETL 13
3.1.2 Challenges facing to ETL 14
3.1.3 Related technologies about ETL 16
3.2 Data integration tool 16
3.2.1 Features of data integration tool 16
3.2.2 Design of data integration tool 17
3.2.3 Implementation 17
4. Implementation of data consumption 21
4.1 Requirements of data consumption 21
4.2 Architecture of integrated application 21
4.3 NHibernate OR mapping 22
4.3.1 Introduction to NHibernate 22
4.3.2 Application of NHibernate 23
4.4 Works presentation 25
4.5 Conclusion and prospect 26
Acknowledgments 28
References 29