Enterprise Information Integration
|
|
The neutrality of this article is disputed. Please see the discussion on the talk page. Please do not remove this message until the dispute is resolved. (February 2011) |
Enterprise Information Integration (EII), is a process of information integration, using data abstraction to provide a unified interface (known as uniform data access) for viewing all the data within an organization, and a single set of structures and naming conventions (known as uniform information representation) to represent this data; the goal of EII is to get a large set of heterogeneous data sources to appear to a user or system as a single, homogeneous data source.
Contents |
[edit] Overview
Data within an enterprise can be stored in various formats, including relational databases (which themselves come in a large number of varieties), text files, XML files, spreadsheets and a variety of proprietary storage methods, each with their own indexing and data access methods.
Standardized data access APIs have emerged, that offer a specific set of commands to retrieve and modify data from a generic data source. Many applications exist that implement these APIs' commands across various data sources, most notably relational databases. Such APIs include ODBC, JDBC, OLE DB, and more recently ADO.NET.
There are also standard formats for representing data within a file, that are very important to information integration. The best-known of these is XML, which has emerged as a standard universal representation format. There are also more specific XML "grammars" defined for specific types of data, such as Geography Markup Language for expressing geographical features, and Directory Service Markup Language, for holding directory-style information. In addition, non-XML standard formats exist, such as iCalendar, for representing calendar information, and vCard, for business card information.
Enterprise Information Integration (EII) applies data integration commercially. Despite the theoretical problems described above, the private sector shows more concern with the problems of data integration as a viable product.[1] EII emphasizes neither on correctness nor tractability, but speed and simplicity. An EII industry has emerged, but many professionals[who?] believe it does not perform to its full potential. Practitioners cite the following major issues which EII must address for the industry to become mature:[citation needed]
- simplicity of understanding
- Answering queries with views arouses interest from a theoretical standpoint, but difficulties in understanding how to incorporate it as an "enterprise solution".[citation needed] Some developers[who?] believe it should be merged with EAI. Others[who?] believe it should be incorporated with ETL systems, citing customers' confusion over the differences between the two services.[citation needed]
- simplicity of deployment
- Even if recognized as a solution to a problem, EII as of 2009[update] currently takes time to apply and offers complexities in deployment. People have proposed a variety of schema-less solutions such as "Lean Middleware",[2] but ease-of-use and speed of employment appear inversely proportional to the generality of such systems.[citation needed] Others[who?] cite the need for standard data interfaces to speed and simplify the integration process in practice.
- handling higher-order information
- Analysts experience difficulty — even with a functioning information integration system — in determining whether the sources in the database will satisfy a given application. Answering these kinds of questions about a set of repositories requires semantic information like metadata and/or ontologies. The few commercial tools[which?] that leverage this information remain in their infancy.
[edit] Applications
EII products enable loose coupling between homogeneous-data consuming client applications and services and heterogeneous-data stores. Such client applications and services include Desktop Productivity Tools (spreadsheets, word processors, presentation software, etc.), Development Environments and Frameworks (Java EE, .NET, Mono, SOAP or RESTful Web services, etc.), business intelligence (BI), business activity monitoring (BAM) software, enterprise resource planning (ERP), Customer Relationship Management (CRM), Business Process Management (BPM and/or BPEL) Software, and web content management (CMS).
[edit] Data access technologies
- ADO.NET
- JDBC
- ODBC
- OLE DB
- XQuery
- Service Data Objects (SDO) for Java, C++ and .Net clients and any type of data source
[edit] See also
- Business Intelligence 2.0 (BI 2.0)
- Data access
- Data integration
- Data virtualization
- Data Warehouse
- Enterprise Application Integration
- Enterprise integration
- Federated database system
- Resource Description Framework
- Semantic integration
- Semantic Web
- Web 2.0
- Web services
[edit] References
- ^ Alon Y. Halevy et al. (2005). "Enterprise information integration: successes, challenges and controversies". SIGMOD 2005. pp. 778–787. http://www.cs.washington.edu/homes/alon/files/eiisigmod05.pdf.
- ^ David A. Maluf et al. (2005). "Lean middleware". SIGMOD 2005. pp. 788–791. http://portal.acm.org/citation.cfm?id=1066157.1066247&coll=portal&dl=ACM&type=series&idx=1066157&part=Proceedings&WantType=Proceedings&title=International%20Conference%20on%20Management%20of%20Data&CFID=15151515&CFTOKEN=6184618.