Metadata publishing

Metadata publishing

Metadata publishing is the process of making metadata data elements available to external users, both people and machines using a formal review process and a commitment to change control processes.

Metadata publishing is the foundation upon which advanced distributed computing functions are being built. But like building foundations, care must be taken in metadata publishing systems to ensure the structural integrity of the systems built on top of them.

Contents

Definition of metadata publishing

Published metadata has the following characteristics:

  1. Metadata structures available to the general public on a public web site or by a download
  2. There is a documented review and approval process for adding or updating data elements to the system
  3. New releases are made available without disturbing prior versions
  4. A publishing organization that makes a commitment to change control process

Benefits of metadata publishing

When classifying benefits of metadata publishing two groups are usually considered. External parties are usually consumers of information that are not part of the publishing organization. Internal parties are usually the various business units or departments within an organization.

Benefits to external parties

  1. Allows external systems (both people and agents) to have a clear understanding of the semantics of data elements in a system
  2. Allows third parties to build semantic maps between data models and import and export data between systems
  3. Promotes service oriented architectures and allow horizontal sharing of information between traditional information silos
  4. Allows systems to participate in accurately indexed and federated search processes

Benefits to internal parties

  1. allows parties from diverse business units to agree on shared data definitions and separate department or function specific definitions
  2. makes Extract, transform, load (ETL) operations more precise for data warehousing
  3. allows user interface designers to access a common pool of screen and report header labels
  4. promotion of model-driven architecture

Objections to metadata publishing

  • Organizations that publish their metadata could make it easier for unauthorized people to find sensitive data if they breach an organization's firewall
  • Vendors that publish their metadata risk customers creating tools that could allow their customers to export their data from computer systems therefor making it easier to migrate off of a vendor's system

Core process in metadata publishing

The following are some of the core processes in metadata publishing

  1. Gathering of metadata requirements
  2. Selection of metadata registry and metadata publishing tools
  3. Training of metadata concepts to project participants
  4. Stakeholder group formation
  5. Metadata harvesting
  6. Glossary consolidation
  7. Initial upper ontology construction (abstract data elements)
  8. Draft data element loading
  9. Data element review process
  10. Publishing approved metadata elements in a variety of output formats (see below)
  11. Creation and maintenance of versions and depreciation of unused or redundant data elements

File format metadata publishing

Organizations that create applications that store data in file systems can also publish metadata definitions. One common way to perform this is to store application data in a compressed XML file format. The XML files can be uncompressed and validated against an external XML Schema. An example of this is done by the Open Source FreeMind tool.

Metadata publishing formats

  1. HTML - used for browsing a web site and indexing by text-based search engines
  2. Web Ontology Language (OWL) - used by metadata search engines such as Swoogle
  3. XML Metadata Interchange (XMI) - OMG standard for exchanging metadata
  4. Common Warehouse Metamodel (CMW) - OMG standard for data warehouse metadata
  5. Topic maps - an ISO standard for the representation and interchange of knowledge, with an emphasis on the findability of information.
  6. KM3 or Kernel Meta Meta Model as used in the Metamodel Zoos. The AtlanticZoo is an open source library of more than 100 metamodels under EPL License. KM3 is a simple Domain Specific Language for specifying metamodels. A number of transformations are available to translate from KM3 to other notations like XMI.

See also

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Metadata registry — A metadata registry is a central location in an organization where metadata definitions are stored and maintained in a controlled method. Contents 1 Use of Metadata Registries 2 Common characteristics of a metadata registry 3 Clear separatio …   Wikipedia

  • Metadata — For the page on metadata about Wikipedia, see Wikipedia:Metadata. The term metadata is an ambiguous term which is used for two fundamentally different concepts (types). Although the expression data about data is often used, it does not apply to… …   Wikipedia

  • Metadata standards — are requirements which are intended to establish a common understanding of the meaning or semantics of the data, to ensure correct and proper use and interpretation of the data by its owners and users. To achieve this common understanding, a… …   Wikipedia

  • Metadata modeling — is a type of metamodeling used in software engineering and systems engineering for the analysis and construction of models applicable and useful some predefined class of problems. Meta modeling is the analysis, construction and development of the …   Wikipedia

  • Semantic publishing — on the Web or semantic web publishing refers to publishing information as data objects using a semantic web language or as documents with explicit semantic markups. Semantic publication is intended for computers to understand the structure and… …   Wikipedia

  • Oracle metadata — The Oracle Database contains tables which describe what database objects – i.e. tables, procedures, triggers etc. – exist within the database. This information about the information is known as metadata. Oracle metadata is information contained… …   Wikipedia

  • Oracle Enterprise Metadata Manager — The Oracle Enterprise Metadata Manager (EMM) is a product of the Oracle Corporation that provides an ISO/IEC 11179 metadata registry. Contents 1 Strategic nature of a metadata registry 2 Oracle product editions 3 See also …   Wikipedia

  • Media and Publishing — ▪ 2007 Introduction The Frankfurt Book Fair enjoyed a record number of exhibitors, and the distribution of free newspapers surged. TV broadcasters experimented with ways of engaging their audience via the Internet; mobile TV grew; magazine… …   Universalium

  • D-Scribe Digital Publishing — program of the University Library System, University of Pittsburgh D Scribe Digital Publishing is an open access electronic publishing program of the University Library System (ULS) of the University of Pittsburgh. It comprises over 100 thematic… …   Wikipedia

  • EBSCO Publishing — Type Subsidiary of EBSCO Industries Industry Publishing Founded 1984 Headquarters …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”