The Digital Object Identifier (DOI®) is a system for identifying and exchanging intellectual property in the digital environment. It provides a framework for managing intellectual content, for linking customers with content suppliers, for facilitating electronic commerce, and enabling automated copyright management for all types of media.
The Dublin Core Metadata Initiative (DCMI) is an organization dedicated to promoting the widespread adoption of interoperable metadata standards and developing specialized metadata vocabularies for describing resources that enable more intelligent information discovery systems. Membership and participation in any of the DCMI Working Groups is open to all interested parties.
http://www.iso.ch/iso/en/ISOOnline.frontpage
The International Organization for Standardization (ISO) is a worldwide federation of national standards bodies from more than 140 countries. ISO is a non-governmental organization established in 1947. The mission of ISO is to promote the development of standardization and related activities in the world with a view to facilitating the international exchange of goods and services, and to developing cooperation in the spheres of intellectual, scientific, technological and economic activity.
ISO has developed standards for both multilingual and monolingual thesauri (ISO 2788:1986 and ISO 5964:1985). In addition, ISO has a specific technical committee, ISO/TC 37: Terminology and other language resources, which focuses on standardization and computer exchange of terminology.
NISO, the National Information Standards Organization, is a non-profit association accredited by the American National Standards Institute (ANSI). NISO standards embrace all technologies dealing with information management, including information retrieval, data storage, metadata, and preservation of materials. NISO currently is sponsoring an initiative to revise ANSI/NISO Z39.19, Guidelines for the Construction, Format, and Management of Monolingual Thesauri to better address the needs of the Internet and Intranet communities. For more information on this revision initiative, see http://www.niso.org/committees/MT-info.html
The Semantic Web is the abstract representation of data on the World Wide Web, based on the RDF standards and other standards to be defined. It is being developed by the W3C, in collaboration with a large number of researchers and industrial partners. The Semantic Web is an extension of the current Web in which information is given well-defined meaning, better enabling computers and people to work in cooperation. It is part of the Technology & Society Domain of the W3C (World Wide Web Consortium).
Initially launched in 1987, the Text Encoding Initiative is an international and interdisciplinary standard that helps libraries, museums, publishers, and individual scholars represent all kinds of literary and linguistic texts for online research and teaching, using an encoding scheme that is maximally expressive and minimally obsolescent. In 2000, the TEI Consortium was established to continue to develop and maintain the standard.
http://trec.nist.gov/overview.html
The Text REtrieval Conference (TREC), co-sponsored by the National Institute of Standards and Technology (NIST) and the Defense Advanced Research Projects Agency (DARPA), was started in 1992. Its purpose is to support research within the information retrieval community by providing the infrastructure necessary for large-scale evaluation of text retrieval methodologies. In particular, the TREC workshop series has the following goals:
TREC provides large
collections of data that can be used for testing and evaluating automated
classification techniques.