Knowledge Models Used
Mark Doyle, *protected email*
Mark Doyle, *protected email*
PhySH (Physics Subject Headings) is a physics classification scheme developed by the American Physical Society to organize journal, meeting, and other content by topic, available for use starting in January 2016. It is intended initially to meet the specific goals of the APS, while a longer term goal is to make it available for use by the broader community. PhySH consists of hierarchies of concepts grouped into facets: Research Areas, Physical Systems, Properties, Techniques, and Professional Topics. The concepts are also organized by discipline for convenience. Individual concepts may belong to more than one facet or discipline.
An indexing system for the newspaper industry. A specialized group of terms with the newspaper industry’s indexing needs in mind. The vocabulary is divided into sections that correspond to the sections of a typical newspaper. An accompanying rule base enables highly accurate categorization of newspaper articles.
Customized version used by Acquire Media for categorization of news items, and RSS delivery according to customers’ interests.
Categorization of news stories (including archived stories) by and for newspaper publishers; indexing of 20th and 21st century historical studies.
Every news day, you can tag the articles as they are produced through a cloud service or installed on your own local servers. We automatically feed this data through NewsIndexer, which scans every article and searches for terms similar to those in its controlled vocabulary. NewsIndexer then displays these terms for the human indexer’s review and approval. For backfile collections you can just accept the indexing as an automatic batch process. For ongoing daily feeds you might want to review all or a random sample of the results on a regular basis for maintenance.
67 Bricks works with publishers who want to enrich their content to make it more structured, granular, flexible and reusable.
We help publishers develop content enrichment processes, systems and delivery channels that support more agile and flexible production workflows, increase the value of legacy and new content, increase revenues from existing channels, enable better reuse of content and deliver revenues from new digital products.
Because we support different publishers, we work with a wide variety of knowledge models appropriate to our clients.
Taylor & Francis Group publishes more than 1,800 journals and over 4,000 new books each year, with a books backlist in excess of 60,000 specialist titles. We are providers of quality information made available through our dedicated platforms Taylor & Francis Online, Taylor & Francis eBooks and CRCnetBASE, disseminating knowledge that enables our customers to perform their jobs efficiently, enhance their education, and help contribute to the advancement of their chosen market sectors.
Our aim is to facilitate discovery and allow our users to access relevant research and information quickly and easily, wherever they are.
Our content spans all areas of Humanities, Social Sciences, Behavioral Sciences, Science, Technology and Medicine, and we are one of the world’s leading publishers of scholarly journals, books, eBooks, text books and reference works, publishing under the imprints Taylor & Francis, Routledge, CRC Press, Garland Science, Psychology Press, and Focal Press.
The Unified Astronomy Thesaurus (UAT) is an open, interoperable and community-supported thesaurus which unifies the existing divergent and isolated Astronomy & Astrophysics thesauri into a single high-quality, freely-available open thesaurus formalizing astronomical concepts and their inter-relationships.
The Subject Areas belong to a thesaurus of over 10,000 terms initially built for us over the course of 2012 by Access Innovations. The 2012 thesaurus was based on the controlled vocabulary of classification terms that had been in use in PLOS Editorial Manager augmented by a specialist Science, Technology, Engineering and Medicine thesaurus built by Access Innovations. The entire corpus of PLOS articles was analysed to ensure that the thesaurus covers the research domain comprehensively.
Machine Aided Indexing (MAI)
e.g. PLOS One: http://www.plosone.org/taxonomy
The IOP thesaurus is a collection of ca. 6000 terms describing concepts in physics and related areas.
The model is not currently available externally.
IOP Publishing, applied manually and through auto-tagging software (Luxid/TEMIS), Editorial Staff and Referees
Not currently in production use for semantic enrichment.
Used for indexing of referees, within ScholarOne Manuscripts.
Faceted search, display of further relevant content, key-word driven advertising.
As the IOP thesaurus describes relationships between topics as well as providing keywords, it provides a powerful mechanism for determining relevant content and search faceting not available through statistical approaches. The machine-based indexing approach ensures consistency across the corpus, avoiding human bias in applying keywords.
KM is not currently available to license
SNOMED CT is an extensive clinical terminology that was formed by the merger, expansion, and restructuring of SNOMED RT® (Reference Terminology) and the United Kingdom National Health Service (NHS) Clinical Terms (also known as the Read Codes). It is the most comprehensive clinical vocabulary available in English (or any language). SNOMED CT is concept-oriented and has an advanced structure that meets most accepted criteria for a well-formed, machine-readable terminology. It has been designated as a US standard for electronic health information exchange in Interoperability Specifications produced by the Healthcare Information Technology Standards Panel and has also been adopted for use by the US Federal Government, through the Consolidated Health Informatics (CHI) Initiative, for several clinical domains
Wiley in autotagging software
Please describe the added-value that the KM delivers in comparison to “standard” full-text search indexes, author-supplied keywords or automated on-the-fly generation of topics based on purely statistical information.
MeSH is the National Library of Medicine’s controlled vocabulary thesaurus. It consists of sets of terms naming descriptors in a hierarchical structure that permits searching at various levels of specificity.
MeSH descriptors are arranged in both an alphabetic and a hierarchical structure. At the most general level of the hierarchical structure are very broad headings such as “Anatomy” or “Mental Disorders.” More specific headings are found at more narrow levels of the twelve-level hierarchy, such as “Ankle” and “Conduct Disorder.” There are 27,149 descriptors in 2014 MeSH. There are also over 218,000 entry terms that assist in finding the most appropriate MeSH Heading, for example, “Vitamin C” is an entry term to “Ascorbic Acid.” In addition to these headings, there are more than 219,000 headings called Supplementary Concept Records (formerly Supplementary Chemical Records) within a separate thesaurus.
License not required