All posts by Graham McCann

PhySH – Physics Subject Headings

Short Description

PhySH (Physics Subject Headings) is a physics classification scheme developed by the American Physical Society to organize journal, meeting, and other content by topic, available for use starting in January 2016. It is intended initially to meet the specific goals of the APS, while a longer term goal is to make it available for use by the broader community. PhySH consists of hierarchies of concepts grouped into facets: Research Areas, Physical Systems, Properties, Techniques, and Professional Topics. The concepts are also organized by discipline for convenience. Individual concepts may belong to more than one facet or discipline.
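The facet structure described above can be pictured as a small data model. The following is a minimal sketch, not the APS implementation: the `Concept` class, the helper function, and the two sample concepts with their facet assignments are hypothetical illustrations and are not taken from PhySH itself. Only the five facet names come from the description.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# The five facets named in the description above.
FACETS = (
    "Research Areas",
    "Physical Systems",
    "Properties",
    "Techniques",
    "Professional Topics",
)


@dataclass
class Concept:
    """A concept that may sit in several facets and have broader concepts."""

    label: str
    facets: set[str] = field(default_factory=set)
    broader: list[Concept] = field(default_factory=list)

    def ancestors(self) -> list[str]:
        """Return the labels of all broader concepts, nearest first."""
        seen: list[str] = []
        queue = list(self.broader)
        while queue:
            node = queue.pop(0)
            if node.label not in seen:
                seen.append(node.label)
                queue.extend(node.broader)
        return seen


def concepts_in_facet(concepts: list[Concept], facet: str) -> list[str]:
    """List the labels of the concepts assigned to one facet."""
    if facet not in FACETS:
        raise ValueError(f"unknown facet: {facet}")
    return sorted(c.label for c in concepts if facet in c.facets)


# Hypothetical sample data: labels and facet assignments are invented.
condensed = Concept("Condensed matter", {"Research Areas"})
supercond = Concept(
    "Superconductivity", {"Research Areas", "Properties"}, [condensed]
)

print(concepts_in_facet([condensed, supercond], "Research Areas"))
print(supercond.ancestors())
```

Storing the facets as a set on each concept, rather than storing concepts under a single facet, is what lets one concept appear in more than one facet.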

URL

http://physh.aps.org/

Owned/Developed by

  1. Name of Owner: American Physical Society
  2. Technical Contact:  Arthur Smith, *protected email*
  3. License Contact: Mark Doyle, *protected email*

Adopted (as opposed to owned) by organizations/publishers:

How is this KM applied?

  1. Manually
  2. By Authors and Editorial Staff

How is this KM used?

  1. Classify manuscripts within the APS peer-review process
  2. Help editors in finding similar articles previously submitted and in finding suitable referees

Description of the current use case(s) of the KM

  1. Manuscripts are assigned concepts from PhySH by submitting authors; these assignments are reviewed and may be modified by editors.
  2. For each journal section, relevant concepts are assigned to the most appropriate handling editor who is knowledgeable in that area
  3. Editors can search for other articles on the same or related topics

Description of the future/potential use cases of the KM (not yet realized)

  1. Improve finding relevant content across APS journals, which often have overlapping scopes
  2. Link relevant content from other APS non-journal areas (e.g., meetings)
  3. Map other classification systems (e.g., PACS) to PhySH to enable older material to be indexed in the same way
  4. Allow individuals (particularly referees) to identify their areas of expertise
  5. Integrate with other knowledge models – For example, an independent taxonomy for chemical substances or astronomical objects should be relatively easy to append to PhySH.
  6. PhySH concepts have permanent identifiers allowing them to be integrated into the web of linked data.

What are the main goals for using this KM?

  1. Improve the peer-review process for our journals by ensuring the right expertise is applied in reviews (PACS, which was previously used for this, ceased updating in 2010).
  2. Cover all of physics
  3. Improve discovery – Content properly tagged with PhySH is intended to enable new and useful ways to browse and search the content while providing the underpinnings for recommendation systems and other personalized services.

Rationale for KM vs other means of searching and browsing?

  1. By providing a standardized list of “keywords” rather than relying on author-supplied terms or full-text or statistical indexing, we can have a greater assurance that related articles are associated with one another.

Is the KM being actively developed?

  1. Yes, internally. Feedback is also welcome; see the physh.aps.org website for links.

License information:

  1. PhySH is copyrighted with all rights reserved by the American Physical Society. We are still considering what license we would use for any public release of PhySH.

NewsIndexer Thesaurus

Short Description

An indexing system for the newspaper industry. A specialized group of terms with the newspaper industry’s indexing needs in mind. The vocabulary is divided into sections that correspond to the sections of a typical newspaper. An accompanying rule base enables highly accurate categorization of newspaper articles.

URL

http://www.newsindexer.com

Owned/Developed by

  1. Name of Owner: Access Innovations, Inc.
  2. Name of Developer: Access Innovations, Inc.
  3. Technical Contact: Mary Garcia, *protected email*
  4. License Contact:  Marjorie Hlava, *protected email*

Adopted (as opposed to owned) by organizations/publishers:

How is this KM applied?

  1. Manually and Auto-tagging software
  2. By Editorial Staff and Professional indexers

How is this KM used?

  1. Direct Bibliographic Search
  2. Display

Description of the current use case(s) of the KM

Customized version used by Acquire Media for categorization of news items, and RSS delivery according to customers’ interests.

Description of the future/potential use cases of the KM (not yet realized)

Categorization of news stories (including archived stories) by and for newspaper publishers; indexing of 20th and 21st century historical studies.

What are the main goals for using this KM?

  1. Increase downloads

Rationale for KM vs other means of searching and browsing?

Every news day, you can tag the articles as they are produced through a cloud service or installed on your own local servers. We automatically feed this data through NewsIndexer, which scans every article and searches for terms similar to those in its controlled vocabulary. NewsIndexer then displays these terms for the human indexer’s review and approval. For backfile collections you can just accept the indexing as an automatic batch process. For ongoing daily feeds you might want to review all or a random sample of the results on a regular basis for maintenance.
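The suggest-then-review workflow described above can be sketched as follows. This is an illustration under stated assumptions, not NewsIndexer's actual rule base or API: the vocabulary, the function names, and the simple whole-phrase matching are all hypothetical stand-ins for the product's controlled vocabulary and rules.

```python
import re

# Hypothetical controlled vocabulary, grouped by newspaper section.
VOCABULARY = {
    "Sports": ["football", "baseball"],
    "Business": ["stock market", "merger"],
}


def suggest_terms(text, vocabulary=VOCABULARY):
    """Return (section, term) pairs whose term occurs in the article text."""
    lowered = text.lower()
    suggestions = []
    for section, terms in vocabulary.items():
        for term in terms:
            # Whole-word, case-insensitive phrase match (an assumed rule).
            if re.search(r"\b" + re.escape(term) + r"\b", lowered):
                suggestions.append((section, term))
    return suggestions


def index_article(text, review=None):
    """Index one article.

    With review=None every suggestion is accepted (batch mode, as for
    backfile collections). Otherwise `review` is a callable standing in
    for the human indexer; only suggestions it approves are kept.
    """
    suggestions = suggest_terms(text)
    if review is None:
        return suggestions
    return [s for s in suggestions if review(s)]


article = "The merger lifted the stock market, while football fans celebrated."
print(index_article(article))
print(index_article(article, review=lambda s: s[0] == "Business"))
```

Passing a `review` callable models the daily-feed case where an indexer approves each suggestion; omitting it models the automatic batch process for backfiles.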

Is the KM being actively developed?

  1. Yes, internally

License information:

http://www.newsindexer.com/contact.htm

67 Bricks Ltd

67 Bricks works with publishers who want to enrich their content to make it more structured, granular, flexible and reusable.

We help publishers develop content enrichment processes, systems and delivery channels that support more agile and flexible production workflows, increase the value of legacy and new content, increase revenues from existing channels, enable better reuse of content and deliver revenues from new digital products.

Knowledge Models Used

Because we support different publishers, we work with a wide variety of knowledge models appropriate to our clients.

Contact

*protected email*
*protected email*

Taylor & Francis Group

Knowledge Models Used:

To follow…

URL:

http://www.tandfonline.com/

Short Description:

Taylor & Francis Group publishes more than 1,800 journals and over 4,000 new books each year, with a books backlist in excess of 60,000 specialist titles. We are providers of quality information made available through our dedicated platforms Taylor & Francis Online, Taylor & Francis eBooks and CRCnetBASE, disseminating knowledge that enables our customers to perform their jobs efficiently, enhance their education, and help contribute to the advancement of their chosen market sectors.

Our aim is to facilitate discovery and allow our users to access relevant research and information quickly and easily, wherever they are.

Our content spans all areas of Humanities, Social Sciences, Behavioral Sciences, Science, Technology and Medicine, and we are one of the world’s leading publishers of scholarly journals, books, eBooks, text books and reference works, publishing under the imprints Taylor & Francis, Routledge, CRC Press, Garland Science, Psychology Press, and Focal Press.

Contact:

Unified Astronomy Thesaurus

Short Description:

The Unified Astronomy Thesaurus (UAT) is an open, interoperable and community-supported thesaurus which unifies the existing divergent and isolated Astronomy & Astrophysics thesauri into a single high-quality, freely-available open thesaurus formalizing astronomical concepts and their inter-relationships.

URL:

http://astrothesaurus.org/

Owned/Developed by:

  1. Name of Owner: American Astronomical Society, http://aas.org/
  2. Name of Developer:  Katie Frey, ADS, https://groups.google.com/d/forum/uat-users

Adopted (as opposed to owned) by organizations/publishers:

IOP Publishing intends to use this for auto-indexing of images in the AAS Astronomy Image Explorer and for indexing content on the IOPscience journals and books platform.

Is the KM being actively developed?

  1. Yes, by ADS and the astronomy community, http://astrothesaurus.org/

License information:

  1. Name of License
  2. Terms of license or link to license terms

PLOS Thesaurus

Short Description:

The Subject Areas belong to a thesaurus of over 10,000 terms initially built for us over the course of 2012 by Access Innovations. The 2012 thesaurus was based on the controlled vocabulary of classification terms that had been in use in PLOS Editorial Manager augmented by a specialist Science, Technology, Engineering and Medicine thesaurus built by Access Innovations. The entire corpus of PLOS articles was analysed to ensure that the thesaurus covers the research domain comprehensively.

URL:

GitHub: https://github.com/PLOS/plos-thesaurus

Owned/Developed by:

  1. Name of Developer: Rachel Drysdale, *protected email*

How is this KM used?

Machine Aided Indexing (MAI)
e.g. PLOS One: http://www.plosone.org/taxonomy

License information:

https://github.com/PLOS/plos-thesaurus

IOP Thesaurus

Short Description:

The IOP thesaurus is a collection of ca. 6000 terms describing concepts in physics and related areas.

URL:

The model is not currently available externally.

Owned/Developed by:

  1. Name of Owner:   IOP Publishing Ltd
  2. Name of Developer:  Michael Roberts, *protected email*
  3. Technical Contact:   Michael Roberts, *protected email*
  4. License Contact:   Not applicable

Adopted (as opposed to owned) by organizations/publishers:

IOP Publishing

How is this KM applied?

Manually and through auto-tagging software (Luxid/TEMIS), by Editorial Staff and Referees

How is this KM used?

Not currently in production use for semantic enrichment.

Description of the current use case(s) of the KM

Used for indexing of referees, within ScholarOne Manuscripts.

Description of the future/potential use cases of the KM (not yet realized)

Faceted search, display of further relevant content, keyword-driven advertising.

What are the main goals for using this KM?

  1. Improve service to readers on IOPscience
  2. Enhance UX
  3. Increase Search Engine Ranking
  4. Increase time user spends on site
  5. Increase traffic
  6. Increase downloads
  7. New opportunities for innovation

Rationale for KM vs other means of searching and browsing?

As the IOP thesaurus describes relationships between topics as well as providing keywords, it provides a powerful mechanism for determining relevant content and search faceting not available through statistical approaches. The machine-based indexing approach ensures consistency across the corpus, avoiding human bias in applying keywords.

Is the KM being actively developed?

Yes, internally

License information:

KM is not currently available to license

SNOMED CT

Short Description:

SNOMED CT is an extensive clinical terminology that was formed by the merger, expansion, and restructuring of SNOMED RT® (Reference Terminology) and the United Kingdom National Health Service (NHS) Clinical Terms (also known as the Read Codes). It is the most comprehensive clinical vocabulary available in English (or any language). SNOMED CT is concept-oriented and has an advanced structure that meets most accepted criteria for a well-formed, machine-readable terminology. It has been designated as a US standard for electronic health information exchange in Interoperability Specifications produced by the Healthcare Information Technology Standards Panel and has also been adopted for use by the US Federal Government, through the Consolidated Health Informatics (CHI) Initiative, for several clinical domains.

URL:

http://download.nlm.nih.gov/umls/kss/IHTSDO20140731/SnomedCT_Release_INT_20140731.zip

Owned/Developed by:

  1. Name of Owner: International Health Terminology Standards Development Organisation (IHTSDO)
  2. Name of Developer: International Health Terminology Standards Development Organisation (IHTSDO); originally developed by the College of American Pathologists (CAP)
  3. Technical Contact: *protected email*
  4. License Contact: *protected email*

Adopted (as opposed to owned) by organizations/publishers:

Wiley in autotagging software

How is this KM used?

Description of the current use case(s) of the KM

Description of the future/potential use cases of the KM (not yet realized)

What are the main goals for using this KM?

Rationale for KM vs other means of searching and browsing?

Is the KM being actively developed?

Yes

License information:

See http://ihtsdo.org/licensing/

Medical Subject Headings (MeSH®)

Short Description:

MeSH is the National Library of Medicine’s controlled vocabulary thesaurus. It consists of sets of terms naming descriptors in a hierarchical structure that permits searching at various levels of specificity.

MeSH descriptors are arranged in both an alphabetic and a hierarchical structure. At the most general level of the hierarchical structure are very broad headings such as “Anatomy” or “Mental Disorders.” More specific headings, such as “Ankle” and “Conduct Disorder,” are found at narrower levels of the twelve-level hierarchy. There are 27,149 descriptors in 2014 MeSH. There are also over 218,000 entry terms that assist in finding the most appropriate MeSH Heading; for example, “Vitamin C” is an entry term to “Ascorbic Acid.” In addition to these headings, there are more than 219,000 headings called Supplementary Concept Records (formerly Supplementary Chemical Records) within a separate thesaurus.
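How entry terms and the hierarchy support searching can be sketched with a toy lookup. This is a hedged illustration, not NLM's data format or software: the function names are hypothetical, only the headings quoted above are used, and the `BROADER` links collapse the intermediate hierarchy levels, which the description does not give.

```python
# Headings quoted in the description above.
DESCRIPTORS = [
    "Anatomy",
    "Ankle",
    "Mental Disorders",
    "Conduct Disorder",
    "Ascorbic Acid",
]

# Entry term -> preferred descriptor (example from the description).
ENTRY_TERMS = {"vitamin c": "Ascorbic Acid"}

# Narrower heading -> broader heading. Assumption: intermediate levels
# of the real hierarchy are omitted here for brevity.
BROADER = {"Ankle": "Anatomy", "Conduct Disorder": "Mental Disorders"}


def resolve(term):
    """Map a descriptor or entry term to its descriptor, else None."""
    key = term.strip().lower()
    for descriptor in DESCRIPTORS:
        if descriptor.lower() == key:
            return descriptor
    return ENTRY_TERMS.get(key)


def expand(heading):
    """Return the heading plus every narrower heading beneath it."""
    result = [heading]
    for narrower, broader in BROADER.items():
        if broader == heading:
            result.extend(expand(narrower))
    return result


print(resolve("Vitamin C"))
print(expand("Anatomy"))
```

Searching on a broad heading with `expand` also retrieves items tagged with its narrower headings, which is one way to read "searching at various levels of specificity."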

URL:

 http://www.nlm.nih.gov/mesh/meshhome.html

Owned/Developed by:

  1. Name of Owner: see http://www.nlm.nih.gov/pubs/factsheets/mesh.html
  2. Name of Developer:
  3. Technical Contact:
  4. License Contact:

Adopted (as opposed to owned) by organizations/publishers:

How is this Knowledge Model used?

Description of the current use case(s) of the KM

Description of the future/potential use cases of the KM (not yet realized)

What are the main goals for using this KM?

Rationale for KM vs other means of searching and browsing?

Is the KM being actively developed?

Yes

License information:

License not required