The creation and characterisation of a National Compound Collection - Research Portal

Home > Research > Publications & Outputs > The creation and characterisation of a National...

Chemistry

Associated organisational unit

Chemical Synthesis

Text available via DOI:

https://doi.org/10.1039/c6sc00264a
Final published version

View graph of relations

The creation and characterisation of a National Compound Collection: The Royal Society of Chemistry pilot

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

David M. Andrews
Laura M. Broad
Paul J. Edwards
David N.A. Fox
Timothy Gallagher
Stephen L. Garland
Richard Kidd
Joseph B. Sweeney

More...

<mark>Journal publication date</mark>	2016
<mark>Journal</mark>	Chemical Science
Issue number	6
Volume	7
Number of pages	10
Pages (from-to)	3869-3878
Publication Status	Published
Early online date	23/02/16
<mark>Original language</mark>	English

Abstract

We present a summary of the National Compound Collection (NCC) pilot; which harvested chemical structure data from 746 publicly-Available PhD theses to create an enhanced database of diverse and interesting (largely organic) molecular entities. The database comprised ∼75000 structure entries, of which 70% were new to ChemSpider at the time of upload. The dataset was evaluated for structural uniqueness by twelve external drug discovery groups from the pharmaceutical, biotech, academic and not-for-profit sectors. These partners generated data reported here comparing the NCC pilot with their in-house compound collections. The proportion of NCC structures considered to be useful for drug discovery ranged from 5-80% depending on the strictness of the filters used; most interestingly from a drug discovery standpoint ∼13k NCC compounds (18% of the NCC) passed the filters and were of good diversity. These compounds are quite different from those that are already present in the screening collections but not so different that they are no longer considered to be drug-like. In general, the drug discovery teams would consider these compounds to be high value molecules for inclusion in their screening collections. This pilot addressed the potential value of unpublished data and explored the practicalities of large-scale data extraction, to inform both retrospective and prospective extraction of chemical data from theses.

Research

Associated organisational unit

Links

Text available via DOI:

The creation and characterisation of a National Compound Collection: The Royal Society of Chemistry pilot

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us