Home > Research > Publications & Outputs > A journey to Semantic Web query federation in t...

Electronic data

  • 1471-2105-10-S10-S10

    Rights statement: © 2009 Cheung et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

    Final published version, 1.14 MB, PDF document

    Available under license: CC BY

Links

Text available via DOI:

View graph of relations

A journey to Semantic Web query federation in the life sciences

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

A journey to Semantic Web query federation in the life sciences. / Cheung, Kei-Hoi; Frost, H. Robert; Marshall, M. Scott et al.
In: BMC Bioinformatics, Vol. 10, No. Suppl. 10, S10, 01.10.2009.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Cheung, K-H, Frost, HR, Marshall, MS, Prud'hommeaux, E, Samwald, M, Zhao, J & Paschke, A 2009, 'A journey to Semantic Web query federation in the life sciences', BMC Bioinformatics, vol. 10, no. Suppl. 10, S10. https://doi.org/10.1186/1471-2105-10-S10-S10

APA

Cheung, K-H., Frost, H. R., Marshall, M. S., Prud'hommeaux, E., Samwald, M., Zhao, J., & Paschke, A. (2009). A journey to Semantic Web query federation in the life sciences. BMC Bioinformatics, 10(Suppl. 10), Article S10. https://doi.org/10.1186/1471-2105-10-S10-S10

Vancouver

Cheung K-H, Frost HR, Marshall MS, Prud'hommeaux E, Samwald M, Zhao J et al. A journey to Semantic Web query federation in the life sciences. BMC Bioinformatics. 2009 Oct 1;10(Suppl. 10):S10. doi: 10.1186/1471-2105-10-S10-S10

Author

Cheung, Kei-Hoi ; Frost, H. Robert ; Marshall, M. Scott et al. / A journey to Semantic Web query federation in the life sciences. In: BMC Bioinformatics. 2009 ; Vol. 10, No. Suppl. 10.

Bibtex

@article{5b8d361cadbc424380cac184a124d6fd,
title = "A journey to Semantic Web query federation in the life sciences",
abstract = "BackgroundAs interest in adopting the Semantic Web in the biomedical domain continues to grow, Semantic Web technology has been evolving and maturing. A variety of technological approaches including triplestore technologies, SPARQL endpoints, Linked Data, and Vocabulary of Interlinked Datasets have emerged in recent years. In addition to the data warehouse construction, these technological approaches can be used to support dynamic query federation. As a community effort, the BioRDF task force, within the Semantic Web for Health Care and Life Sciences Interest Group, is exploring how these emerging approaches can be utilized to execute distributed queries across different neuroscience data sources.Methods and resultsWe have created two health care and life science knowledge bases. We have explored a variety of Semantic Web approaches to describe, map, and dynamically query multiple datasets. We have demonstrated several federation approaches that integrate diverse types of information about neurons and receptors that play an important role in basic, clinical, and translational neuroscience research. Particularly, we have created a prototype receptor explorer which uses OWL mappings to provide an integrated list of receptors and executes individual queries against different SPARQL endpoints. We have also employed the AIDA Toolkit, which is directed at groups of knowledge workers who cooperatively search, annotate, interpret, and enrich large collections of heterogeneous documents from diverse locations. We have explored a tool called {"}FeDeRate{"}, which enables a global SPARQL query to be decomposed into subqueries against the remote databases offering either SPARQL or SQL query interfaces. Finally, we have explored how to use the vocabulary of interlinked Datasets (voiD) to create metadata for describing datasets exposed as Linked Data URIs or SPARQL endpoints.ConclusionWe have demonstrated the use of a set of novel and state-of-the-art Semantic Web technologies in support of a neuroscience query federation scenario. We have identified both the strengths and weaknesses of these technologies. While Semantic Web offers a global data model including the use of Uniform Resource Identifiers (URI's), the proliferation of semantically-equivalent URI's hinders large scale data integration. Our work helps direct research and tool development, which will be of benefit to this community.",
author = "Kei-Hoi Cheung and Frost, {H. Robert} and Marshall, {M. Scott} and Eric Prud'hommeaux and Matthias Samwald and Jun Zhao and Adrian Paschke",
note = "{\textcopyright} 2009 Cheung et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.",
year = "2009",
month = oct,
day = "1",
doi = "10.1186/1471-2105-10-S10-S10",
language = "English",
volume = "10",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",
number = "Suppl. 10",

}

RIS

TY - JOUR

T1 - A journey to Semantic Web query federation in the life sciences

AU - Cheung, Kei-Hoi

AU - Frost, H. Robert

AU - Marshall, M. Scott

AU - Prud'hommeaux, Eric

AU - Samwald, Matthias

AU - Zhao, Jun

AU - Paschke, Adrian

N1 - © 2009 Cheung et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PY - 2009/10/1

Y1 - 2009/10/1

N2 - BackgroundAs interest in adopting the Semantic Web in the biomedical domain continues to grow, Semantic Web technology has been evolving and maturing. A variety of technological approaches including triplestore technologies, SPARQL endpoints, Linked Data, and Vocabulary of Interlinked Datasets have emerged in recent years. In addition to the data warehouse construction, these technological approaches can be used to support dynamic query federation. As a community effort, the BioRDF task force, within the Semantic Web for Health Care and Life Sciences Interest Group, is exploring how these emerging approaches can be utilized to execute distributed queries across different neuroscience data sources.Methods and resultsWe have created two health care and life science knowledge bases. We have explored a variety of Semantic Web approaches to describe, map, and dynamically query multiple datasets. We have demonstrated several federation approaches that integrate diverse types of information about neurons and receptors that play an important role in basic, clinical, and translational neuroscience research. Particularly, we have created a prototype receptor explorer which uses OWL mappings to provide an integrated list of receptors and executes individual queries against different SPARQL endpoints. We have also employed the AIDA Toolkit, which is directed at groups of knowledge workers who cooperatively search, annotate, interpret, and enrich large collections of heterogeneous documents from diverse locations. We have explored a tool called "FeDeRate", which enables a global SPARQL query to be decomposed into subqueries against the remote databases offering either SPARQL or SQL query interfaces. Finally, we have explored how to use the vocabulary of interlinked Datasets (voiD) to create metadata for describing datasets exposed as Linked Data URIs or SPARQL endpoints.ConclusionWe have demonstrated the use of a set of novel and state-of-the-art Semantic Web technologies in support of a neuroscience query federation scenario. We have identified both the strengths and weaknesses of these technologies. While Semantic Web offers a global data model including the use of Uniform Resource Identifiers (URI's), the proliferation of semantically-equivalent URI's hinders large scale data integration. Our work helps direct research and tool development, which will be of benefit to this community.

AB - BackgroundAs interest in adopting the Semantic Web in the biomedical domain continues to grow, Semantic Web technology has been evolving and maturing. A variety of technological approaches including triplestore technologies, SPARQL endpoints, Linked Data, and Vocabulary of Interlinked Datasets have emerged in recent years. In addition to the data warehouse construction, these technological approaches can be used to support dynamic query federation. As a community effort, the BioRDF task force, within the Semantic Web for Health Care and Life Sciences Interest Group, is exploring how these emerging approaches can be utilized to execute distributed queries across different neuroscience data sources.Methods and resultsWe have created two health care and life science knowledge bases. We have explored a variety of Semantic Web approaches to describe, map, and dynamically query multiple datasets. We have demonstrated several federation approaches that integrate diverse types of information about neurons and receptors that play an important role in basic, clinical, and translational neuroscience research. Particularly, we have created a prototype receptor explorer which uses OWL mappings to provide an integrated list of receptors and executes individual queries against different SPARQL endpoints. We have also employed the AIDA Toolkit, which is directed at groups of knowledge workers who cooperatively search, annotate, interpret, and enrich large collections of heterogeneous documents from diverse locations. We have explored a tool called "FeDeRate", which enables a global SPARQL query to be decomposed into subqueries against the remote databases offering either SPARQL or SQL query interfaces. Finally, we have explored how to use the vocabulary of interlinked Datasets (voiD) to create metadata for describing datasets exposed as Linked Data URIs or SPARQL endpoints.ConclusionWe have demonstrated the use of a set of novel and state-of-the-art Semantic Web technologies in support of a neuroscience query federation scenario. We have identified both the strengths and weaknesses of these technologies. While Semantic Web offers a global data model including the use of Uniform Resource Identifiers (URI's), the proliferation of semantically-equivalent URI's hinders large scale data integration. Our work helps direct research and tool development, which will be of benefit to this community.

U2 - 10.1186/1471-2105-10-S10-S10

DO - 10.1186/1471-2105-10-S10-S10

M3 - Journal article

VL - 10

JO - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

IS - Suppl. 10

M1 - S10

ER -