WikiDoMiner: wikipedia domain-specific miner

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN; Conference contribution/Paper; peer-review

Published

Standard

WikiDoMiner: wikipedia domain-specific miner. / Ezzini, Saad; Abualhaija, Sallam; Sabetzadeh, Mehrdad.
ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. ed. / Abhik Roychoudhury; Cristian Cadar; Miryung Kim. Association for Computing Machinery (ACM), 2022. p. 1706-1710 (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering).

Harvard

Ezzini, S, Abualhaija, S & Sabetzadeh, M 2022, WikiDoMiner: wikipedia domain-specific miner. in A Roychoudhury, C Cadar & M Kim (eds), ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Association for Computing Machinery (ACM), pp. 1706-1710. https://doi.org/10.1145/3540250.3558916

APA

Ezzini, S., Abualhaija, S., & Sabetzadeh, M. (2022). WikiDoMiner: wikipedia domain-specific miner. In A. Roychoudhury, C. Cadar, & M. Kim (Eds.), ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering (pp. 1706-1710). (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering). Association for Computing Machinery (ACM). https://doi.org/10.1145/3540250.3558916

Vancouver

Ezzini S, Abualhaija S, Sabetzadeh M. WikiDoMiner: wikipedia domain-specific miner. In Roychoudhury A, Cadar C, Kim M, editors, ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. Association for Computing Machinery (ACM). 2022. p. 1706-1710. (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering). doi: 10.1145/3540250.3558916

Author

Ezzini, Saad ; Abualhaija, Sallam ; Sabetzadeh, Mehrdad. / WikiDoMiner: wikipedia domain-specific miner. ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. editor / Abhik Roychoudhury ; Cristian Cadar ; Miryung Kim. Association for Computing Machinery (ACM), 2022. pp. 1706-1710 (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering).

Bibtex

@inproceedings{dac7e9b89bd34a72bebc411580e05360,
title = "WikiDoMiner: wikipedia domain-specific miner",
abstract = "We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https://doi.org/10.5281/zenodo.6672682).",
keywords = "Domain-specific Corpus Generation, Natural Language Processing, Natural-language Requirements, Requirements Engineering, Wikipedia",
author = "Saad Ezzini and Sallam Abualhaija and Mehrdad Sabetzadeh",
year = "2022",
month = nov,
day = "9",
doi = "10.1145/3540250.3558916",
language = "English",
series = "ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering",
publisher = "Association for Computing Machinery (ACM)",
pages = "1706--1710",
editor = "Abhik Roychoudhury and Cristian Cadar and Miryung Kim",
booktitle = "ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering",
address = "United States",

}

RIS

TY - GEN

T1 - WikiDoMiner: wikipedia domain-specific miner

AU - Ezzini, Saad

AU - Abualhaija, Sallam

AU - Sabetzadeh, Mehrdad

PY - 2022/11/9

Y1 - 2022/11/9

N2 - We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https://doi.org/10.5281/zenodo.6672682).

AB - We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https://doi.org/10.5281/zenodo.6672682).

KW - Domain-specific Corpus Generation

KW - Natural Language Processing

KW - Natural-language Requirements

KW - Requirements Engineering

KW - Wikipedia

U2 - 10.1145/3540250.3558916

DO - 10.1145/3540250.3558916

M3 - Conference contribution/Paper

T3 - ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering

SP - 1706

EP - 1710

BT - ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering

A2 - Roychoudhury, Abhik

A2 - Cadar, Cristian

A2 - Kim, Miryung

PB - Association for Computing Machinery (ACM)

ER -