Standard
WikiDoMiner: wikipedia domain-specific miner. /
Ezzini, Saad; Abualhaija, Sallam; Sabetzadeh, Mehrdad.
ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. ed. / Abhik Roychoudhury; Cristian Cadar; Miryung Kim. Association for Computing Machinery (ACM), 2022. p. 1706-1710 (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering).
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Harvard
Ezzini, S, Abualhaija, S & Sabetzadeh, M 2022,
WikiDoMiner: wikipedia domain-specific miner. in A Roychoudhury, C Cadar & M Kim (eds),
ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Association for Computing Machinery (ACM), pp. 1706-1710.
https://doi.org/10.1145/3540250.3558916
APA
Ezzini, S., Abualhaija, S., & Sabetzadeh, M. (2022).
WikiDoMiner: wikipedia domain-specific miner. In A. Roychoudhury, C. Cadar, & M. Kim (Eds.),
ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering (pp. 1706-1710). (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering). Association for Computing Machinery (ACM).
https://doi.org/10.1145/3540250.3558916
Vancouver
Ezzini S, Abualhaija S, Sabetzadeh M.
WikiDoMiner: wikipedia domain-specific miner. In Roychoudhury A, Cadar C, Kim M, editors, ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. Association for Computing Machinery (ACM). 2022. p. 1706-1710. (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering). doi: 10.1145/3540250.3558916
Author
Ezzini, Saad ; Abualhaija, Sallam ; Sabetzadeh, Mehrdad. /
WikiDoMiner: wikipedia domain-specific miner. ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering. editor / Abhik Roychoudhury ; Cristian Cadar ; Miryung Kim. Association for Computing Machinery (ACM), 2022. pp. 1706-1710 (ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering).
Bibtex
@inproceedings{dac7e9b89bd34a72bebc411580e05360,
title = "WikiDoMiner: wikipedia domain-specific miner",
abstract = "We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https: //doi.org/10.5281/zenodo.6672682)",
keywords = "Domain-specific Corpus Generation, Natural Language Processing, Natural-language Requirements, Requirements Engineering, Wikipedia",
author = "Saad Ezzini and Sallam Abualhaija and Mehrdad Sabetzadeh",
year = "2022",
month = nov,
day = "9",
doi = "10.1145/3540250.3558916",
language = "English",
series = "ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering",
publisher = "Association for Computing Machinery (ACM)",
pages = "1706--1710",
editor = "Abhik Roychoudhury and Cristian Cadar and Miryung Kim",
booktitle = "ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering",
address = "United States",
}
RIS
TY - GEN
T1 - WikiDoMiner: wikipedia domain-specific miner
AU - Ezzini, Saad
AU - Abualhaija, Sallam
AU - Sabetzadeh, Mehrdad
PY - 2022/11/9
Y1 - 2022/11/9
N2 - We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https: //doi.org/10.5281/zenodo.6672682)
AB - We introduce WikiDoMiner - a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https: //doi.org/10.5281/zenodo.6672682)
KW - Domain-specific Corpus Generation
KW - Natural Language Processing
KW - Natural-language Requirements
KW - Requirements Engineering
KW - Wikipedia
U2 - 10.1145/3540250.3558916
DO - 10.1145/3540250.3558916
M3 - Conference contribution/Paper
T3 - ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering
SP - 1706
EP - 1710
BT - ESEC/FSE 2022 - Proceedings of the 30th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering
A2 - Roychoudhury, Abhik
A2 - Cadar, Cristian
A2 - Kim, Miryung
PB - Association for Computing Machinery (ACM)
ER -