Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review
Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review
}
TY - CONF
T1 - Dealing with big data outside of the cloud
T2 - LREC2014 Workshop on Challenges in the Management of Large Corpora (CMLC-2)
AU - Vidler, John
AU - Rayson, Paul
AU - Anthony, Laurence
AU - Scott, Andrew
AU - Mariani, John
PY - 2014/5/31
Y1 - 2014/5/31
N2 - The demands placed on systems to analyse corpus data increase with input size, and the traditional approaches to processing this data are increasingly having impractical run-times. We show that the use of desktop GPUs presents a significant opportunity to accelerate a number of stages in the normal corpus analysis pipeline. This paper contains our exploratory work and findings into applying high-performance computing technology and methods to the problem of sorting large numbers of concordance lines.
AB - The demands placed on systems to analyse corpus data increase with input size, and the traditional approaches to processing this data are increasingly having impractical run-times. We show that the use of desktop GPUs presents a significant opportunity to accelerate a number of stages in the normal corpus analysis pipeline. This paper contains our exploratory work and findings into applying high-performance computing technology and methods to the problem of sorting large numbers of concordance lines.
KW - Very Large Corpora
KW - CONCURRENCY
KW - GPU Computing
KW - High Performance Computing
KW - Concordances
KW - Sorting
M3 - Conference paper
SP - 21
EP - 24
Y2 - 31 May 2014
ER -