Vienna-Oxford International Corpus of English (VOICE)

VOICE is a computer-readable corpus of English as a lingua franca, one of the main usages of the language. It comprises 1 million words of transcribed spoken ELF from professional, educational and leisure domains and various speech event types. The speakers in the corpus are viewed primarily not as language learners but as speakers in their own right. VOICE was thus conceived to serve as the first general corpus of English as a lingua franca (ELF).

Project leader: Barbara Seidlhofer, University of Vienna
Time of compilation: 2001–2009
Size: 1,023,043 words
Language: English as a lingua franca (ELF)
Number of texts/samples: 151
Period: 2001–2007
Released: 2009
Funding: FWF (Austrian Science Fund); OUP (Oxford University Press)
Project home page: http://www.univie.ac.at/voice/

Reference lines and copyright

The recommended citation for VOICE 1.0 Online is:

VOICE. 2009. The Vienna-Oxford International Corpus of English (version 1.0 online). Director: Barbara Seidlhofer; Researchers: Angelika Breiteneder, Theresa Klimpfinger, Stefan Majewski, Marie-Luise Pitzl. http://voice.univie.ac.at (date of last access).

The short citation for VOICE 1.0 Online is:

VOICE. 2009. The Vienna-Oxford International Corpus of English (version 1.0 online). http://voice.univie.ac.at (date of last access).

Manual

An extensive manual for VOICE Online ('Using VOICE Online’) is available at http://www.univie.ac.at/voice/help.

Detailed information about the corpus (design, structure, statistics, transcription,…) can be found at the project web site.

Compilers

Project leader

Barbara Seidlhofer

Compilers

Angelika Breiteneder, Theresa Klimpfinger, Stefan Majewski, Marie-Luise Pitzl

Transcription Team

Ana Babic, Beate Baumgartner, Heike Böhringer, Jelena Djuric, Judith Hafner, Cornelia Hülmbauer, Julia Lichtkoppler, Leopold Lippert, Nicole Lundeen, Elisabeth Oberleitner, Ruth Osimk, Gerhild Silber, Julia Schüller, Kerstin Weber

Availability

VOICE 1.0 Online is available as a free-of-charge resource for non-commercial research purposes: http://www.univie.ac.at/voice/page/corpus_availability.

Users are permitted to use VOICE free of charge for private study and research, provided that usage is for personal non-commercial purposes only and not for any commercial advantage. Non-commercial publications, scientific papers and reports based on VOICE may include brief citations from VOICE. In accordance with standard academic and general practices, all publications, scientific papers and reports based on VOICE published in electronic or other form must give credit to VOICE. The full TERMS OF USE for VOICE can be accessed at http://www.univie.ac.at/voice/page/terms_of_use.

Technical information

The corpus is stored in a TEI-based XML format and rendered into HTML with a set of XSL Transformation stylesheets for online display.