-
CollectionText
Oxford Text Archive Core Collection
Date of publication:
1969-1994
Description:
The 1960’s recordings were gathered by Vince McNeaney during the SSRC-funded “Tyneside Linguistic Survey” (TLS) undertaken by Barbara Strang (Principal Investigator), John Pellowe and associates of Newcastle University. ...
This item contains 1 file (3.79
MB).
Publicly Available
-
-
Text
Oxford Text Archive Core Collection
Date of publication:
1640-1740
Author(s):
Unknown author
Description:
Resource deposited with the Oxford Text Archive.
This item contains 5 files (48.44
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2100 BCE-1700 BCE
Author(s):
Unknown author
Description:
This edition of the ETCSL is an expansion, revision
and enhancement of the first-time deposit of the corpus
( - currently not available). The Electronic Text Corpus of Sumerian Literature
...
This item contains 10 files (4.9
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1991-1994
Description:
The British National Corpus is a snapshot of British English in the early 1990s. The British National Corpus is: (i) a sample corpus: composed of text samples generally no longer than 45,000 words, (ii) a synchronic corpus: ...
This item contains 1 file (8.37
MB).
Restricted Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1991-1994
Description:
The British National Corpus is a snapshot of British English in the early 1990s. The British National Corpus is: (i) a sample corpus: composed of text samples generally no longer than 45,000 words, (ii) a synchronic corpus: ...
This item contains 1 file (538.34
MB).
Restricted Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1991-1994
Description:
The British National Corpus is a snapshot of British English in the early 1990s. The British National Corpus is: (i) a sample corpus: composed of text samples generally no longer than 45,000 words, (ii) a synchronic corpus: ...
This item contains 1 file (22.05
MB).
Restricted Use
-
-
Text
Oxford Text Archive Core Collection
Date of publication:
1984
Description:
Resource deposited with the Oxford Text Archive.
This item contains 5 files (6.12
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2020
Description:
The CorCenCC corpus contains over 11 million words (circa 14.4m tokens) from written, spoken and electronic (online, digital texts) Welsh language sources, taken from a range of genres, language varieties (regional and ...
This item contains 1 file (49.41
KB).
Publicly Available
-
-
Text
Oxford Text Archive Core Collection
Date of publication:
1675
Description:
Resource deposited with the Oxford Text Archive.
This item contains 5 files (1.31
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1994-11-01
Author(s):
Unknown author
Description:
Subset of the Brown corpus of American English : [1961]
This item contains 2 files (1.46
MB).
Academic Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1654-1655
Description:
A set of bound photocopies of a collection of printed pamphlets, books and newspapers, 1640-1661. The originals are in the British Library at shelfmarks E1-E1938, E2103-E2143, and E2255-E2272.
This item contains 3 files (2.86
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1990-2007
Description:
The project was the beneficiary of two research grants:
Arts and Humanities Research Council Grant no AH/E000649/1
British Academy Grant no SG39350
...
This item contains 1 file (8.05
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1991-1994
Description:
The resource contains a selection of excerpts from BNC-Baby files that have been annotated for metaphor. There are four registers, each comprising about 50,000 words: academic texts, news texts, fiction, and conversations.
...
This item contains 4 files (16.25
MB).
-
-
Corpus
Oxford Text Archive Legacy Collection
Date of publication:
1979
Author(s):
Unknown author
Description:
Various samples of a wide variety of published texts including magazines, newspapers, and institutional literature as well as many other examples of writing in English Catalogued on RLIN
This item contains 28 files (6.27
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1996-1998
Author(s):
Unknown author
Description:
Corpus of modern written German, approximately 23 million words.
This item contains 1 file (59.03
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
unknown
This item contains 3 files (2.46
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Author(s):
Unknown author
Description:
The Lancaster Corpus of Mandarin Chinese (LCMC) is designed as a Chinese match for the FLOB and FROWN corpora for modern British and American English. The corpus is suitable for use in both monolingual research into modern ...
This item contains 1 file (6.33
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Description:
Mode of access: Online. OTA website The rudimentary form of the Sheffield Corpus of Chinese contains a limited body of representative texts from Medieval (MedC) and Modern Chinese (ModC) periods. They are of two text types: ...
This item contains 1 file (138.18
KB).
Publicly Available
-
-
corpus
Learning and teaching materials
Description:
Text transcripts of all Annual Messages to Congress on the State of the Union, 1790 to 2024, in plain text format for use with concordancers and natural language processing tools.
This item contains 1 file (3.94
MB).
Publicly Available
-
-
corpus
Learning and teaching materials
Description:
Text transcripts of all US presidential inaugural address (to 2021) in plain text format, for use in concordancers and with language processing tools.
This item contains 1 file (335.88
KB).
Publicly Available
-
-
Text
Oxford Text Archive Legacy Collection
Date of publication:
1964-1974
Author(s):
Unknown author
Description:
On leaf preceeding t.p.: Latvijas PSR Zinātnu akadēmija. Andreja Upīša Valodas un literatūras institūts. -- Includes music ; Aspazijas lirika. -- [Waverly, Iowa] : Latvju Grāmata, 1964. -- ([Raksti] ; 2) ; Jaunatnei / ...
This item contains 1 file (360.21
KB).
Publicly Available
-
-
CollectionSound
Oxford Text Archive Core Collection
Date of publication:
2015
Description:
The resource is a speech corpus, with digital audio files, text transcripts, and files containing time stamps of the phoneme boundaries. There are 1813 .wav files containing spoken utterances, 1813 .lab files containing ...
This item contains 3 files (1.97
MB).
Publicly Available
-
-
Text
Oxford Text Archive Core Collection
Date of publication:
unknown
Description:
Modern Yugoslav fiction
This item contains 2 files (8.52
KB).
Publicly Available
-
-
Linguistic corpora
Oxford Text Archive Core Collection
Description:
A corpus of literary texts based on Harold Bloom’s The Western Canon: The Books and School of the Ages (1994), created in order to conduct exploratory research in in Culturomics and Corpus Stylistics. There are 805 texts ...
This item contains 2 files (158.99
MB).
Publicly Available
-
-
corpus
Learning and teaching materials
Description:
A corpus of the novels of G. A. Henty (1832-1902) in plain text format, made available for literary and linguistic research and for natural language processing. The texts were downloaded from Project Gutenberg, and then ...
This item contains 5 files (62.68
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Legacy Collection
Date of publication:
900-1400
Description:
vol. 2 ; pp. 19, 20 vol. 3 ; pp. 37-42 pp. 5,6,8 pp. 6-9
This item contains 2 files (84.52
KB).
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
730-1710
Author(s):
Unknown author
Description:
The corpus is comprised of selections from the following titles: The Prose Solomon and Saturn, and, Adrian and Ritheus ; The Old Testament : the Old English version of the Heptateuch, Aelfreic’s treatise on the Old and New ...
This item contains 4 files (9.61
MB).
Academic Use
-
-
Text
EEBO-TCP
Date of publication:
1619
Description:
A sermon on Matthew XXVI, 26. Reproduction of the original in the Henry E. Huntington Library and Art Gallery.
This item contains 4 files (1.36
MB).
Publicly Available
-
-
CollectionSound
Oxford Text Archive Core Collection
Date of publication:
2001-07-01
Author(s):
Unknown author
Description:
Recordings for the IViE corpus were made between 1997 and 2000, in nine urban locations in the British Isles: London (Second generation Caribbean English), Cambridge, Cardiff (Welsh English), Liverpool, Leeds, Bradford ...
This item contains 2 files (373.7
KB).
Academic Use
-
-
corpus
Oxford Text Archive Core Collection
Date of publication:
1881-1922
Description:
The Corpus of English Novels (CEN), compiled by Hendrik De Smet, has been designed to allow tracking of short-term language change and comparing usage across individual authors. It consists entirely of novels, written by ...
This item contains 2 files (54.16
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1990
Author(s):
Unknown author
Description:
Based on tape recordings of unscripted conversational material gathered by the Tape Recorded Survey of Hiberno-English Speech.
This item contains 1 file (734.75
KB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2004
Description:
The BAWE corpus contains 2761 pieces of proficient assessed student writing, ranging in length from about 500 words to about 5000 words. Holdings are fairly evenly distributed across four broad disciplinary areas (Arts and ...
This item contains 2 files (281.37
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2002-2004
Description:
This corpus contains 979,831 words, made up of 1723 articles taken from three daily French newspapers:
Le Monde (576 articles / 355,046 words)
L'Humanité (576 articles ...
This item contains 1 file (3.34
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1999-2001
Author(s):
Unknown author
Description:
The corpus consists of 1,489 essays written by 440 Swedish university students of English at three different levels, the majority in their first term of full-time studies. The total number of words is 1,221,265, which means ...
This item contains 4 files (3.45
MB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
2003
Author(s):
Unknown author
Description:
The collection consists of: Thirty million words of monolingual written data (Gujarati, Tamil, Hindi, Punjabi-news website articles); 600,000 words of monolingual spoken data (Hindi, Urdu, Punjabi, Bengali, Gujarati-radio ...
This item contains 9 files (108.26
MB).
Publicly Available
-
-
CollectionSound
Oxford Text Archive Core Collection
Date of publication:
1999-2005
Description:
The BASE corpus consists of 160 lectures and 39 seminars recorded in a variety of university departments. Holdings are distributed across four broad disciplinary groups, each represented by 40 lectures and 10 seminars. ...
This item contains 3 files (3.93
MB).
Academic Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1996-2001
Description:
Linguistic corpus comprised of the following texts Bibliographic information gathered from both the electronic texts and deposit forms provided with the CIC : Corpus dell' Italiano Commerciale, compiled and edited by Sara ...
This item contains 1 file (646.48
KB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1560-1760
Description:
The Corpus of English Dialogues comprises dialogues from 1560 to 1760. Dialogues are of prime interest to the study of the development of English
because interactive face-to-face communication has been found to ...
This item contains 4 files (7.43
MB).
Academic Use
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1999-2005
Description:
The Giessen - Long Beach Chaplin Corpus (GLBCC) consists of transcribed interactions between native English speakers, ESL and EFL speakers. Pairs of students, in California (for English as native and second language) and ...
This item contains 1 file (878.38
KB).
Publicly Available
-
-
Corpus
Oxford Text Archive Core Collection
Date of publication:
1410-1695
Author(s):
Nevalainen, Terttu
; et al.show everyone
Nevalainen, Terttu
;
Raumolin-Brunberg, Helena
;
Keränen, Jukka
;
Nevala, Minna
;
Nurmi, Arja
;
Palander-Collin, Minna
;
Taylor, Ann
;
Pintzuk, Susan
;
Warner, Anthony
Description:
The Parsed Corpus of Early English Correspondence contains 4970 personal letters by 666 writers, altogether 2.2 million words of running text from the years 1410?-1681. The letters have been selected to be as socially ...
This item contains 1 file (37.05
MB).
Academic Use
-