Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English
dc.contributor | Coltheart, Max School of Behavioural Science Macquarie University Sydney |
dc.contributor.editor | Coltheart, Max |
dc.contributor.editor | Kucera, Henry |
dc.coverage.placeName | s.l. |
dc.date.accessioned | 2018-07-27 |
dc.date.accessioned | 2022-08-21T16:24:42Z |
dc.date.available | 2022-08-21T16:24:42Z |
dc.date.created | 1961 |
dc.identifier | ota:0668 |
dc.identifier.uri | http://hdl.handle.net/20.500.14106/0668 |
dc.description.abstract | In English “The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961” Frequency analysis of English usage Publication based on OTA text: Computational analysis of present-day American English / by Henry Kuçera and W. Nelson Francis. -- Providence [RI] : Brown University, 1967. -- pp. xvii-xxv Publication based on OTA text: Frequency analysis of English usage : lexicon and grammar / by Henry Kuçera and W. Nelson Francis. -- Boston : Houghton Mifflin, 1982. -- pp. 3-15. -- “Available with prior consent of depositor for research purposes only”. -- United States Office of Education. Cooperative Research Project No. E-007. -- OTA 0402 |
dc.format.extent | Text data (3 files : ca. 1095, 5, 8 KB) |
dc.format.medium | Digital bitstream |
dc.language | English |
dc.language.iso | eng |
dc.publisher | University of Oxford |
dc.relation.ispartof | Oxford Text Archive Legacy Collection |
dc.rights | Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ |
dc.rights.label | PUB |
dc.subject.lcsh | Computational linguistics -- Australia |
dc.subject.lcsh | Anthologies -- United States |
dc.subject.other | Anthologies |
dc.title | Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English |
dc.type | Text |
has.files | yes |
branding | Oxford Text Archive |
files.size | 1128179 |
files.count | 2 |
otaterms.date.range | 1900-1999 |
This item is
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Publicly Available
and licensed under:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Files for this item
Download all local files for this item (1.08 MB)

- Name
- kuceradat-0668.txt
- Size
- 1.07 MB
- Format
- Text file
- Description
- Version of the work in plain text format
1 01 001 .0044**K 1 01 001 .01 1 01 001 .020 2 01 001 .027 1 01 001 .028 1 01 001 .05 1 01 001 .05**K 3 01 001 .07 1 01 001 .076 1 01 001 .09 1 01 001 .1 1 01 001 .130 1 01 001 .143 1 01 001 .179 12 02 002 .22 3 03 003 .22-CALIBER 1 01 001 .222'S 1 01 001 .243 1 01 001 .255 2 01 001 .264 ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ -^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^^ ^ ^ ^ ^ ^ ^ ^ ^ ^^ ^^ ^ ^ ^ ^^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ` ^ ^ ^ '- ^ ^ ^ ^ ^ 4 01 002 .45 1 01 001 .45-CALIBER 1 01 001 .455 2 01 001 .458 2 02 002 .5 1 01 001 .50 1 01 001 .500 1 01 001 .7 1 01 001 .75 1 01 001 .7854 1 01 001 (*=A,B*$) 139 12 039 + 1 01 001 +.04 1 01 001 +.50 1 01 001 +.7 1 01 001 +C 1 01 001 $0.9 1 . . .

- Name
- kuceradoc-0668.txt
- Size
- 7.49 KB
- Format
- Text file
- Description
- Version of the work in plain text format
KUCERA (Kucera-Francis Word-frequency Count) Notes provided by Roger Mitton, Dept of Computer Science, Birkbeck College, Malet Street, London WC1E 7HX November 1984 KUCERA contains over 50,000 entries from the Kucera-Francis Frequency Count of items in the corpus of text collected at Brown University (commonly referred to as the Brown Corpus). Details of the corpus are given in 'Computational Analysis of Present-day American English' by Henry Kucera and W. Nelson Francis, Brown University Press, 1967, and also in 'Frequency Analysis of English Usage: Lexicon and Grammar' by the same authors, published by Houghton Mifflin, 1982. The following is from the latter book: 'The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961. The text is divided into five hundred samples of about two thousand words each, which are assigned . . .