
Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English

 
dc.contributor Coltheart, Max (School of Behavioural Science, Macquarie University, Sydney)
dc.contributor.editor Coltheart, Max
dc.contributor.editor Kucera, Henry
dc.coverage.placeName s.l.
dc.date.accessioned 2018-07-27
dc.date.accessioned 2022-08-21T16:24:42Z
dc.date.available 2022-08-21T16:24:42Z
dc.date.created 1961
dc.identifier ota:0668
dc.identifier.uri http://hdl.handle.net/20.500.14106/0668
dc.description.abstract In English. “The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961.” Frequency analysis of English usage. Publication based on OTA text: Computational analysis of present-day American English / by Henry Kuçera and W. Nelson Francis. -- Providence [RI] : Brown University, 1967. -- pp. xvii-xxv. Publication based on OTA text: Frequency analysis of English usage : lexicon and grammar / by Henry Kuçera and W. Nelson Francis. -- Boston : Houghton Mifflin, 1982. -- pp. 3-15. -- “Available with prior consent of depositor for research purposes only”. -- United States Office of Education. Cooperative Research Project No. E-007. -- OTA 0402
dc.format.extent Text data (3 files : ca. 1095, 5, 8 KB)
dc.format.medium Digital bitstream
dc.language English
dc.language.iso eng
dc.publisher University of Oxford
dc.relation.ispartof Oxford Text Archive Legacy Collection
dc.rights Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.rights.label PUB
dc.subject.lcsh Computational linguistics -- Australia
dc.subject.lcsh Anthologies -- United States
dc.subject.other Anthologies
dc.title Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English
dc.type Text
has.files yes
branding Oxford Text Archive
files.size 1128179
files.count 2
otaterms.date.range 1900-1999

This item is Publicly Available and licensed under: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)

 Files for this item


Name kuceradat-0668.txt
Size 1.07 MB
Format Text file
Description Version of the work in plain text format

File Preview
    1 01 001 .0044**K
    1 01 001 .01
    1 01 001 .020
    2 01 001 .027
    1 01 001 .028
    1 01 001 .05
    1 01 001 .05**K
    3 01 001 .07
    1 01 001 .076
    1 01 001 .09
    1 01 001 .1
    1 01 001 .130
    1 01 001 .143
    1 01 001 .179
   12 02 002 .22
    3 03 003 .22-CALIBER
    1 01 001 .222'S
    1 01 001 .243
    1 01 001 .255
    2 01 001 .264
    . . .
    4 01 002 .45
    1 01 001 .45-CALIBER
    1 01 001 .455
    2 01 001 .458
    2 02 002 .5
    1 01 001 .50
    1 01 001 .500
    1 01 001 .7
    1 01 001 .75
    1 01 001 .7854
    1 01 001 (*=A,B*$)
  139 12 039 +
    1 01 001 +.04
    1 01 001 +.50
    1 01 001 +.7
    1 01 001 +C
    1 01 001 $0.9
   1 . . .
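
The preview rows above share a fixed shape: three whitespace-separated numeric fields followed by the token itself. A minimal Python sketch for reading such rows might look like the following. The field names `frequency`, `n_genres`, and `n_samples` are assumptions, since the preview does not label its columns; check them against kuceradoc-0668.txt before relying on them. `parse_line` and `Entry` are hypothetical names introduced here for illustration.

```python
from typing import NamedTuple, Optional


class Entry(NamedTuple):
    # Column meanings are ASSUMED, not stated in the preview.
    frequency: int   # first column (assumed: total occurrences)
    n_genres: int    # second column (assumed: number of genre categories)
    n_samples: int   # third column (assumed: number of text samples)
    token: str       # the word or symbol as listed


def parse_line(line: str) -> Optional[Entry]:
    """Parse one wordlist row; return None for rows that do not fit."""
    # Split on runs of whitespace, at most three times, so the token
    # keeps any internal characters exactly as listed.
    parts = line.split(None, 3)
    if len(parts) != 4:
        return None
    first, second, third, token = parts
    if not (first.isdigit() and second.isdigit() and third.isdigit()):
        return None  # e.g. a truncated preview row such as "1 . . ."
    return Entry(int(first), int(second), int(third), token.rstrip())


# Rows copied from the file preview, plus its truncated last row.
SAMPLE = """\
    1 01 001 .0044**K
   12 02 002 .22
    3 03 003 .22-CALIBER
  139 12 039 +
   1 . . .
"""

entries = [e for e in map(parse_line, SAMPLE.splitlines()) if e is not None]
for entry in entries:
    print(entry)
```

Rows that do not match the expected shape are skipped rather than guessed at, so the truncated final preview row is dropped.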
										
Name kuceradoc-0668.txt
Size 7.49 KB
Format Text file
Description Version of the work in plain text format

File Preview
KUCERA (Kucera-Francis Word-frequency Count)

Notes provided by

    Roger Mitton, Dept of Computer Science,
    Birkbeck College, Malet Street,
    London WC1E 7HX

    November 1984

     KUCERA  contains   over   50,000   entries   from   the
Kucera-Francis  Frequency  Count  of  items in the corpus of
text collected at Brown University (commonly referred to  as
the  Brown  Corpus).   Details  of  the  corpus are given in
'Computational Analysis of Present-day American English'  by
Henry Kucera and W.  Nelson Francis, Brown University Press,
1967, and also in  'Frequency  Analysis  of  English  Usage:
Lexicon  and  Grammar'  by  the  same  authors, published by
Houghton Mifflin, 1982.  The following is  from  the  latter
book:

     'The corpus consists of approximately 1,014,000 graphic
words of running text, all of which was first printed in the
United States in the year 1961.  The text  is  divided  into
five hundred samples of about two thousand words each, which
are  assigned . . .
										
