Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English

dc.contributor	Coltheart, Max School of Behavioural Science Macquarie University Sydney
dc.contributor.editor	Coltheart, Max
dc.contributor.editor	Kucera, Henry
dc.coverage.placeName	s.l.
dc.date.accessioned	2018-07-27
dc.date.accessioned	2022-08-21T16:24:42Z
dc.date.available	2022-08-21T16:24:42Z
dc.date.created	1961
dc.identifier	ota:0668
dc.identifier.uri	http://hdl.handle.net/20.500.14106/0668
dc.description.abstract	In English. “The corpus consists of approximately 1,014,000 graphic words of running text, all of which was first printed in the United States in the year 1961.” Frequency analysis of English usage. Publication based on OTA text: Computational analysis of present-day American English / by Henry Kuçera and W. Nelson Francis. -- Providence [RI] : Brown University, 1967. -- pp. xvii-xxv. Publication based on OTA text: Frequency analysis of English usage : lexicon and grammar / by Henry Kuçera and W. Nelson Francis. -- Boston : Houghton Mifflin, 1982. -- pp. 3-15. -- “Available with prior consent of depositor for research purposes only”. -- United States Office of Education. Cooperative Research Project No. E-007. -- OTA 0402
dc.format.extent	Text data (3 files : ca. 1095, 5, 8 KB)
dc.format.medium	Digital bitstream
dc.language	English
dc.language.iso	eng
dc.publisher	University of Oxford
dc.relation.ispartof	Oxford Text Archive Legacy Collection
dc.rights	Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.rights.label	PUB
dc.subject.lcsh	Computational linguistics -- Australia
dc.subject.lcsh	Anthologies -- United States
dc.subject.other	Anthologies
dc.title	Kuçera-Francis wordlist : [a] frequency count of the Brown corpus of present day American English
dc.type	Text
has.files	yes
branding	Oxford Text Archive
files.size	1128179
files.count	2
otaterms.date.range	1900-1999

This item is Publicly Available and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)

Files for this item

Name: kuceradat-0668.txt
Size: 1.07 MB
Format: Text file
Description: Version of the work in plain text format

File Preview

    1 01 001 .0044**K
    1 01 001 .01
    1 01 001 .020
    2 01 001 .027
    1 01 001 .028
    1 01 001 .05
    1 01 001 .05**K
    3 01 001 .07
    1 01 001 .076
    1 01 001 .09
    1 01 001 .1
    1 01 001 .130
    1 01 001 .143
    1 01 001 .179
   12 02 002 .22
    3 03 003 .22-CALIBER
    1 01 001 .222'S
    1 01 001 .243
    1 01 001 .255
    2 01 001 .264
    . . .
    4 01 002 .45
    1 01 001 .45-CALIBER
    1 01 001 .455
    2 01 001 .458
    2 02 002 .5
    1 01 001 .50
    1 01 001 .500
    1 01 001 .7
    1 01 001 .75
    1 01 001 .7854
    1 01 001 (*=A,B*$)
  139 12 039 +
    1 01 001 +.04
    1 01 001 +.50
    1 01 001 +.7
    1 01 001 +C
    1 01 001 $0.9
   1 . . .
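
The preview rows above appear to follow a fixed layout: three numeric columns followed by the item itself. As a minimal sketch, assuming the first column is the frequency count and that the item is everything after the third number, the rows could be read as below. The names `Entry`, `parse_line`, `col2` and `col3` are hypothetical; the preview does not say what the second and third columns mean, so they are kept as plain integers.

```python
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    count: int  # first column; ASSUMED to be the frequency of the item
    col2: int   # second column; meaning not stated in the preview
    col3: int   # third column; meaning not stated in the preview
    item: str   # the word or token, e.g. ".22-CALIBER" or "+"


def parse_line(line: str) -> Optional[Entry]:
    """Parse one row; return None for rows that do not fit the layout."""
    parts = line.split(None, 3)  # at most four whitespace-separated fields
    if len(parts) != 4:
        return None
    if not all(p.isdigit() for p in parts[:3]):
        return None  # e.g. the truncated "1 . . ." row at the end of the preview
    return Entry(int(parts[0]), int(parts[1]), int(parts[2]), parts[3].rstrip())


# A few rows copied from the preview, including the truncated last row.
sample = """\
    1 01 001 .0044**K
   12 02 002 .22
    3 03 003 .22-CALIBER
  139 12 039 +
   1 . . .
"""

entries: List[Entry] = []
for raw in sample.splitlines():
    entry = parse_line(raw)
    if entry is not None:
        entries.append(entry)

for e in entries:
    print(e.count, e.col2, e.col3, e.item)
```

Rows that do not match the layout, such as the truncated final row of the preview, are skipped rather than guessed at.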

Name: kuceradoc-0668.txt
Size: 7.49 KB
Format: Text file
Description: Version of the work in plain text format

File Preview

KUCERA (Kucera-Francis Word-frequency Count)

Notes provided by

    Roger Mitton, Dept of Computer Science,
    Birkbeck College, Malet Street,
    London WC1E 7HX

    November 1984

     KUCERA contains over 50,000 entries from the
Kucera-Francis Frequency Count of items in the corpus of
text collected at Brown University (commonly referred to as
the Brown Corpus). Details of the corpus are given in
'Computational Analysis of Present-day American English' by
Henry Kucera and W. Nelson Francis, Brown University Press,
1967, and also in 'Frequency Analysis of English Usage:
Lexicon and Grammar' by the same authors, published by
Houghton Mifflin, 1982. The following is from the latter
book:

     'The corpus consists of approximately 1,014,000 graphic
words of running text, all of which was first printed in the
United States in the year 1961. The text is divided into
five hundred samples of about two thousand words each, which
are assigned . . .
