DANJONES (Daniel Jones English Pronouncing Dictionary) Notes provided by Roger Mitton, Dept of Computer Science, Birkbeck College, Malet Street, London WC1E 7HX November 1984 DANJONES contains over 34,000 entries from the English Pronouncing Dictionary by Daniel Jones, 12th edition, Everyman, Dent, London 1963. The keying-in was arranged by Professor L. Guierre of the Sorbonne, and the work is described (in French) in 'Etudes de Linguistique Appliquee', 1966, Vol 4, pp 48-64. The version described here is much the same as described by Guierre, except for the character set used in the pronunciation-field, as I'll explain later. The file contains all the entries in the book except for most of the proper nouns, though adjectives of nationality and suchlike have been retained. Each line contains one entry of 101 characters. Columns 1-20 contain the spelling, in capital letters, including apostrophes and hyphens. An open bracket '(' after an E indicates an acute accent. An 8 at the end of a word indicates that the spelling included some exotic symbol, such as a tilde (~) or a circumflex accent (^). Verbs ending in IZE are transcribed 'IZE 9' when the spelling ISE is also used. Columns 21-42 contain the pronunciation, except for stress and syllable boundaries. The pronunciation is what Daniel Jones in 1963 regarded as the Received Pronunciation. Users of this material should bear in mind that RP is spoken by only a small minority of speakers of English, and that RP itself, and indeed the concept of RP, have changed somewhat since 1963 - see the introduction to the most recent edition of the English Pronouncing Dictionary (the 14th) by A.C. Gimson. Obviously, the character set available on most computers does not include many of the symbols of the phonetic alphabet, so the following have been used: a as in 'fat' a: as in 'father' e as in 'get' i as in 'it' i: as in 'see' o as in 'hot' o: as in 'saw' u as in 'put' u: as in 'too' ^ as in 'up' @ as the 'a' in 'about' @: as in 'bird' E@ as the 'ere' in 'where' Ou as the 'o' in 'holy' I in the 'ious' of 'inglorious' U in the 'uous' of 'ingenuous' g as in 'give' 9 as the 'ng' of 'sing' 8 as the 'th' of 'thin' 6 as the 'th' of 'then' S as the 'sh' of 'ship' 3 as the 's' of 'measure' j as the 'y' of 'yes' tS as in 'chin' d3 as in 'jam' L to distinguish between 'flannelly' and 'Hanley' N to distinguish between 'personable' and 'parsnip' These characters differ from those originally used by Guierre. Working in the 1960's, he was obliged to use the even more limited set of characters available on a card-punch. I have recoded the characters so as to avoid the use of two characters to represent just one phonetic symbol, to use characters that resemble more closely the phonetic symbols and generally to make the pronunciation-field more readable. A closing bracket ')' indicates that the preceding symbol was either in italics or in brackets in the book. In the case of i:) it means that the word may be pronounced with either i or i:, and likewise for u:) and @:). After other symbols, the closing bracket indicates that the preceding phoneme is often omitted. A hyphen in the pronunciation field indicates the subtle difference between, for example, 'selfish' (self-iS) and 'shellfish' (Sel-fiS), or 'nutshell' (n^t-Sel) and 'Mitchell' (mitSel). The symbol Y for the pronunciation means that the pronunciation is not capable of being represented by the above symbols, eg 'genre'; there are only twelve of these in the file. Column 43 may contain a *, which indicates that the word may be followed by an r when followed by a word beginning with a vowel; for example, 'far' is pronounced fa: in 'far better', but fa:r in 'far away' (a phenomenon known as 'r-linking'). Column 44 may contain a Q, which indicates that the word is derived from one of the other words in the corpus by adding one of the following suffixes: -er, -or, -ment, -able, -ist, -ent such as 'abatement' from 'abate'. Columns 45-60 contain a rather complicated code. The first part of it contains one or more of the letters N, Q, V or A, which indicates that the word is a Noun (N), Adjective (Q), Verb (V) or Other (A). (Q for Adjective and A for Other make more sense in French.) This may be followed by a string of letters and numbers which indicate how many variant pronunciations of the word exist, and what sort of variants they are (eg stress variants, phonematic variants), though they do not tell you what the variant pronunciations actually are. See Guierre's paper for details. Columns 61-69 give the stress 'skeleton' of the word, with 0 for unstressed syllable, 1 for secondary stress and 2 for primary stress; for instance, 'abbreviation' is 01020. Finally, columns 70 to 101 contain the same pronunciation as columns 21-42, with the addition of stress and syllable markers. These are slashes ('/') to indicate syllable boundaries, and commas and apostrophes to indicate stress - primary stress on the syllable following the apostrophe and secondary stress on the syllable following First 50 lines, then 10 lines every 3000, then the last 50. Line 1 'EM @m A1XCJ 0 @m 'FLU flu: N 0 flu: 'NEATH ni:8 A 0 ni:8 'SBLOOD zbl^d A 0 zbl^d 'SDEATH zde8 A 0 zde8 'TIS tiz V 0 tiz 'TWAS twoz V1X 0 twoz 'TWEEN twi:n A 0 twi:n 'TWERE tw@: * V2X 0 tw@: 'TWIXT twikst A 0 twikst -SHIRE Si@ * A1X 0 Si@ A ei A1XJ 0 ei A FORTIORI eifo:tio:rai A4X 21020 'ei/,fo:/ti/'o:/rai A POSTERIORI ei-posterio:rai A3X 201020 'ei/-pos/,te/ri/'o:/rai A PRIORI ei-praio:rai A1X 2020 'ei/-prai/'o:/rai A.D. eidi: A 22 'ei/'di: A.M. eiem AGD 22 'ei/'em AARDVARK a:dva:k N 20 'a:d/va:k AB INITIO abiniSiOu A1X 10200 ,a/bi/'ni/Si/Ou ABACK @bak A 02 @/'bak ABACUS ab@k@s N 200 'a/b@/k@s ABAFT @ba:ft A 02 @/'ba:ft ABANDON @band@n V 020 @/'ban/d@n ABANDON @band@n NH 020 @/'ban/d@n ABANDONMENT @band@nm@nt QN 0200 @/'ban/d@n/m@nt ABASE @beis V 02 @/'beis ABASEMENT @beism@nt QN 020 @/'beis/m@nt ABASH @baS V 02 @/'baS ABATABLE @beit@bl QQ 0200 @/'bei/t@/bl ABATE @beit V 02 @/'beit ABATEMENT @beitm@nt QN 020 @/'beit/m@nt ABATIS ab@ti:z NZ 200 'a/b@/ti:z ABATIS ab@tis NY1X 200 'a/b@/tis ABATISES ab@tisiz NZ 2000 'a/b@/ti/siz ABATTIS @batis N 020 @/'ba/tis ABATTOIR ab@twa: * N1X 200 'a/b@/twa: ABBACY ab@si N 200 'a/b@/si ABBE( abei NH 20 'a/bei ABBESS abis N1X 20 'a/bis ABBEY abi N 20 'a/bi ABBOT ab@t N 20 'a/b@t ABBOTSHIP ab@t-Sip N 200 'a/b@t/-Sip ABBREVIATE @bri:vieit V1XIC 0200 @/'bri:/vi/eit ABBREVIATION @bri:vieiS@)n N 01020 @/,bri:/vi/'ei/S@)n ABBREVIATOR @bri:vieit@ *QN1XIC 02000 @/'bri:/vi/ei/t@ ABBREVIATORY @bri:vj@t@ri Q2XIC 02000 @/'bri:/vj@/t@/ri ABC eibi:si: N 202 'ei/bi:/'si: ABDICANT abdik@nt N 200 'ab/di/k@nt ABDICATE abdikeit V 200 'ab/di/keit ABDICATION abdikeiS@)n N 1020 ,ab/di/'kei/S@)n Line 3001 BEHAVIOURISM biheivj@riz@)m N1X 02000 bi/'hei/vj@/ri/z@)m BEHEAD bihed V1X 02 bi/'hed BEHELD biheld V1X 02 bi/'held BEHEMOTH bihi:mo8 NS 020 bi/'hi:/mo8 BEHEST bihest N1X 02 bi/'hest BEHIND bihaind A1X 02 bi/'haind BEHINDHAND bihaindhand AQ1X 020 bi/'hain/dhand BEHOLD bihOuld V1X 02 bi/'hOuld BEHOLDEN bihOuld@)n Q1X 020 bi/'hOul/d@)n BEHOLDER bihOuld@ *QN1X 020 bi/'hOul/d@ Line 6001 COMMINATION komineiS@)n N 1020 ,ko/mi/'nei/S@)n COMMINATORY komin@t@ri Q1X 20000 'ko/mi/n@/t@/ri COMMINGLE komi9gl V1X 020 ko/'mi9/gl COMMINUTE kominju:t V 200 'ko/mi/nju:t COMMINUTION kominju:S@)n N 1020 ,ko/mi/'nju:/S@)n COMMISARY komis@ri NS 2000 'ko/mi/s@/ri COMMISERATE k@miz@reit V1X 0200 k@/'mi/z@/reit COMMISERATION k@miz@reiS@)n N1X 01020 k@/,mi/z@/'rei/S@)n COMMISSAR komisa: * NPG 102 ,ko/mi/'sa: COMMISSARIAL komisE@rI@l Q 1020 ,ko/mi/'sE@/rI@l Line 9001 DISINTEGRATOR disintigreit@ *QN 02000 dis/'in/ti/grei/t@ DISINTER disint@: * VLG 202 'di/sin/'t@: DISINTERESTED disintristid Q2XCGDXC 0200 dis/'in/tris/tid DISINTERMENT disint@:m@nt NLG 1020 ,di/sin/'t@:/m@nt DISJOIN disd3oin V 02 dis/'d3oin DISJOINT disd3oint V 02 dis/'d3oint DISJUNCT disd3^9k)t Q 02 dis/'d3^9k)t DISJUNCTION disd3^9k)S@)n N 020 dis/'d3^9/k)S@)n DISJUNCTIVE disd3^9k)tiv QGD 020 dis/'d3^9/k)tiv DISK disk N 0 disk Line 12001 FIND faind VN 0 faind FINDER faind@ *QN 20 'fain/d@ FINE fain NVQ 0 fain FINERY fain@ri N 200 'fai/n@/ri FINESSE fines NV 02 fi/'nes FINGER fi9g@ * NV 20 'fi9/g@ FINIAL fainI@l N1X 20 'fai/nI@l FINICAL finik@)l Q 200 'fi/ni/k@)l FINICKING finiki9 Q 200 'fi/ni/ki9 FINICKY finiki Q 200 'fi/ni/ki Line 15001 HYDRARGYRUM haidra:d3ir@m N 0200 hai/'dra:/d3i/r@m HYDRATE haidreit N1X 20 'hai/dreit HYDRO haidrOu N 20 'hai/drOu HYDROCARBON haidrOu)ka:b@n N1X 2020 'hai/drOu)/'ka:/b@n HYDROCEPHALUS haidrOu)sef@l@s N3XLGX 20200 'hai/drOu)/'se/f@/l@s HYDROCHLORIC haidr@klorik Q3XLPXLGXJ 2020 'hai/dr@/'klo/rik HYDRODYNAMIC haidrOu)dainamik Q3X 20020 'hai/drOu)/dai/'na/mik HYDRODYNAMICAL haidrOu)dainamik@)l QQ3X 200200 'hai/drOu)/dai/'na/mi/k@)l HYDRODYNAMICS haidrOu)dainamiks QN3X 20020 'hai/drOu)/dai/'na/miks Line 18001 LIGHTMINDED laitmaindid QLP 220 'lait/'main/did LIGHTNING laitni9 N 20 'lait/ni9 LIGHTSHIP lait-Sip N 20 'lait/-Sip LIGHTSOME laits@m Q 20 'lait/s@m LIGNEOUS lignI@s Q1XI 20 'lig/nI@s LIGNITE lignait N 20 'lig/nait LIGNUM lign@m N 20 'lig/n@m LIGURIAN ligju@rI@n NQ3X 020 li/'gju@/rI@n LIKABLE laik@bl QQ 200 'lai/k@/bl LIKE laik NQV 0 laik Line 21001 ONTOLOGICAL ont@lod3ik@)l QQ1X 10200 ,on/t@/'lo/d3i/k@)l ONTOLOGIST ontol@d3ist N 0200 on/'to/l@/d3ist ONTOLOGY ontol@d3i N 0200 on/'to/l@/d3i ONUS Oun@s N 20 'Ou/n@s ONWARD onw@d AQ 20 'o/nw@d ONYX oniks N1X 20 'o/niks OOF u:f N 0 u:f OOLITE Ou)@lait N1X 200 'Ou)/@/lait OOLITIC Ou)@litik Q1X 1020 ,Ou)/@/'li/tik OOLOGIST Ou)ol@d3ist N 0200 Ou)/'o/l@/d3ist Line 24001 PROGENITURE prOu)d3enitS@ * N5X 0200 prOu)/'d3e/ni/tS@ PROGENY prod3ini N1X 200 'pro/d3i/ni PROGNATHIC progna8ik Q 020 prog/'na/8ik PROGNATHOUS prognei8@s QS 020 prog/'nei/8@s PROGNATISM progn@8iz@)m NS 2000 'prog/n@/8i/z@)m PROGNOSES prognOusi:z NZ 020 prog/'nOu/si:z PROGNOSIS prognOusis NY 020 prog/'nOu/sis PROGNOSTICATE pr@gnostikeit V1X 0200 pr@g/'nos/ti/keit PROGNOSTICATION pr@gnostikeiS@)n N1X 01020 pr@g/,nos/ti/'kei/S@)n PROGRAM prOugram N 20 'prOu/gram Line 27001 SENTENTIOUS sentenS@s Q1X 020 sen/'ten/S@s SENTIENCE senS@)ns N2XI 20 'sen/S@)ns SENTIENT senS@)nt QQ2XI 20 'sen/S@)nt SENTIMENT sentim@nt N 200 'sen/ti/m@nt SENTIMENTAL sentimentl Q 1020 ,sen/ti/'men/tl SENTIMENTALISM sentiment@liz@)m N1XC 102000 ,sen/ti/'men/t@/li/z@)m SENTIMENTALITY sentimentaliti N1X 100200 ,sen/ti/men/'ta/li/ti SENTINEL sentinL N 200 'sen/ti/nL SENTRY sentri N 20 'sen/tri SENZA sents@ A 20 'sen/ts@ Line 30001 TERRITORY terit@)ri N 2000 'te/ri/t@)/ri TERROR ter@ * N 20 'te/r@ TERRORISER ter@raiz@ *QN 2000 'te/r@/rai/z@ TERRORISM ter@riz@)m N 2000 'te/r@/ri/z@)m TERRORIST ter@rist QN 200 'te/r@/rist TERRORIZATION ter@raizeiS@)n N1XLGX 10020 ,te/r@/rai/'zei/S@)n TERRORIZE 9 ter@raiz V 200 'te/r@/raiz TERRORIZER ter@raiz@ *QN 2000 'te/r@/rai/z@ TERSE t@:s Q 0 t@:s TERTIAN t@:S@)n NQ2XI 20 't@:/S@)n Line 33001 VESTA vest@ N 20 'ves/t@ VESTAL vestl NQ 20 'ves/tl VESTIBULAR vestibjul@ * Q 0200 ves/'ti/bju/l@ VESTIBULE vestibju:l N 200 'ves/ti/bju:l VESTIGE vestid3 N 20 'ves/tid3 VESTIGIAL vestid3I@l Q1XI 020 ves/'ti/d3I@l VESTITURE vestitS@ * N 200 'ves/ti/tS@ VESTMENT vest)m@nt N 20 'ves/t)m@nt VESTRY vestri N 20 'ves/tri VESTURE vestS@ * N 20 'ves/tS@ Line 34133 ZINCKY zi9ki Q 20 'zi9/ki ZINCO zi9kOu N 20 'zi9/kOu ZINCOGRAPH zi9kOu)gra:f N3X 200 'zi9/kOu)/gra:f ZINGARI zi9g@ri:) NZ 200 'zi9/g@/ri:) ZINGARO zi9g@rOu NY 200 'zi9/g@/rOu ZINKY zi9ki Q 20 'zi9/ki ZINNIA zinj@ N1XI 20 'zi/nj@ ZIONISM zai@niz@)m N 2000 'zai/@/ni/z@)m ZIONIST zai@nist QN 200 'zai/@/nist ZIONWARD zai@nw@d QA 200 'zai/@/nw@d ZIONWARDS zai@nw@dz A 200 'zai/@/nw@dz ZIP zip N 0 zip ZIPPER zip@ * N 20 'zi/p@ ZIRCONIUM z@:kOunj@m N1XI 020 z@:/'kOu/nj@m ZITHER zi6@ * N 20 'zi/6@ ZLOTY zloti N 20 'zlo/ti ZOA zOu@ NZ1X 20 'zOu/@ ZODIAC zOudiak N1XIC 200 'zOu/di/ak ZODIACAL zOu)dai@k@)l Q 0200 zOu)/'dai/@/k@)l ZOETROPE zOu)itrOup N 200 'zOu)/i/trOup ZOIC zOu)ik Q 20 'zOu)/ik ZOLLVEREIN tsol-f@rain N1XCH 200 'tsol/-f@/rain ZOMBIE zombi N 20 'zom/bi ZONAL zOunL Q1XC 20 'zOu/nL ZONE zOun NV 0 zOun ZOO zu: N 0 zu: ZOOGRAPHER zOu)ogr@f@ * N 0200 zOu)/'o/gr@/f@ ZOOGRAPHY zOu)ogr@fi N 0200 zOu)/'o/gr@/fi ZOOKS zu:ks A 0 zu:ks ZOOLITE zOu)@lait N1X 200 'zOu)/@/lait ZOOLOGIC zOu)@lod3ik QGPX 1020 ,zOu)/@/'lo/d3ik ZOOLOGICAL zOu)@lod3ik@)l QQGPX 10200 ,zOu)/@/'lo/d3i/k@)l ZOOLOGIST zOu)ol@d3ist N1X 0200 zOu)/'o/l@/d3ist ZOOLOGY zOu)ol@d3i N1X 0200 zOu)/'o/l@/d3i ZOOMORPHIC zOuOu)mo:fik Q1X 1020 ,zOu/Ou)/'mo:/fik ZOON zOuon NY 20 'zOu/on ZOOPHYTE zOu)@fait N1X 200 'zOu)/@/fait ZOOT zu:t N 0 zu:t ZOOTOMIST zOu)ot@mist N 0200 zOu)/'o/t@/mist ZOOTOMY zOu)ot@mi N 0200 zOu)/'o/t@/mi ZORIL zoril N 20 'zo/ril ZOROASTRIAN zorOu)astrI@n NQ 1020 ,zo/rOu)/'as/trI@n ZOROASTRIANISM zorOu)astrI@niz@)m N 102000 ,zo/rOu)/'as/trI@/ni/z@)m ZOUAVE zu:)a:v NSXC 02 zu:)/'a:v ZULU zu:lu: N 20 'zu:/lu: ZWIEBACK zwi:bak N2XC 20 'zwi:/bak ZYGOMA zaigOum@ NY1X 020 zai/'gOu/m@ ZYGOMATA zaigOum@t@ NZ1X 0200 zai/'gOu/m@/t@ ZYMOSIS zaimOusis N1X 020 zai/'mOu/sis ZYMOTIC zaimotik Q1X 020 zai/'mo/tik Last line was line 34182.