CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes – the National Corpus of Contemporary Welsh
dc.contributor.author | Dawn Knight |
dc.coverage.placeName | Wales |
dc.date.accessioned | 2020-11-03 |
dc.date.accessioned | 2023-06-22T11:29:44Z |
dc.date.available | 2023-06-22T11:29:44Z |
dc.date.created | 2020 |
dc.date.issued | 2020-11-01 |
dc.identifier | ota:2564 |
dc.identifier.uri | http://hdl.handle.net/20.500.14106/2564 |
dc.description.abstract | The CorCenCC corpus contains over 11 million words (circa 14.4m tokens) from written, spoken and electronic (online, digital texts) Welsh language sources, taken from a range of genres, language varieties (regional and social) and contexts. The contributors to CorCenCC are representative of the over half a million Welsh speakers in the country. The creation of CorCenCC was a community-driven project, which offered users of Welsh an opportunity to be proactive in contributing to a Welsh language resource that reflects how Welsh is currently used. |
dc.description.sponsorship | Arts and Humanities Research Council |
dc.description.sponsorship | Economic and Social Research Council |
dc.format.extent | 14.4m tokens |
dc.format.medium | Digital bitstream |
dc.language | Welsh |
dc.language.iso | cym |
dc.publisher | University of Oxford |
dc.relation.ispartof | Oxford Text Archive Core Collection |
dc.relation.uri | http://doi.org/10.17035/d.2020.0119878310 |
dc.rights | Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.rights.label | PUB |
dc.subject.lcsh | Linguistics |
dc.subject.lcsh | Linguistics analysis (Linguistics) |
dc.subject.other | Linguistic corpora |
dc.subject.other | Speech--Research |
dc.title | CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes – the National Corpus of Contemporary Welsh |
dc.type | Corpus |
has.files | yes |
branding | Literary and Linguistic Data Service |
branding | Oxford Text Archive |
files.size | 50593 |
files.count | 1 |
relation.uri | http://doi.org/10.17035/d.2020.0119878310 |
otaterms.date.range | 2000-present |
This item is
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Files for this item

- Name
- d.2020.0119878310
- Format
- unknown
- Description
- Note
- This file is hosted on an external server
- URI
- http://doi.org/10.17035/d.2020.0119878310