Interactional Variation Online
dc.contributor.author | Knight, Dawn |
dc.contributor.author | O’Keeffe, Anne |
dc.contributor.author | Fitzgerald, Christopher |
dc.contributor.author | McNamara, Justin |
dc.contributor.author | Geraldine, Mark |
dc.contributor.author | Fahey Palma, Tania |
dc.contributor.author | Farr, Fiona |
dc.contributor.author | Cowan, Benjamin |
dc.contributor.author | Adolphs, Svenja |
dc.date.accessioned | 2024-10-04T11:07:56Z |
dc.date.available | 2024-10-04T11:07:56Z |
dc.date.issued | 2024-09-05 |
dc.identifier.uri | http://hdl.handle.net/20.500.14106/2572 |
dc.description | The IVO corpus is a collection of approx. 170,000 transcribed words of 19 recorded virtual meetings held between July 2021 and July 2022, itemised in the 'IVO_corpus_file details' file. Recordings vary in length, number of participants, and meeting type. The ‘IVO core meetings corpus’ comprises meetings 1-15. They include 15 recordings from four different institutional contexts, ranging from municipal council meetings (DCC), a non-governmental organisation promoting arts (NCoL), an academic conference organising committee (TaLC) and a state-of-the-art software development company (GitLab). Some of these meetings are hybrid (i.e. some participants are in the same location). The meetings are agenda-driven and can be defined as workplace interaction. There are four remaining meetings (16-19) which are more representative of interviews, training sessions or presentations than meetings and so are not included in the IVO core meetings corpus. The IVO project was co-led by Anne O’Keeffe (anne.keeffe@mic.ul.ie) at Mary Immaculate College (MIC), Limerick and Dawn Knight (KnightD5@cardiff.ac.uk), at the Centre for Language and Communication Research, Cardiff University. The full project team comprised: 2 Principal Investigators (PI – Anne O’Keeffe, Dawn Knight), 2 Co-Investigators (CIs – Svenja Adolphs, Benjamin Cowan, Tania Fahey-Palma, Fiona Farr, Sandrine Peraldi), 1 Postdoctoral Researcher and 2 Research Associates over the course of the project. In addition, there were 9 academic advisors https://ivohub.com/gallery/. The project was co-funded by AHRC and IRC. This data in this corpus has been anonymised using a combination of manual and automated techniques. In addition to transcriptions of speech, the IVO core meetings corpus is tagged for selected nonverbal features. These include annotations for backchannels (head nods and spoken) in the first and last five minutes, emblematic gestures and meaningful gestures for each visible participant - saved as .eaf files (which can be opened in ELAN - see: https://archive.mpi.nl/tla/elan). The extent to which each recording was annotated for these features is detailed in the IVO_corpus_file_details (i.e. this varies from one file to the next). Where more than one feature was annotated, these were assembled into a single combined .eaf file. All .eaf files of the IVO core meetings corpus can be opened/reused in ELAN. |
dc.language.iso | eng |
dc.publisher | Cardiff University |
dc.relation.ispartof | Oxford Text Archive Core Collection |
dc.relation.isreferencedby | https://doi.org/10.1075/ijcl.24060.kni |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://ivohub.com/ |
dc.subject | Linguistic corpus |
dc.title | Interactional Variation Online |
dc.title.alternative | IVO |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
hidden | |
hasMetadata | false |
has.files | yes |
branding | Oxford Text Archive Core Collection |
demo.uri | https://research-data.cardiff.ac.uk/articles/dataset/_b_Interactional_Variation_Online_b_harnessing_emerging_technologies_in_the_digital_humanities_to_analyse_online_discourse_in_different_workplace_contexts/26394130 |
contact.person | Dawn Knight knightD5@cardiff.ac.uk Cardiff University |
sponsor | Arts and Humanities Research Council and Irish Research Council AH/W001608/1 UK-Ireland Collaboration in the Digital Humanities Research Grants Call nationalFunds |
sponsor | Arts and Humanities Research Council AH/W001608/1 UK-Ireland Collaboration in the Digital Humanities Research Grants Call nationalFunds |
sponsor | Irish Research Council IRC/W001608/1 Interactional Variation Online nationalFunds |
size.info | 170000 words |
files.size | 5151236 |
files.count | 6 |
otaterms.date.range | 2000-present |
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
Files for this item
Download all local files for this item (4.91 MB)
- Name
- ivo_meetings.txt
- Size
- 985.9 KB
- Format
- Text file
- Description
- All of the transcripts (without timestamps) from the entire corpus in a single file (.txt). This can be uploaded to a digital concordancing tool for further exploration (e.g. Sketch Engine).
<doc id="file29632231" filename="_Auct_01_.txt" parent_folder="upload" Institution="Auct" Genre="non-meeting"> <s008> 0:00 [tech noise] [screen share start] [screen share stop] hi everyone [pause] group 0:18 hi hi [anon_$008] <s008> 0:20 just gonna share my screen there let me know when you can see it [screen share start] [tech noise] [pause] [screen share stop] okay so what we're gonna do this morning we're gonna go back to our pronunciations [pause] um because i think like we've had a number of new areas that we haven't been used to um so i think what i'll do is i'll start with the areas pronunciation and obviously if there's any that aren't on the list that you need help with obviously let me know and we can add them to the list and we can practice them and then if there's any names as well obviously we can go to those so i'll just share the um sheet there [screen share start] can can everyone see my screen [screen scroll] [pause] unknown 0:59 yes yes . . .
- Name
- ivo_meetingscore.txt
- Size
- 801.54 KB
- Format
- Text file
- Description
- Transcripts (without timestamps) from the 'core', corpus in a single file (.txt). This can be uploaded to a digital concordancing tool for further exploration (e.g. Sketch Engine)
<doc id="file28716633" filename="DCC_01.txt" parent_folder="upload"> <S031> arising from the minutes of the last meeting. Um and just to say as well, that we're being live s= that there's a recording in progress. Uh there's no motions today. Um is there a volunteer to be vice chair in case my broadband goes down? <S032> I can do if you if mine stays up. <S031> That's great. Thanks, [anon_$032]. Okay, so uh [anon_$042] we'll head to you for um an update on the draft [anon_PL1] development plan um 2022 to 2028. So you've the floor [anon_$042] whenever you're ready. [tech noise] Is [anon_$042] with us? [tech noise] <S033> she doesn't appear to be I'll run down the corridor and see where she is. <S031> Okay, good to see the remote working is over there your end as well. Um [anon_$034] can I go to to you for the local economic and community plan, which I know will really form a lot of our workload for the next couple of months. <S034> Yeah, happy to to take that item. . . .
- Name
- Transcription conventions.pdf
- Size
- 111.43 KB
- Format
- Description
- Description of the conventions following in transcribing the data.
- Name
- Youtube Links to GitLab Videos.pdf
- Size
- 13.85 KB
- Format
- Description
- List of links to the recordings on YouTube.
- Name
- IVO_corpus_file details.pdf
- Size
- 141.94 KB
- Format
- Description
- Metadata relating to the corpus files.
- Name
- IVO Corpus eaf_files.zip
- Size
- 2.91 MB
- Format
- application/zip
- Description
- Corpus files annotated in ELAN, which can be opened and reused in ELAN.
- IVO Corpus eaf_files
- Git2_combined.eaf-1 B
- Git3_emblems.eaf-1 B
- DCC3_combined.eaf-1 B
- DCC1_emblems.eaf-1 B
- NCoL2_emblems.eaf-1 B
- Git1_combined.eaf-1 B
- DCC4_emblems.eaf-1 B
- NCoL1_combined.eaf-1 B
- TaLC3_emblems.eaf-1 B
- NCoL3_emblems.eaf-1 B
- DCC2_combined.eaf-1 B
- NCoL4_combined.eaf-1 B
- Git4_emblems.eaf-1 B
- TaLC1_combined.eaf-1 B
- TaLC2_emblems.eaf-1 B