Results 7 resources
- DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.
- The MSP-AVW is an audiovisual whisper corpus for audiovisual speech recognition purposes. The MSP-AVW corpus contains data from 20 female and 20 male speakers. For each subject, three sessions are recorded, consisting of read sentences, isolated digits and spontaneous speech. The data is recorded under neutral and whisper conditions. The corpus was collected in a 13ft x 13ft ASHA certified...
- This 3-year project investigates language change in five urban dialects of Northern England: Derby, Newcastle, York, Leeds and Manchester. Data collection method: Linguistic analysis of speech data (conversational, word list) from samples of different northern English urban communities. Data collection consisted of interviews, which included (1) some structured questions about the interviewee...
- Multi-laboratory evaluation of forensic voice comparison systems under conditions reflecting those of a real forensic case. There is increasing pressure on forensic laboratories to validate the performance of forensic analysis systems before they are used to assess strength of evidence for presentation in court (including pressure from the recently released report by the President’s Council...
- Forensic database of voice recordings of 500+ Australian English speakers (AusEng 500+). This database contains 3899 recordings totalling 310 hours of speech from 555 Australian-English speakers. 324 female speakers:
  - 91 recorded in one recording session
  - 69 recorded in two separate recording sessions
  - 159 recorded in three recording sessions
  - 5 recorded in more than three recording...
- The Voices Obscured in Complex Environmental Settings (VOiCES) corpus is a Creative Commons speech dataset targeting acoustically challenging and reverberant environments with robust labels and truth data for transcription, denoising, and speaker identification. This is one of the largest corpora to date that has transcriptions and simultaneously recorded real-world noise. The details: -...
Explore
Audio
- Language
- Accent/Region (4)
- Australian English (2)
- British English (2)
- Conversation (3)
- Forensic (3)
- Multi-Speaker (5)
- Multi-Style (1)
- Speech in Noise (2)
Speech Production & Articulation
- Video (1)
Tags
- male
- English (6)
- audio data (5)
- female (5)
- adult (4)
- forensic (3)
- interview (3)
- read speech (3)
- telephone (2)
- transcribed (2)
- Australian (2)
- conversation (2)
- spontaneous speech (2)
- Newcastle (2)
- Southern standard British English (SSBE) (1)
- map task (1)
- environmental noise (1)
- noisy audio (1)
- reverberation (1)
- British (1)
- Derby (1)
- English accents (1)
- Leeds (1)
- Manchester (1)
- York (1)
- audiovisual (1)
- digits (1)
- video (1)
- whisper (1)
- British English (1)
- phonetic labels (1)