Results: 9 resources
- English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a result, there are many varieties of English. Despite the many advances in English automatic speech recognition (ASR) over the past decades, results are usually reported based on test datasets which fail to represent the diversity of...
- DECTE is an amalgamation of the existing Newcastle Electronic Corpus of Tyneside English (NECTE), created between 2001 and 2005, and NECTE2, a collection of interviews conducted in the Tyneside area since 2007. It thereby constitutes a rare example of a publicly available on-line corpus presenting dialect material spanning five decades.
- The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer).
- This 3-year project investigates language change in five urban dialects of Northern England: Derby, Newcastle, York, Leeds and Manchester. Data collection method: linguistic analysis of speech data (conversational, word list) from samples of different northern English urban communities. Data collection consisted of interviews, which included (1) some structured questions about the interviewee...
- The MSP-Conversation corpus contains interactions annotated with time-continuous emotional traces for arousal (calm to active), valence (negative to positive), and dominance (weak to strong). Time-continuous annotations offer the flexibility to explore emotional displays at different temporal resolutions while leveraging contextual information. Release 1.0 contains 74 conversations with...
- These transcripts and video files are samples of Spanish and English caregiver (almost always mother)-child interaction collected at child ages 2½, 3, and 3½ years as part of a 10-year longitudinal study of the language and literacy development of U.S.-born children raised in Spanish-speaking homes. Each recording is approximately 30 minutes in length. The caregiver and target child are...
- The West Yorkshire Regional English Database (WYRED) consists of approximately 200 hours of high-quality audio recordings of 180 West Yorkshire (British English) speakers. All participants are male, aged between 18 and 30, and are divided evenly (60 per borough) across three boroughs within West Yorkshire (Northern England): Bradford, Kirklees, and Wakefield. Speakers participated in four...