Confidence intervals for forced alignment with the Mason-Alberta Phonetic Segmenter

dc.contributor.authorKelley, Matthew C.
dc.date.accessioned2025-05-27T20:58:22Z
dc.date.issued2025-05
dc.description.abstractForced alignment is a common tool in experimental phonetics to align audio with orthographic and phonetic transcriptions. Phonetic segmentation is not a straightforward process, however, and boundaries between phonetic segments cannot be easily determined. Most forced alignment tools provide a single estimate of a boundary based on conditional probabilities of segment categories given some acoustic data. The present project introduces a method of deriving confidence intervals for these boundaries using a neural network ensemble technique with the Mason-Alberta Phonetic Segmenter. Ten different segment classifier neural networks were previously trained, and the alignment process is repeated with each model. The alignment ensemble is then used to place the boundary at the median of the time points, and 97.85% confidence intervals are constructed using order statistics. On the Buckeye and TIMIT corpora, the ensemble boundaries show a slight improvement over using just a single model. The confidence intervals are incorporated into Praat TextGrids using a point tier, and they are also output as a table for researchers to analyze separately.
dc.identifier.citationKelley, M. C. (2025, May 19-23). Confidence intervals for forced alignment with the Mason-Alberta Phonetic Segmenter [conference presentation]. The 188th Meeting of the Acoustical Society of America, New Orleans, LA, USA.
dc.identifier.urihttps://hdl.handle.net/1920/14591
dc.identifier.urihttps://doi.org/10.13021/MARS/14868
dc.language.isoen
dc.rightsAttribution 4.0 Internationalen
dc.rightsCopyright 2025 Matthew C. Kelley
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectphonetics
dc.subjectforced alignment
dc.subjectautomatic speech recognition
dc.titleConfidence intervals for forced alignment with the Mason-Alberta Phonetic Segmenter
dc.typePresentation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
asa25_1_kelley_2025_maps_cis_0.pdf
Size:
638.7 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: