1) Mietta’s Praat scripts: http://www.helsinki.fi/~lennes/praat-scripts/
The aim of the Speech Corpus Toolkit (SpeCT) is to provide an organized inventory of well-documented Praat scripts that can be easily downloaded, modified and used in order to perform small tasks during the various stages of building, organizing, annotating, analysing, searching and exporting data from a speech corpus.
2) Praat vocal toolkit: http://www.praatvocaltoolkit.com/
Vocal Toolkit is a free plugin for Praat with automated scripts for voice processing. You can manipulate duration, intensity, pitch etc. with a simple click
3) AutoVOT: https://github.com/mlml/autovot/releases
Autovot is a software package for automatic measurement of voice onset time (VOT), using an algorithm which is trained to mimic VOT measurement by human annotators.
1.Ingmar Steiner:Automatic Speech Data Processing with Praat1 Lecture Notes
6) CPrAN plugin manager:http://cpran.net/
EasyAlign is a user-friendly automatic phonetic alignment tool for continuous speech under Praat. It is possible to align speech from an orthographic or phonetic transcription. It requires a few minor manual steps and the result is a multi-level annotation within a TextGrid composed of phonetic, syllabic, lexical and utterance tiers as below.
The Prosodylab-Aligner is a set of Python and shell scripts for performing automated alignment of text to audio of speech using Hidden Markov Models developed in our lab by Kyle Gorman. It is designed to be easy to use as possible, and especially for use with data elicited in a laboratory setting. While it ships with pre-trained North American English monophone models based on data collected in our lab, it also supports training on arbitrary data.
9)The Penn Phonetics Lab Forced Aligner:https://www.ling.upenn.edu/phonetics/old_website_2015/p2fa/
The Penn Phonetics Lab Forced Aligner is an automatic phonetic alignment toolkit based on HTK.
10) ALISA: http://simple4all.org/product/alisa/
ALISA uses a two step approach for the task of aligning speech with imperfect transcripts: 1) sentence-level speech segmentation and 2) sentence-level speech and text alignment. Both processes are fully automated and require as little as 10 minutes of manually labelled speech: inter-sentence silence segments for the segmentation, and orthographic transcripts of these sentences for the aligner.
The tool can be applied to any language with an alphabetic writing system and can align up to 75% of the original data with a sentence error rate of less then 8% and a word error rate of less than 1%.
compatibility: Linux/OS X
12) Prosogram: http://bach.arts.kuleuven.be/pmertens/prosogram/
Prosogram is a tool for the analysis and transcription of pitch variations in speech. Its stylization simulates the auditory perception of pitch by the listener. A key element in tonal perception is the segmentation of speech into syllable-sized elements, resulting from changes in the spectrum (sound timbre) and intensity. The tool also provides measurements of prosodic features for individual syllables (such a duration, pitch, pitch movement direction and size), as well as prosodic properties of longer stretches of speech (such as speech rate, proportion of silent pauses, pitch range, and pitch trajectory). The tool can easily interact with other software tools. It is used as the first step in automatic phonological transcription of intonation, the detection of sentence stress and intonation boundaries.