Praat Scripts and Plugins


1) Mietta’s Praat scripts:

The aim of the Speech Corpus Toolkit (SpeCT) is to provide an organized inventory of well-documented Praat scripts that can be easily downloaded, modified and used in order to perform small tasks during the various stages of building, organizing, annotating, analysing, searching and exporting data from a speech corpus.

2) Praat vocal toolkit:

Vocal Toolkit is a free plugin for Praat with automated scripts for voice processing. You can manipulate duration, intensity, pitch etc. with a simple click

3) AutoVOT:

Autovot is a software package for automatic measurement of voice onset time (VOT), using an algorithm which is trained to mimic VOT measurement by human annotators.

4) Tutorials

1.Ingmar Steiner:Automatic Speech Data Processing with Praat1 Lecture Notes

2. Praat Tutorial: (a. Ohio State U. b.Stanford c. A Mandarin Version 中文版 d. Colorado at Boulder Praat Handbook e.OT theory in Praat f.UToronto)

5) Prosody Cloning

6) CPrAN plugin manager:


7) EasyAlign:

EasyAlign is a user-friendly automatic phonetic alignment tool for continuous speech under Praat. It is possible to align speech from an orthographic or phonetic transcription. It requires a few minor manual steps and the result is a multi-level annotation within a TextGrid composed of phonetic, syllabic, lexical and utterance tiers as below.

Compatibility: Windows

8) Prosodylab-Aligner:

The Prosodylab-Aligner is a set of Python and shell scripts for performing automated alignment of text to audio of speech using Hidden Markov Models developed in our lab by Kyle Gorman. It is designed to be easy to use as possible, and especially for use with data elicited in a laboratory setting. While it ships with pre-trained North American English monophone models based on data collected in our lab, it also supports training on arbitrary data.

9)The Penn Phonetics Lab Forced Aligner:

The Penn Phonetics Lab Forced Aligner is an automatic phonetic alignment toolkit based on HTK.

10) ALISA:

ALISA uses a two step approach for the task of aligning speech with imperfect transcripts: 1) sentence-level speech segmentation and 2) sentence-level speech and text alignment. Both processes are fully automated and require as little as 10 minutes of manually labelled speech: inter-sentence silence segments for the segmentation, and orthographic transcripts of these sentences for the aligner.

The tool can be applied to any language with an alphabetic writing system and can align up to 75% of the original data with a sentence error rate of less then 8% and a word error rate of less than 1%.

compatibility: Linux/OS X

11) MAUS :

12) Prosogram

Prosogram is a tool for the analysis and transcription of pitch variations in speech. Its stylization simulates the auditory perception of pitch by the listener. A key element in tonal perception is the segmentation of speech into syllable-sized elements, resulting from changes in the spectrum (sound timbre) and intensity. The tool also provides measurements of prosodic features for individual syllables (such a duration, pitch, pitch movement direction and size), as well as prosodic properties of longer stretches of speech (such as speech rate, proportion of silent pauses, pitch range, and pitch trajectory). The tool can easily interact with other software tools. It is used as the first step in automatic phonological transcription of intonation, the detection of sentence stress and intonation boundaries.

Adding Looping method to existing Praat Scripts

There are many great praat scripts out there (here’s my collection).

Mietta’s Praat scripts ( ), for example, are particularly useful for doing various things with praat.

However, one problem I’m constantly facing is that Mietta’s scripts usually work with 1 sound file at a time. I have to import sound files one by one, and then run the scripts. For example, if you want to cut a long sound file into several pieces, you have to import the sound file and the corresponding textgrid file to praat, and then run the script. Only after that, you can start cutting a second sound file. It was fine when there are only a couple of sound files. But now I am working on hundreds of files (827 to be precise). That means, I have to import sound files 827 times, and run the script 827 times. It is time-consuming.

I therefore added some codes to Mietta’s save_intervals_to_wav_sound_files.praat, so that the script will automatically look for all the sound files and their corresponding textgrids in one folder, and cut them.

Here’s the original script:

Here’s the edited one:

My Posts on Nantong Chinese

Here are my posts about my native language: Nantong Chinese.

Nantong Chinese is just a side-project of mine, and the articles I wrote are less than academic. However, they could be the first step toward more rigorous academic inquires. Although I wrote the following posts purely for fun, I did provide references whenever other people’s views were used.  The Language Attitude Survey was done with my collaborators in China.

Please let me know if you have any suggestions. Thank you!

Note: the following posts are also on the Nantong Dialect Web (南通方言网).


  1. Why do Nantong People Prefer “Eating” Tea?
  2. The Speech Act of Sentence-Final Particles in Imperative Mood.
  3. The Relationship between Tone Shape and Tone Length in Nantong Chinese.
  4. The Tone Shape of Nantong Chinese–an SSANOVA Analysis
  5. Bibliographies of Nantong Chinese Research 

Language Attitude Survey

  1. Who is still speaking Nantong Chinese (Report 1)
  2. How frequent is Nantong Chinese used in everyday life (Report 2) 
  3. The general public’s attitude toward Nantong Chinese (Report 3)


note: I’ve been writing short articles for a website called 南通方言网 (Nantong Dialect Web). The following is one of them. Since the target audience is Chinese speakers, I wrote all the articles in Chinese. I am re-posting them here just to keep a record. For my fellow Nantong people out there, please let me know what you think. Thank you!

This page shows you all my posts about nantong Chinese





说到“吃茶”不能不提赵州禅师从谂的一段公案。赵州从谂禅师(778年-897年),俗姓郝,唐代著名高僧。曾有两位僧人从远方来到赵州,向赵州禅师请教如何是禅。赵州禅师问其中的一个,“你以前来过吗?”那个人回答:“没有来过。”赵州禅师说:“吃茶去!” 赵州禅师转向另一个僧人,问:“你来过吗?”这个僧人说:“我曾经来过。”赵州禅师说:“吃茶去!”这时,引领那两个僧人到赵州禅师身边来的监院就好奇地问:“禅师,怎么来过的你让他吃茶去,未曾来过的你也让他吃茶去呢?” 赵州禅师称呼了监院的名字,监院答应了一声,赵州禅师说:“吃茶去!” (《五灯会元》卷四

赵州禅师的一个“吃茶去”体现了佛家的淡泊,禅机无限,我们恐难完全体会。但是从他用的这个“吃“字,我们可以知道 南通话里的“吃茶”一词不是南通人的独创,而是古代汉语的孑遗。




1. sa, 例:你吃sa
2. sei 例:你吃sei
3. bau 例:你吃bau

这个几个语气助词有不同的言语行为(speech act), 表示不同的意思。我提出一点感性的认识,请大家指正。
1. “sa” 表示命令而不是请求。
2. “sei” 表示请求而不是命令。
b)句是一个探讨请求的口吻。请人帮忙,用sei 显得比较恰当。”你帮帮我的忙sei” 表示了请求。而“你帮帮我的忙sa” 就带有了命令的口吻,仿佛别人欠你的,应该帮你的忙。

3. ”bau” 是表示建议,建议某人做某事。“你过来bau” 表示的是建议别人过来,而不是请求或命令别人过来。


我想我们是不是可以通过加入词的方式来辨别助词的语用 ( 下文 *号 标注的句子与我个人的语感不合)
1. 表建议,加“建议”
a) * 我建议你还是来sa
b) * 我建议你还是来sei
c) 我建议你还是来bau
这里只有1c) 读起来顺,其他两个都不顺。说明建议的语气和 bau最契合。

2. 表催促,加“怎么天天都要我催你的sei.”
a) 起来sa!怎么天天早上都要我催你的sei.
b) ?起来sei 怎么天天早上都要我催你的sei.
c) *起来bau!怎么天天早上都要我催你的sei.
我个人语感上,a) 句可说,c) 句不可说,b) 句我不肯定(我倾向于可以说)。
命令的口吻可以通过语调来体现,不用语气助词也可以。但命令的口吻似乎无法与sei 和 bau 共存,
d) 起来sa! 我命令你!
e) *起来sei ! 我命令你!
f) *起来bau ! 我命令你!

3. 表请求,加“求求”
a) *我求求你登点儿来sa
b) 我求求你登点儿来sei
c) 我求求你登点儿来bau
这三例可以说明 sa 跟 “请求” 的语气是矛盾的。 sei 和 bau 和 “请求”不矛盾。

由上9例我们可以看出 sei 和 bau, 正如你所说,都可以表示 请求(邀请),但bau可表建议,sei 不可以。bau 和 sei 似乎不能与”命令“的语气共存。当然我这个“加词”的方法不一定对。

另外 sa 似乎还有另一个用法,
4. sa 表挑衅/激将(daring)

a) 你试试看sa! 我不拎起来弄你个巴掌!
b) *你试试看sei! 我不拎起来弄你个巴掌!
c) *你试试看bau! 我不拎起来弄你个巴掌!

4 中的例子跟整个句子的语调相关,而不仅仅是sa。问题是 sa 在这里可能不是表示催促。