Giter VIP home page Giter VIP logo

expressives_praat's Introduction

expressives_praat

©Pere Garau Borràs (2022). Universitat de les Illes Balears

According to the aim of the research (detecting acoustic correlates for expressives), there are three relevant values to be obtained from the corpus: pitch values, intensity values and speech rate values. Manual work in obtention of these values would represent an enormous effort and, maybe, a less precise data. The corpus implements two tools to collecting data: a Praat script, to collect all data referred to acoustic parameters; and a Python script to treat all the data, obtain speech rate values and mark the presence of potential expressives. Last version of Lennes (2016) has been used as a base to build a Praat script to fulfil the purpose. Lennes’ original script is designed to collect maximum pitch data by going “through all the TextGrid files and the Sound files in a given folder, find sound-grid pairs that have the same name, open each pair, run through the TextGrid, collect data from labeled intervals and append the information to a simple tabulated text file (which you can later open in a statistical or spreadsheet program)” (Lennes, 2016). The resulting Praat script used here (henceforth, PS) allows the user to get, for each labelled segment, the values of maximum pitch, minimum pitch, pitch difference (i.e., maximum– minimum), maximum intensity, minimum intensity, intensity difference and segment duration. The script loops between the files, which must had been saved in the same folder and by pairs of ‘wav’ audio files and TextGrids named the same. Every file pair is loaded to Praat. From the audio file a Pitch object and an Intensity object are created. For Pitch object creation time step value is 0.1 s (as the value set when bounding the segments), minimum pitch is set at 75 Hz, and maximum at 300 Hz, after having examined the pitch range of the audios. For Intensity object operation values are set to 75 Hz for minimum pitch, 0.1 s for time step and no mean value has been calculated. Once these two objects are created, a loop into the segments. If the segment is labelled with a pause marker (‘xxx’) is skipped. Only segments not marked with pause mark are contemplated for analysis. For each segment, starting and ending time points and duration values are saved. Also, the script gets maximum and minimum pitch of the segment and calculates, by subtraction, pitch difference. The same procedure is used for intensity values. Moreover, the Script saves the file name, segment number and text label. File name is saved in order to locate the segment within the corpus. Segment number is stored to always maintain access to the segment reference and position. Text label is kept to postprocessing with Python script. Finally, obtained values are saved separated by semicolons in a ‘csv’ file, allowing posterior work with any statistical or data manager software. Temporary files are deleted. The result from the use of the PS is a single csv file containing all values from pitch and intensity (maximum, minimum and difference), duration, file name, segment number and segment transcription.

To cite: Garau, P. (2022). expressives_praat.praat (v1.0). Zenodo. https://doi.org/10.5281/zenodo.7234113

expressives_praat's People

Contributors

garaupere avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.