phonetisr: A Naive IPA Tokeniser

This package is a (naive) tokeniser of phonetic transcriptions in the International Phonetic Alphabet (IPA).

With phonetisr, you can parse texts and word lists transcribed in IPA and tokenise them into phones so that you can perform quantitative analyses.

Installation

You can install the development version from GitHub with:

# install.packages("remotes")
remotes::install_github("stefanocoretta/phonetisr@v0.0.5")

Usage

library(phonetisr)

# IPA strings to be tokenised
ipa <- c("pʰãkʰ", "tʰum̥", "ɛkʰɯ")

# List of character sequences to be considered single phones
ph <- c("pʰ", "tʰ", "kʰ", "ã", "m̥")

# Tokenise strings
phonetise(ipa, multi = ph)
#> [[1]]
#> [1] "pʰ" "ã"  "kʰ"
#> 
#> [[2]]
#> [1] "tʰ" "u"  "m̥" 
#> 
#> [[3]]
#> [1] "ɛ"  "kʰ" "ɯ"

Roadmap

Scan for illegal (non-IPA) characters.
Provide a list of default multi-character phones.
Functions for data import/export.
Ignore diacritics.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

phonetisr: A Naive IPA Tokeniser

Installation

Usage

Roadmap

Files

README.md

Latest commit

History

README.md

File metadata and controls

phonetisr: A Naive IPA Tokeniser

Installation

Usage

Roadmap