Classifying pluricentric languages: Extending the monolingual model
This study presents a new language identification model for pluricentric languages that uses n-gram language models at the
character and word level. The model is evaluated in two steps. The first step consists of the identification of two varieties of
Spanish (Argentina and Spain) and two varieties of French (Quebec and France) evaluated independently in binary classification
schemes. The second step integrates these language models in a six-class classification with two Portuguese varieties.
Share this page