Template:pt-IPA/documentation
Introduction
Usage:
===Pronunciation===
{{pt-IPA}}
or:
===Pronunciation===
{{pt-IPA|fiksar}}
When the template cannot generate the correct pronunciation from a single respelling, you can give a separate respelling for each dialect:
===Pronunciation===
{{pt-IPA|br=abecedáryo|pt=àbssdário}}
The module can accommodate multiple pronunciations for a given dialect:
===Pronunciation===
{{pt-IPA|br=+|pt=àmém,àmén}}
Special features
Introduction
The general principle behind this module is to allow a single respelling to be used as much as possible for both Portugal and Brazil, despite the dramatic differences in pronunciation between the two dialects. To support this, various symbols are defined that have an effect in only one of the two dialects.
For example, unstressed a e o in Portugal are normally pronounced as reduced vowels /ɐ ɨ u/, but sometimes as open vowels /a ɛ ɔ/, and sometimes (less frequently) as close vowels /ɐ e o/. The corresponding words in Brazil are usually pronounced with full vowels /a e o/ regardless of the particular quality in Portugal. To support this, unmarked symbols a e o request the default unstressed pronunciation (usually reduced), while we provide special symbols à è ò to indicate unstressed open vowels in Portugal and special symbols ā ē ō to indicate unstressed close vowels in Portugal. All three sets of symbols map to the same pronunciation in Brazil.
Conversely, in Brazil there are frequently multiple ways of pronouncing unstressed vowels /e i o u/ in hiatus (i.e. directly before another vowel), where a single word often admits multiple pronunciations, while in Portugal these are fairly consistently pronounced as glides /j w/. We provide various symbols to support the variation in Brazil, which all map to glides in Portugal.
Various specific situations are described in more detail below.
Indicating vowel quality
- Acute accent á é í ó ú indicates stressed vowels, and in addition, in combination with non-high vowels á é ó indicates an open quality /a ɛ ɔ/.
- Circumflex accent â ê ô indicates stressed close vowels /ɐ e o/.
- Grave accent à è ò indicates unstressed open vowels /a ɛ ɔ/ in Portugal but has no effect in Brazil.
- Dot-under ạ ẹ ọ indicates unstressed open vowels /a ɛ ɔ/ in both Portugal and Brazil (e.g. in fofoca, forrobodó respelled fọfóca, fọrrọbọdó).
- Macron ā ē ō indicates unstressed close vowels /ɐ e o/ in Portugal but has no effect in Brazil. Note that ā is rarely necessary, but is useful in cases like saudade respelled sāudade.
- Dot-under + circumflex ậ ệ ộ indicates unstressed close vowels /ɐ e o/ in both Brazil and Portugal; however, this should rarely be necessary, as unstressed /e o/ are the default in Brazil.
In most circumstances, vowel quality must be indicated on any stressed e or o; if not, an error will be thrown. The following are the exceptions where this is not necessary:
- When followed by a nasal consonant (m n nh), regardless of whether this nasal consonant is in turn followed by a vowel. In this case, the vowel quality defaults to close. For example, lenha, mente and bom respelled as-is are interpreted as if respelled lênha, mênte and bôm.
- In a diphthong ei eu oi ou. These are treated as if respelled êi êu ôi ôu, in keeping with their most common interpretation.
- For o specifically, if it is directly followed by another vowel, in which case it defaults to close. For example, boa and voo respelled as-is are interpreted as if respelled bôa and vôo.
- In the common suffixes -dor, -tor, -sor and their corresponding feminine and plural forms, where the o defaults to close.
- In the common suffix -oso, where the o defaults to close. The corresponding feminine and plural suffixes -osa, -osos and -osas are also handled automatically, but in these cases the o defaults to open, as the suffix is metaphonic.
Stressed a defaults to open /a/ unless directly followed by a nasal consonant (m, n or nh), in which case it defaults to close /ɐ/, as in cama.
Nasalization
- Tilde ã ẽ ĩ õ ũ can be used to indicate a nasalized vowel, as in standard Portuguese spelling. Some properties of nasalized vowels:
  - Nasalized vowels are pronounced close (as if written with a circumflex accent) unless specifically indicated as open using ã́ ẽ́ ṍ.
  - Nasal diphthongs ãe ão õe ũi have special handling, as in standard Portuguese spelling.
  - Nasalized vowels written with a tilde are normally stressed, but an explicit acute or circumflex accent elsewhere takes precedence (as in bênção, or respellings such as Bãejamím for one possible Portugal pronunciation of Benjamim).
- As with standard Portuguese spelling, nasalized vowels can also be indicated using m or n followed by a consonant, or m at the end of a word. In Brazil, n at the end of a word also indicates nasalization, but it indicates /n/ in Portugal without nasalization of the preceding vowel. Use mm in respelling to indicate coda /m/, and nn to indicate coda /n/.
Stress assignment
The following should be noted (all of which is consistent with standard Portuguese spelling rules):
- Primary stress on a word can be indicated by placing an acute accent or circumflex on the stressed vowel. The choice of accent indicates the quality of the vowel in the case of a e o, while an acute accent should always be used on i u. See above for the specifics of vowel quality.
- Nasal vowels indicated with a tilde (see above) bear primary stress in the absence of an acute or circumflex accent elsewhere in the word. In the case of multiple tildes in a single word (e.g. pãozão, aviãozão), the last vowel with tilde bears the primary stress.
- In the absence of any acute accent, circumflex or tilde, the standard stress assignment rules apply. Approximately speaking, the following final syllables are unstressed: -a -e -o -as -es -os -am -em -ans -ens. If a word ends in any of these sequences, the second-to-last syllable bears primary stress; otherwise the last syllable bears primary stress.
- Epenthetic syllables in Brazil involving /i/ spelled using i^, i^^ or i*, such as in digno (“worthy”) respelled dighi^no and punk (“punk”) respelled panki^, do not affect stress assignment. Effectively, the stress assignment algorithm behaves as if the syllables are not present.
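The default rule in the list above (no acute, circumflex or tilde anywhere in the word) can be sketched in Python. This is only an illustration of the rule as described, not the module's actual code: the function names are hypothetical, the ending list is the approximate one given above, and epenthetic symbols such as i^ and monosyllabic words are not handled.

```python
import unicodedata

# Final sequences the documentation lists as unstressed (approximate list).
UNSTRESSED_ENDINGS = ("ans", "ens", "as", "es", "os", "am", "em", "a", "e", "o")

# Combining acute, circumflex and tilde: the marks that assign stress explicitly.
STRESS_MARKS = {"\u0301", "\u0302", "\u0303"}


def has_explicit_stress(word: str) -> bool:
    """Return True if any vowel carries an acute, circumflex or tilde."""
    decomposed = unicodedata.normalize("NFD", word)
    return any(ch in STRESS_MARKS for ch in decomposed)


def default_stress_position(word: str) -> str:
    """Classify a word as 'explicit', 'penultimate' or 'final' stress."""
    if has_explicit_stress(word):
        return "explicit"
    if word.lower().endswith(UNSTRESSED_ENDINGS):
        return "penultimate"
    return "final"
```

For instance, `default_stress_position("casa")` falls under the -a ending, while `default_stress_position("falar")` matches no listed ending and so receives final stress.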
In addition, the following special rules apply:
- If there are multiple primary stresses indicated, all but the last are converted to secondary stress. This is the standard way of notating secondary stress in a word; in particular, a grave accent does not indicate secondary stress (see above).
- In the rare circumstance where the above rule does not suffice for secondary stress (e.g. if the secondary stress comes after the primary stress), use line-under to explicitly indicate secondary stress: a̱ e̱ i̱ o̱ u̱. The quality of a e o marked in this fashion is open /a ɛ ɔ/. To indicate close /ɐ e o/, use â̱ ê̱ ô̱.
- There is special handling of the suffixes -mente, -zinho(s) and -zinha(s). Essentially, these endings automatically have a -- inserted before them (see below for the exact meaning of this separator). This means that both the suffix and the preceding component get primary stresses assigned, and that an explicit primary stress on the preceding component does not indicate primary stress on the entire word, but only on that portion (which is eventually converted to secondary stress by the rule above about multiple primary stresses). This means, for example, that the respelling fácilmente for facilmente does not indicate primary stress on the a, but rather secondary stress, with primary stress on the first e in -mente. Similarly, a respelling like abertamente for abertamente will throw an error, as the first e bears stress but does not have its quality indicated; a respelling like abértamente must be used. To defeat this behavior, add an explicit accent on the suffix. For example, dormente should be respelled dormênte, and vizinho should be respelled vizínho.
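The automatic separator described in the last item can be approximated with a single regular expression. This is a hedged Python sketch under the assumption that the match is purely textual; the names `AUTO_SPLIT_SUFFIX` and `insert_suffix_separator` are hypothetical and the real module may apply further conditions.

```python
import re

# Unaccented -mente, -zinho(s), -zinha(s) at the end of a word, preceded by
# at least one letter. An accent on the suffix (dormênte, vizínho) prevents
# the match, which mirrors the "add an explicit accent" escape hatch.
AUTO_SPLIT_SUFFIX = re.compile(r"(?<=\w)(mente|zinh[oa]s?)$")


def insert_suffix_separator(respelling: str) -> str:
    """Insert '--' before a recognized suffix, leaving other words untouched."""
    return AUTO_SPLIT_SUFFIX.sub(r"--\1", respelling)
```

Under these assumptions, `insert_suffix_separator("fácilmente")` yields `fácil--mente`, while `dormênte` and `vizínho` pass through unchanged because the accented suffix no longer matches.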
Symbols indicating other Brazil-Portugal differences (vowel raising, glides, epenthesis)
- The symbol i* indicates an epenthetic unstressed /i/ in Brazil (and has no effect on determination of the stressed vowel) but no vowel in Portugal.
- The symbol i^ not preceding or following a vowel indicates either an epenthetic unstressed /i/ in Brazil or no vowel (but still causes palatalization of /t/ and /d/) and indicates no vowel in Portugal.
- The symbol i^ preceding or following a vowel indicates either an unstressed /i/ in hiatus or a /j/ in Brazil and is the same as i in Portugal. Note that the behavior of i^ preceding a vowel is actually the default currently for handling i in hiatus in Brazil. Using i^ following a vowel is principally useful in the sequence ui^. This maps to either u.i or uy in Brazil but to ui in Portugal (which ends up pronounced /wi/). This gives the correct pronunciation for words like distribuição.
- The symbol i^^ is like i^ (in both meanings) but with the two possibilities listed in the opposite order.
- The symbol u^ indicates either an unstressed /u/ in hiatus or a /w/ in Brazil and is the same as u in Portugal.
- The symbol u^^ is like u^ but with the two possibilities listed in the opposite order.
- The symbol e^ indicates either an unstressed e or i in Brazil and is the same as e in Portugal.
- The symbol e^^ is like e^ but with the two possibilities listed in the opposite order.
- The symbol o^ indicates either an unstressed o or u in Brazil and is the same as o in Portugal.
- The symbol o^^ is like o^ but with the two possibilities listed in the opposite order.
- The symbol des^ at the beginning of a word or component indicates either des++ or dis++ in Brazil and is the same as des in Portugal.
- The symbol des^^ is like des^ but with the two possibilities listed in the opposite order.
- The symbol ê* is like ê in Brazil but é in Portugal. This is useful especially before nasal consonants, e.g. gene (“gene”) respelled gê*ne.
- The symbol é* is like é in Brazil but ê in Portugal. This is useful especially in the diphthong ei, e.g. geleia (“jelly, jam”) respelled gelé*ia.
- The symbol ô* is like ô in Brazil but ó in Portugal. This is useful especially before nasal consonants, e.g. carbono (“carbon”) respelled carbô*no.
- The symbol ó* is like ó in Brazil but ô in Portugal. This is useful especially in the diphthong oi, e.g. apoio (“I support”) respelled apó*io.
Some mnemonics to help you remember these codes:
- ^ indicates that there are two possible outputs in Brazil, the first of which is generally the same as the vowel directly preceding. For example, the first possible output for i^ and u^ in hiatus is /i/ and /u/ respectively. Similarly, the first possible output for e^ and o^ is /e/ and /o/ respectively.
- ^^ is the same as ^ but the two outputs are given in opposite order.
- * indicates a single output in Brazil that differs from the corresponding Portugal output, where the Brazil output is always the vowel exactly as written. Hence, i* means /i/ in Brazil (and nothing in Portugal). Similarly, ê* ô* é* ó* mean exactly those vowels in Brazil, but the "height-opposite" vowels in Portugal.
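The ordering mnemonic for ^ and ^^ can be illustrated with a small Python sketch. It covers only e and o, whose alternatives are stated explicitly above, and it returns respelling letters rather than IPA; the table and function names are hypothetical, not taken from the module.

```python
# Brazil alternatives for the circumflex symbols, in the order used by '^'.
# Only e and o are covered; i^ and u^ depend on context and are omitted.
BRAZIL_ALTERNATIVES = {"e": ("e", "i"), "o": ("o", "u")}


def brazil_outputs(symbol: str) -> list[str]:
    """Expand e^, e^^, o^ or o^^ into the two ordered Brazil respellings."""
    vowel, marks = symbol[:1], symbol[1:]
    if vowel not in BRAZIL_ALTERNATIVES or marks not in ("^", "^^"):
        raise ValueError(f"unsupported symbol: {symbol!r}")
    first, second = BRAZIL_ALTERNATIVES[vowel]
    # '^' lists the vowel as written first; '^^' reverses the order.
    return [first, second] if marks == "^" else [second, first]


def portugal_output(symbol: str) -> str:
    """In Portugal these symbols behave like the plain vowel."""
    return symbol[:1]
```

With this table, `brazil_outputs("e^")` lists e before i, and `brazil_outputs("e^^")` lists i before e, while `portugal_output` ignores the marks entirely.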
Multiple components of a word
- Use - to treat several components of a word as separate words. Each component is normally assigned its own stress (although all but the last stress will be converted to a secondary stress, consistent with handling of multiple word stresses elsewhere), and letters at component boundaries are treated as if at word boundaries. This follows standard Portuguese spelling practices; compare arco-íris, batata-da-terra, etc.
- -- is similar to - but a few word-final transformations do not apply to the component preceding the --; for example, in Brazil, written a in this position is /a/ not /ɐ/, and optional /(j)/ insertion after a stressed vowel and before /s/ does not apply. This is intended for suffixes like -mente, -zinho/-zinha, -zão, etc., which require this behavior. Note that -mente, -zinho(s) and -zinha(s) automatically add -- before them; but you will need to manually add it in words like cafezeiro (respelled café--zeiro), boazona (respelled boa--zona), etc.
- : is somewhat like - and --, but final -o and -e in the preceding component are not raised to /u/ and /i/ respectively in Brazil, as they are with - and --. This is useful especially for prefixes with secondary stress, e.g. eletrodoméstico respelled elétrò:doméstico; idiossincrasia respelled ídiò:sincrasia; antiferromagnético respelled ânti:férrò:màghi^nético. Note that in words like these, the final -o of the prefix is frequently pronounced /ɔ/ in Portugal and requires ò for this reason.
- + can be used before suffixes like -inho and -íssimo, and behaves like : but with the following differences: (1) stress on the component preceding + is undisplayed rather than being converted into secondary stress; (2) syllabification is transparent to +. Hence e.g. rapazinho can be respelled rapaz+inho, and vozinha can be respelled vóz+inha, and the correct pronunciation will be generated.
- ++ is like + but no stress assignment happens at all to the component preceding it. It is intended for unstressed prefixes such as des-; letters at the beginning of the following component will be treated as word-initial. In fact, the special notation des^ uses ++ internally.
Other symbols
- + stands for the pagename (see example above).
- # indicates an optional hiatus in both Portugal and Brazil, as in diabetes (“diabetes”) respelled di#abétes. In this example, this respelling is equivalent to writing di.abétes,dyabétes for both Portugal and Brazil. (This differs from the symbol combinations written above using ^ and ^^, which apply only to Brazil.)
- Use . to indicate an explicit syllable division, particularly between vowels, as in enraizar (“to take root”) respelled enra.izar. Under normal circumstances, do not use this to override the default syllabification algorithm; instead, contact User:Benwing2 to suggest changes to that algorithm.
- ü after g and q before a front vowel (e i y) indicates that the u should be pronounced as /w/ rather than being silent, as in linguista (“linguist”) respelled lingüista and frequência (“frequency”) respelled freqüência.
- Use , to separate multiple possible pronunciations. Note, however, that this is only recognized if no space follows the comma; otherwise, the comma is considered to be embedded in the respelling and is treated as a foot boundary, as in rei morto, rei posto (“the king is dead, long live the king”).
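The comma rule in the last item (a comma separates pronunciations only when no space follows it) can be sketched in Python with a negative lookahead. This is an illustrative assumption about how the split could be done, with a hypothetical function name; it also ignores commas that might appear inside inline modifiers.

```python
import re


def split_pronunciations(argument: str) -> list[str]:
    """Split on commas that are not followed by a space."""
    # ",(?! )" matches a comma only if the next character is not a space,
    # so "rei morto, rei posto" stays in one piece.
    return re.split(r",(?! )", argument)
```

For example, `split_pronunciations("àmém,àmén")` produces two respellings, whereas the multiword phrase with comma plus space is kept as a single respelling.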
Inline modifiers
You can attach inline modifiers to a given pronunciation using the format RESPELLING<MOD:TEXT><MOD:TEXT>... . For example, to attach a qualifier colloquial to a given pronunciation, use a syntax as follows (for pizza):
{{pt-IPA|br=pitsa,pítissa<q:colloquial>|pt=piza,pitsa}}
which generates
Note how the Brazil pronunciation with respelling pítissa is tagged as colloquial. Currently the following inline modifiers are recognized:
Modifier | Meaning |
---|---|
q: | Qualifier placed before the pronunciation it is attached to. |
ref: | Reference placed after the pronunciation it is attached to. If you use this, make sure to place a ===References=== section near the bottom, whose contents use <references />. The syntax is as described for the refN= argument to {{IPA}} and {{it-IPA}}. In general, specify the text of the reference directly following ref:. To specify a name for a given reference, use <<name:NAME>> directly after the reference text and inside of the inline modifier; this is as if <ref name="NAME">...</ref> were used. To use a previously named footnote a second time, use only <ref:<<name:NAME>>> with an empty reference text; this is as if <ref name="NAME" /> were used. You can also group references using <<group:GROUP>> after the reference text. |
bullets: | Specify the number of bullets preceding the line for this dialect variant (defaults to 1). If given, this should follow all comma-separated terms. |
pre: | Specify text to precede the formatted pronunciation line. If given, this should follow all comma-separated terms. |
post: | Specify text to follow the formatted pronunciation line. If given, this should follow all comma-separated terms. |
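To make the RESPELLING<MOD:TEXT> format concrete, here is a minimal Python sketch that separates a respelling from its modifiers. It is an assumption-laden illustration: the names are hypothetical, and the regular expression deliberately does not handle nested <<name:NAME>> or <<group:GROUP>> markers inside a ref: modifier.

```python
import re

# One modifier: '<', a modifier name, ':', then text without angle brackets, '>'.
MODIFIER_RE = re.compile(r"<(\w+):([^<>]*)>")


def parse_inline_modifiers(term: str) -> tuple[str, list[tuple[str, str]]]:
    """Return (respelling, [(modifier, text), ...]) for one pronunciation."""
    # Everything before the first '<' is the respelling itself.
    respelling = term.split("<", 1)[0]
    modifiers = MODIFIER_RE.findall(term)
    return respelling, modifiers
```

Applied to the pizza example, `parse_inline_modifiers("pítissa<q:colloquial>")` separates the respelling `pítissa` from a single `q` modifier with the text `colloquial`.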
An example of using a reference is with menu:
{{pt-IPA|pt=menu<q:normative>,mènu<q:common but considered incorrect><ref:[https://www.flip.pt/Duvidas-Linguisticas/Duvida-Linguistica/DID/752]>}} ===References=== <references />
which generates
References
Substitution notation
For long (especially multiword) terms requiring just one or two respelling indications, it can be annoying to have to repeat the entire term in the respelling. To make respelling such terms easier, substitution notation is supported. The syntax is most easily illustrated with an example, e.g. for análise de ativação (“activation analysis”):
{{pt-IPA|[ativ:àtiv]}}
which generates
Here, the substitution notation stands for the full respelling análise de àtivação. The general format is to use a bracketed expression in place of the respelling, where inside of the brackets is one or more substitutions, semicolon-separated and of the form FROM:TO, where FROM is a portion of the original spelling and TO is the corresponding respelling. Substitutions are implemented left-to-right, and the FROM portion of each substitution must match the original spelling in at least one place or an error is thrown.
An example using two substitutions is for transtorno de personalidade antissocial (“antisocial personality disorder”), which can be written as follows using substitution notation:
{{pt-IPA|[torn:tôrn;antiss:ânti:ss]}}
which generates
- (Brazil) IPA(key): /tɾɐ̃sˈtoʁ.nu d͡ʒi peʁ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.siˈaw/ [tɾɐ̃sˈtoɦ.nu d͡ʒi peh.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.sɪˈaʊ̯], (faster pronunciation) /tɾɐ̃sˈtoʁ.nu d͡ʒi peʁ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaw/ [tɾɐ̃sˈtoɦ.nu d͡ʒi peh.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaʊ̯]
- (São Paulo) IPA(key): /tɾɐ̃sˈtoɾ.nu d͡ʒi peɾ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.siˈaw/ [tɾɐ̃sˈtoɾ.nu d͡ʒi peɾ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.sɪˈaʊ̯], (faster pronunciation) /tɾɐ̃sˈtoɾ.nu d͡ʒi peɾ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaw/ [tɾɐ̃sˈtoɾ.nu d͡ʒi peɾ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaʊ̯]
- (Rio de Janeiro) IPA(key): /tɾɐ̃ʃˈtoʁ.nu d͡ʒi peʁ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.siˈaw/ [tɾɐ̃ʃˈtoʁ.nu d͡ʒi peχ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.so.sɪˈaʊ̯], (faster pronunciation) /tɾɐ̃ʃˈtoʁ.nu d͡ʒi peʁ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaw/ [tɾɐ̃ʃˈtoʁ.nu d͡ʒi peχ.so.na.liˈda.d͡ʒi ˌɐ̃.t͡ʃi.soˈsjaʊ̯]
- (Southern Brazil) IPA(key): /tɾɐ̃sˈtoɻ.no de peɻ.so.na.liˈda.de ˌɐ̃.t͡ʃi.so.siˈaw/ [tɾɐ̃sˈtoɻ.no de peɻ.so.na.liˈda.de ˌɐ̃.t͡ʃi.so.sɪˈaʊ̯], (faster pronunciation) /tɾɐ̃sˈtoɻ.no de peɻ.so.na.liˈda.de ˌɐ̃.t͡ʃi.soˈsjaw/ [tɾɐ̃sˈtoɻ.no de peɻ.so.na.liˈda.de ˌɐ̃.t͡ʃi.soˈsjaʊ̯]
Note the use of a semicolon to separate the two substitutions, and the fact that the embedded colon in the second substitution is not problematic.
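The substitution notation can be sketched in Python as follows. This is a simplified illustration, not the module's implementation: the function name is hypothetical, every occurrence of FROM is replaced (an assumption), and each substitution is checked against the running result rather than strictly against the original spelling. Splitting each item on the first colon only is what lets TO contain a colon.

```python
def apply_substitutions(pagename: str, spec: str) -> str:
    """Expand a bracketed '[FROM:TO;FROM:TO]' expression against the pagename."""
    if not (spec.startswith("[") and spec.endswith("]")):
        raise ValueError("substitution expression must be bracketed")
    result = pagename
    # Substitutions are semicolon-separated and applied left to right.
    for item in spec[1:-1].split(";"):
        # Split on the first colon only, so TO may itself contain ':'.
        source, separator, target = item.partition(":")
        if not separator:
            raise ValueError(f"missing ':' in substitution {item!r}")
        if source not in result:
            raise ValueError(f"{source!r} does not match the spelling")
        result = result.replace(source, target)
    return result
```

Under these assumptions, `apply_substitutions("análise de ativação", "[ativ:àtiv]")` gives the full respelling `análise de àtivação` mentioned above.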
Substitution expressions can be combined with regular respellings, given inline modifiers, etc. For example, for antigo eslavo eclesiástico (“Old Church Slavonic”), the initial e of eclesiástico can be either respelled as-is or using ē, indicating two possible pronunciations /e/ and /i/ in Portugal. To specify this, use the following:
{{pt-IPA|[ecl:ēcl],+}}
which generates
- (Brazil) IPA(key): /ɐ̃ˈt͡ʃi.ɡu izˈla.vu e.kle.ziˈas.t͡ʃi.ku/ [ɐ̃ˈt͡ʃi.ɡu izˈla.vu e.kle.zɪˈas.t͡ʃi.ku], (faster pronunciation) /ɐ̃ˈt͡ʃi.ɡu izˈla.vu e.kleˈzjas.t͡ʃi.ku/, /ɐ̃ˈt͡ʃi.ɡu ezˈla.vu e.kle.ziˈas.t͡ʃi.ku/ [ɐ̃ˈt͡ʃi.ɡu ezˈla.vu e.kle.zɪˈas.t͡ʃi.ku], (faster pronunciation) /ɐ̃ˈt͡ʃi.ɡu ezˈla.vu e.kleˈzjas.t͡ʃi.ku/
- (Rio de Janeiro) IPA(key): /ɐ̃ˈt͡ʃi.ɡu iʒˈla.vu e.kle.ziˈaʃ.t͡ʃi.ku/ [ɐ̃ˈt͡ʃi.ɡu iʒˈla.vu e.kle.zɪˈaʃ.t͡ʃi.ku], (faster pronunciation) /ɐ̃ˈt͡ʃi.ɡu iʒˈla.vu e.kleˈzjaʃ.t͡ʃi.ku/, /ɐ̃ˈt͡ʃi.ɡu eʒˈla.vu e.kle.ziˈaʃ.t͡ʃi.ku/ [ɐ̃ˈt͡ʃi.ɡu eʒˈla.vu e.kle.zɪˈaʃ.t͡ʃi.ku], (faster pronunciation) /ɐ̃ˈt͡ʃi.ɡu eʒˈla.vu e.kleˈzjaʃ.t͡ʃi.ku/
- (Southern Brazil) IPA(key): /ɐ̃ˈt͡ʃi.ɡo ezˈla.vo e.kle.ziˈas.t͡ʃi.ko/ [ɐ̃ˈt͡ʃi.ɡo ezˈla.vo e.kle.zɪˈas.t͡ʃi.ko], (faster pronunciation) /ɐ̃ˈt͡ʃi.ɡo ezˈla.vo e.kleˈzjas.t͡ʃi.ko/
- (Portugal) IPA(key): /ɐ̃ˈti.ɡu (i)ʒˈla.vu e.klɨˈzjaʃ.ti.ku/ [ɐ̃ˈti.ɣu (i)ʒˈla.vu e.klɨˈzjaʃ.ti.ku], /ɐ̃ˈti.ɡu (i)ʒˈla.vu i.klɨˈzjaʃ.ti.ku/ [ɐ̃ˈti.ɣu (i)ʒˈla.vu i.klɨˈzjaʃ.ti.ku]
- (Northern Portugal) IPA(key): /ɐ̃ˈti.ɡu (i)ʒˈla.bu e.klɨˈzjaʃ.ti.ku/ [ɐ̃ˈti.ɣu (i)ʒˈla.βu e.klɨˈzjaʃ.ti.ku], /ɐ̃ˈti.ɡu (i)ʒˈla.bu i.klɨˈzjaʃ.ti.ku/ [ɐ̃ˈti.ɣu (i)ʒˈla.βu i.klɨˈzjaʃ.ti.ku]
Prefixes and suffixes
Prefixes (words ending in a hyphen) are always treated as lacking primary stress. Any stressed vowels are given secondary stress. Suffixes (words beginning with a hyphen), however, are usually stressed as normal. To specify the pronunciation of an unstressed suffix such as -a or -fago, put a dot over the vowel that would be stressed, using the symbols ȧ ė i̇ ȯ u̇. For example, for -fago, use
{{pt-IPA|-fȧgo}}
which generates
Note the lack of a stress marker and the occurrence of /ɐ/ in Portugal.
Deduplication
If the same pronunciation is generated twice for a given dialect (including with the same qualifiers and references, if any), only the first occurrence is displayed. This is useful, for example, when there are two Portugal pronunciation variants but only one Brazil pronunciation, such as for hemorragia (“hemorrhage”); use
{{pt-IPA|hèmorragia,+}}
which generates
Here, + expands to the pagename hemorragia, which differs from the first respelling only in that the first respelling has è instead of e. Both variants map to the same sound /e/ in Brazil, so the two Brazil variants end up pronounced the same and are deduplicated.
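Deduplication of this kind amounts to an order-preserving filter that keeps the first occurrence of each generated pronunciation. The Python sketch below is illustrative only: the function name is hypothetical, and each entry is modeled as a tuple of (pronunciation, qualifiers) so that entries count as duplicates only when the qualifiers match too.

```python
def deduplicate(entries: list[tuple[str, tuple[str, ...]]]) -> list[tuple[str, tuple[str, ...]]]:
    """Keep only the first occurrence of each (pronunciation, qualifiers) pair."""
    seen = set()
    result = []
    for entry in entries:
        if entry not in seen:
            seen.add(entry)
            result.append(entry)
    return result
```

Two respellings that generate the same output for a dialect therefore collapse into one displayed line, while the same pronunciation with a different qualifier is kept.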
Epenthetic /i/ in Brazil
Brazilian Portuguese is known for having an unwritten epenthetic /i/ inserted to break up difficult-to-pronounce consonant clusters. A well-known example is advogado (“lawyer”), frequently pronounced as if written adivogado. Words with epenthetic /i/ often admit alternative pronunciations where the vowel is not pronounced on the surface (but is still present in a latent sense because it triggers palatalization of /t d/ to /t͡ʃ d͡ʒ/). To indicate such a vowel, use one of the following symbols (all of which generate no vowel in Portugal dialects):
- If the vowel is always present, use i*. This is typically the case, for example, with mn clusters such as in amnésia (“amnesia”), which are not normally supported in Brazil (and in fact are one source of spelling differences between Brazil and Portugal, cf. Portugal amnistia (“amnesty”), spelled anistia in Brazil). (Another such cluster is brr, such as in ab-rogação (“abrogation”).) For example, for gimnosperma (“gymnosperm”), write
{{pt-IPA|gimi*nòspérma}}
which generates
Here i* specifies a mandatory epenthetic /i/ in Brazil that is not present in Portugal; meanwhile, ò specifies an unpredictable unstressed open /ɔ/ in Portugal that is not present in Brazil (which has normal /o/).
Similarly for ab-rogação, write
{{pt-IPA|abi*rrogação}}
which generates
- If the vowel is usually present, but sometimes isn't, use i^. This is the case for most consonant clusters where the second consonant is a stop, fricative or nasal, i.e. any consonant other than /l/, single /ɾ/, or a glide /j/ or /w/. (There are a few exceptions; see the next item.) An example is pneu (“tire”); write
{{pt-IPA|pi^neu}}
which generates
Another example of note is digno (“worthy”); write
{{pt-IPA|dighi^no}}
which generates
Two things should be noted here. One is the use of gh to get hard /ɡ/; this is the recommended way of respelling in this situation. (gu would not work for Portugal, where the respelling diguno would be generated.) Similarly for a cluster with c, use respelling with k, e.g. respell acne (“acne”) as aki^ne. The other is the lack of a stress mark in the respelling. This is because the epenthetic /i/ that is generated is ignored for stress assignment purposes (but is treated as a normal vowel for all other purposes, e.g. palatalization of t d, softening of c g, and syllabification).
- If the vowel is usually not present, but sometimes is, use i^^. This happens commonly with /kt/, /ps/, /pt/, /bs/ and /bt/ clusters (exceptions are /ps/ and /bs/ clusters followed by another consonant, such as substantivo).
Vowels in hiatus
A hiatus is an occurrence of two vowels next to each other with no consonant between them. (In Portuguese, diphthongs such as au ei õe are not normally considered instances of hiatus, but are instead considered single phonemes.) The current treatment of hiatuses is as follows:
- In Portugal, an unstressed e or i directly followed by another vowel is treated as a glide /j/. Likewise an unstressed o or u directly followed by another vowel is treated as a glide /w/. The exceptions are the sequences eí (as in ateísta, veículo, etc.) and e.i (as in europeizar respelled europe.izar), which are treated as if spelled aí and a.i, respectively, consistent with normal Central Portugal pronunciation.
- In Brazil, all vowels in hiatus are currently treated as-is, so that e.g. paciência renders as /pa.siˈẽ.si.ɐ/ and passear renders as /pa.seˈa(ʁ)/. This is subject to change. To explicitly notate a glide, use y or w. To explicitly notate a hiatus, put a . between the vowels. To notate multiple possibilities, use circumflex symbols as described above.
Special handling of certain consonants in certain contexts
l
- "Coda l" (i.e. written l when not occurring before a vowel) generates [w] in Brazil and [ɫ] in Portugal.
- Before coda l in Portugal, vowels generally have an open pronunciation, even when unstressed. Specifically, al becomes [aɫ], as in saltar (“to jump”); el becomes [ɛɫ], as in túnel (“tunnel”) and beldade (“beauty”); and ol generates two outputs, [oɫ] and [ɔɫ] (representing regional and per-speaker variation), as in Moldávia (“Moldova”).
r
- The pronunciation of written r varies greatly between Brazil and Portugal and within different dialects in each case.
- There are three varieties of r, which we will term strong r, weak r and coda r. Strong r and weak r contrast between vowels, where strong r is written as double rr whereas weak r is written as single r, as in e.g. carro (“car”) vs. caro (“dear”). Elsewhere, only one variety occurs. Specifically, strong and weak r only occur before vowels, while coda r occurs elsewhere (before a consonant or at the end of a word). Strong r occurs at the beginning of a word, as well as after a nasal vowel (as in genro (“son-in-law”)), an l (as in chilrear (“to chirp”)), and an s (as in Israel (“Israel”)). Weak r occurs after all other consonants.
- The following generalizations describe the most common pronunciation of strong, weak and coda r:
  - Weak r is a flap [ɾ] everywhere.
  - Strong r is usually a guttural sound, conventionally notated as /ʁ/ in phonemic notation. In Portugal, this actually corresponds to the normal pronunciation [ʁ], but in Brazil this conventional notation (even though we follow it) is highly misleading as it does not at all represent the actual pronunciation of this sound in most dialects. Rather, the most common pronunciation is [h] (sometimes a uvular [χ], as in Rio de Janeiro). Hence, we use [h] as the phonetic representation of strong r in "general Brazilian".
  - Coda r is pronounced the same as weak r in Portugal and some Brazilian dialects (e.g. standard São Paulo city), but the same as strong r in most Brazilian dialects; this is what we use for "general Brazilian". Meanwhile, some Brazilian dialects have a unique sound for coda r that is different from both strong and weak r. For example, the typical "Caipira" accent (found in several rural areas of Brazil) uses an American R [ɻ], while São Paulo state tends to use a British R [ɹ].
- Coda r in Brazil following any of the stressed vowels /a ɛ e i/ is by default written as optional, i.e. (ʁ), (ɾ) or the like. This expresses the fact that most such words are verbs, and coda r in verbs is frequently omitted. This does not apply to non-verbs, which must be respelled with rh to prevent this. For example, angular, colher and emir should be respelled angularh, colhérh and emirh respectively.
- Before word-final r in Portugal, unstressed vowels are rendered as open /a ɛ ɔ/, representing their most common pronunciation, as in dólar (“dollar”), líder (“leader”), júnior (“junior”). This also applies before a component boundary; in particular, prefixes inter-, hiper-, super- respelled ínter:, híper:, súper: automatically get /ɛ/ before the final r.
s
The letter s can have multiple possible pronunciations.
- Between non-nasal vowels (including across a word boundary), single s is /z/ while double ss is /s/. Hence the s in casa (“house”) is /z/; likewise the first s in as árvores (“the trees”).
- Elsewhere before a vowel, s is normally /s/, e.g. word-initially as in sorte (“luck”) or after a consonant as in verso (“verse”). This includes when following a nasal vowel, as in cansado (“tired”). An exception is in -trans-, as in transação (“transaction”) or intransitivo (“intransitive”), where it is /z/.
- When not before a vowel, s is either a hissing sound /s z/ or a hushing sound /ʃ ʒ/, depending on the dialect; Portugal and Rio de Janeiro dialects use hushing sounds, while other Brazilian dialects use hissing sounds. Voiced sounds /z ʒ/ occur before voiced consonants, while unvoiced sounds /s ʃ/ occur elsewhere. An exception is word-initially before a consonant, where /s/ occurs even in Portugal and Rio. All such words are borrowings, often unassimilated or semi-unassimilated, e.g. spyware and staccato.
- To force /s/ where it would not normally occur, use ss.
- Note also that the sequence sh represents /ʃ/, as in English.
x
Written x has multiple possible pronunciations in Portuguese: /s/ (as in máximo (“maximum”), trouxe (“I/he brought”)), /z/ (as in existir (“to exist”)), /ʃ/ (as in baixo (“low”)) or /ks/ (as in fixo (“fixed”)). Sometimes the same word can have two different pronunciations of x, as in xerox (“photocopy, xerox”), pronounced /ʃɛˈɾɔks/. The module handles this by defaulting to specific pronunciations in specific circumstances, and requiring respelling in all other cases. Specifically:
- Initial x- defaults to /ʃ/, as in xadrez (“chess”), Xangô (“name of an orisha in Candomblé and similar religions”), xerocar (“to xerox”).
- Final -x defaults to /ks/, as in látex (“latex”), unissex (“unisex”), Félix (“Felix”).
- x following a diphthong defaults to /ʃ/, as in abaixar (“to lower”), frouxo (“loose”), peixe (“fish”).
- Non-final x in the sequence -nx- defaults to /ʃ/, as in enxame (“swarm”), enxugar (“to wipe”).
- The sequence -ex- followed by a consonant has special handling. The x is pronounced as if written s, and the entire sequence ex in Portugal is pronounced as if written eis. This still applies in written êxC or éxC. Examples: experiência (“experience”), exsudar (“to exude”), têxtil (“textile”).
- In all other circumstances, x must be respelled ss, z, sh, cs or similar; otherwise an error results.
- NOTE: When respelling x to generate the sound /ʃ/, it is recommended to use the respelling sh, not ch, because in the future an additional pronunciation line may be added for the Northeast Portugal dialect, where ch is pronounced as /t͡ʃ/ and the words buxo (“box (tree)”) and bucho (“maw”) form a minimal /ʃ/-/t͡ʃ/ pair.
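The default rules for x listed above can be summarized as a small decision function. The Python sketch below is a rough approximation under several assumptions: the function name and vowel inventory are hypothetical, the input is lowercase, a "diphthong" is crudely detected as a vowel followed by i or u, the order in which the rules are tried is a guess, and the Portugal-specific eis treatment of ex is not modeled.

```python
VOWELS = set("aeiouyáéíóúâêôãõàèò")


def default_x_sound(word: str, index: int) -> str:
    """Return the default value of the x at word[index], or raise if none applies."""
    if word[index] != "x":
        raise ValueError("no x at this position")
    if index == 0:
        return "ʃ"  # initial x-, as in xadrez
    if index == len(word) - 1:
        return "ks"  # final -x, as in látex
    prev = word[index - 1]
    prev2 = word[index - 2] if index > 1 else ""
    nxt = word[index + 1]
    if prev in ("i", "u") and prev2 in VOWELS:
        return "ʃ"  # after a diphthong, as in peixe
    if prev == "n":
        return "ʃ"  # non-final -nx-, as in enxame
    if prev in ("e", "ê", "é") and nxt not in VOWELS:
        return "s"  # -ex- before a consonant, as in experiência
    raise ValueError("x must be respelled explicitly (ss, z, sh, cs ...)")
```

A word such as fixo falls through every rule and raises an error, which corresponds to the requirement that x be respelled in all other circumstances.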
Special handling of certain vowels in certain contexts
- Initial unstressed o- and ho- in Portugal are normally /ɔ/. This also applies after component boundaries such as those indicated by : .
- Unstressed -ie- and -ee- in Portugal, as in alienado (“alienated”) and Teerão (“Teheran”), are normally /jɛ/.
Other comments
- Under some circumstances, multiple pronunciations are output even when a single respelling is provided. This happens in the following circumstances:
  - When any of the symbols described above that use a circumflex character ^ are used (only in Brazil).
  - In Brazil, when a word begins with unstressed en- or em- followed by a consonant. The two outputs have /ẽ/ (indicated as careful pronunciation) and /ĩ/ (indicated as natural pronunciation).
  - In Brazil, when a word begins with unstressed es- or ex- followed by a consonant (except in cases like excelente). The two outputs have /i/ and /e/, representing regional and per-speaker variation.
  - In Portugal, when a word has a stressed /o/ in hiatus (directly followed by another vowel). In this case, the second output has a /w/ inserted and is marked regional. For example, boa ends up outputting /ˈbo.ɐ/ and /ˈbo.wɐ/, with the second marked as regional.
  - In Portugal, when the sequence /ʃs/ is found (as in nascer). The two outputs have /ʃs/ (indicated as careful pronunciation) and /ʃ/ (indicated as natural pronunciation).
  - In Portugal, when unstressed ol occurs followed by a consonant or word boundary. The two outputs have [oɫ] and [ɔɫ], representing regional and per-speaker variation.
Warning
This template is still being developed and is liable to change.
- Click preview before adding a usage of this template to an article in the main namespace.
- Do not automatically add usages of this template to articles in the main namespace.
- Report any mistake to User:Benwing2.