(Translated by https://www.hiragana.jp/)
Corsican alphabet: Difference between revisions - Wikipedia Jump to content

Corsican alphabet: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
correcting Trigraph wikilink
 
(35 intermediate revisions by 27 users not shown)
Line 1: Line 1:
The modern '''Corsican alphabet''' ([[Corsican language|Corsican]] '''u santacroce''' or '''u salteriu''') uses 22 basic letters taken from the [[Latin alphabet]] with some changes, plus some multigraphs. The pronunciations of the English, French, Italian or Latin forms of these letters are not a guide to their pronunciation in ''Corsu'', which has its own pronunciation, often the same, but frequently not. As can be seen from the table below, two of the phonemic letters are represented as [[trigraph]]s, plus some other [[digraph (orthography)|digraphs]]. Nearly all the letters are [[allophonic]]; that is, a [[phoneme]] of the language might have more than one pronunciation and be represented by more than one letter. The exact pronunciation depends mainly on word order and usage and is governed by a complex set of rules, variable to some degree by dialect. These have to be learned by the speaker of the language.
The modern '''Corsican alphabet''' ({{lang-co|u santacroce}} or {{lang|co|u salteriu}}) uses twenty-two basic letters taken from the [[Latin alphabet]] with some changes, plus some multigraphs. The pronunciations of the English, French, Italian or Latin forms of these letters are not a guide to their pronunciation in Corsican, which has its own pronunciation, often the same, but frequently not. As can be seen from the table below, two of the phonemic letters are represented as [[Trigraph (orthography)|trigraphs]], plus some other [[digraph (orthography)|digraphs]]. Nearly all the letters are [[allophonic]]; that is, a [[phoneme]] of the language might have more than one pronunciation and be represented by more than one letter. The exact pronunciation depends mainly on word order and usage and is governed by a complex set of rules, variable to some degree by dialect. These have to be learned by the speaker of the language.


== Modern alphabet ==
== Modern alphabet ==
Line 10: Line 10:
|align="center" colspan="26" | '''[[Capital letters|Majuscule Forms]]''' (also called '''uppercase''' or '''capital letters''')
|align="center" colspan="26" | '''[[Capital letters|Majuscule Forms]]''' (also called '''uppercase''' or '''capital letters''')
|-
|-
|width="2%" align="center"|[[A]]||width="2%" align="center"|[[B]]||width="2%" align="center"|[[C]]||width="2%" align="center"|[[List of Latin-script trigraphs|CHJ]]||width="2%" align="center"|[[D]]||width="2%" align="center"|[[E]]||width="2%" align="center"|[[F]]||width="2%" align="center"|[[G]]||width="2%" align="center"|[[Ghj (trigraph)|GHJ]]||width="2%" align="center"|[[H]]||width="2%" align="center"|[[I]]||width="2%" align="center"|[[J]]||width="2% align="center"|[[L]]||width="2%" align="center"|[[M]]||width="2%" align="center"|[[N]]||width="2%" align="center"|[[O]]||width="2%" align="center"|[[P]]||width="2% align="center"|[[Q]]||width="2%" align="center"|[[R]]||width="2%" align="center"|[[S]]||width="2%" align="center"|[[Sc (digraph)|SC]]||width="2%" align="center"|[[Sg (digraph)|SG]]||width="2%" align="center"|[[T]]||width="2%" align="center"|[[U]]||width="2% align="center"|[[V]]||width="2%" align="center"|[[Z]]
|width="2%" align="center"|[[A]]||width="2%" align="center"|[[B]]||width="2%" align="center"|[[C]]||width="2%" align="center"|[[List of Latin-script trigraphs|CHJ]]||width="2%" align="center"|[[D]]||width="2%" align="center"|[[E]]||width="2%" align="center"|[[F]]||width="2%" align="center"|[[G]]||width="2%" align="center"|[[Ghj (trigraph)|GHJ]]||width="2%" align="center"|[[H]]||width="2%" align="center"|[[I]]||width="2%" align="center"|[[J]]||width="2%" align="center"|[[L]]||width="2%" align="center"|[[M]]||width="2%" align="center"|[[N]]||width="2%" align="center"|[[O]]||width="2%" align="center"|[[P]]||width="2%" align="center"|[[Q]]||width="2%" align="center"|[[R]]||width="2%" align="center"|[[S]]||width="2%" align="center"|[[Sc (digraph)|SC]]||width="2%" align="center"|[[Sg (digraph)|SG]]||width="2%" align="center"|[[T]]||width="2%" align="center"|[[U]]||width="2%" align="center"|[[V]]||width="2%" align="center"|[[Z]]
|-
|-
|align="center" colspan="26" | '''[[Lower case|Minuscule Forms]]''' (also called '''lowercase''' or '''small letters''')
|align="center" colspan="26" | '''[[Lower case|Minuscule Forms]]''' (also called '''lowercase''' or '''small letters''')
Line 20: Line 20:
|width="2%" align="center"|''à''||width="2%" align="center"|''bì''||width="2%" align="center"|''cì''||width="2%" align="center"|''chjì''||width="2%" align="center"|''dè''||width="2%" align="center"|''è''||width="2%" align="center"|''effe''||width="2%" align="center"|''gì''||width="2%" align="center"|''ghjè''||width="2%" align="center"|''acca''||width="2%" align="center"|''ì''||width="2%" align="center"|''jì''||width="2%" align="center"|''elle''||width="2%" align="center"|''emme''||width="2%" align="center"|''enne''||width="2%" align="center"|''ò''||width="2%" align="center"|''pè''||width="2%" align="center"|''cù''||width="2%" align="center"|''erre''||width="2%" align="center"|''esse''||width="2%" align="center"|''esci''||width="2%" align="center"|''esge''||width="2%" align="center"|''tì''||width="2%" align="center"|''ù''||width="2%" align="center"|''vè''||width="2%" align="center"|''zeda''
|width="2%" align="center"|''à''||width="2%" align="center"|''bì''||width="2%" align="center"|''cì''||width="2%" align="center"|''chjì''||width="2%" align="center"|''dè''||width="2%" align="center"|''è''||width="2%" align="center"|''effe''||width="2%" align="center"|''gì''||width="2%" align="center"|''ghjè''||width="2%" align="center"|''acca''||width="2%" align="center"|''ì''||width="2%" align="center"|''jì''||width="2%" align="center"|''elle''||width="2%" align="center"|''emme''||width="2%" align="center"|''enne''||width="2%" align="center"|''ò''||width="2%" align="center"|''pè''||width="2%" align="center"|''cù''||width="2%" align="center"|''erre''||width="2%" align="center"|''esse''||width="2%" align="center"|''esci''||width="2%" align="center"|''esge''||width="2%" align="center"|''tì''||width="2%" align="center"|''ù''||width="2%" align="center"|''vè''||width="2%" align="center"|''zeda''
|-
|-
|align="center" colspan="26" | '''[[IPA]] transcription of names'''
|align="center" colspan="26" | '''[[International Phonetic Alphabet|IPA]] transcription of names'''
|-
|-
|width="2%" align="center"|{{IPA|[ˈa]}}||width="2%" align="center"|{{IPA|[ˈbi]}}||width="2%" align="center"|{{IPA|[ˈtʃi]}}||width="2%" align="center"|{{IPA|[ˈci]}}||width="2%" align="center"|{{IPA|[ˈdɛ]}}||width="2%" align="center"|{{IPA|[ˈɛ]}}||width="2%" align="center"|{{IPA|[ˈɛffɛ]}}||width="2%" align="center"|{{IPA|[ˈdʒɛ]}}||width="2%" align="center"|{{IPA|[ˈɟɛ]}}||width="2%" align="center"|{{IPA|[ˈakka]}}||width="2%" align="center"|{{IPA|[ˈi]}}||width="2%" align="center"|{{IPA|[ˈʒi]}}||width="2%" align="center"|{{IPA|[ˈɛllɛ]}}||width="2%" align="center"|{{IPA|[ˈɛmmɛ]}}||width="2%" align="center"|{{IPA|[ˈɛnnɛ]}}||width="2%" align="center"|{{IPA|[ˈo]}}||width="2%" align="center"|{{IPA|[ˈpɛ]}}||width="2%" align="center"|{{IPA|[ˈku]}}||width="2%" align="center"|{{IPA|[ˈɛrrɛ]}}||width="2%" align="center"|{{IPA|[ˈɛssɛ]}}||width="2%" align="center"|{{IPA|[ˈɛʃi]}}||width="2%" align="center"|{{IPA|[ˈɛʒɛ]}}||width="2%" align="center"|{{IPA|[ˈti]}}||width="2%" align="center"|{{IPA|[ˈu]}}||width="2%" align="center"|{{IPA|[ˈvɛ]}}||width="2%" align="center"|{{IPA|[ˈdzɛda]}}
|width="2%" align="center"|{{IPA|[ˈa]}}||width="2%" align="center"|{{IPA|[ˈbi]}}||width="2%" align="center"|{{IPA|[ˈtʃi]}}||width="2%" align="center"|{{IPA|[ˈci]}}||width="2%" align="center"|{{IPA|[ˈdɛ]}}||width="2%" align="center"|{{IPA|[ˈɛ]}}||width="2%" align="center"|{{IPA|[ˈɛffɛ]}}||width="2%" align="center"|{{IPA|[ˈdʒɛ]}}||width="2%" align="center"|{{IPA|[ˈɟɛ]}}||width="2%" align="center"|{{IPA|[ˈakka]}}||width="2%" align="center"|{{IPA|[ˈi]}}||width="2%" align="center"|{{IPA|[ˈji]}}|| width="2%" align="center" |{{IPA|[ˈɛllɛ]}}||width="2%" align="center"|{{IPA|[ˈɛmmɛ]}}||width="2%" align="center"|{{IPA|[ˈɛnnɛ]}}||width="2%" align="center"|{{IPA|[ˈo]}}||width="2%" align="center"|{{IPA|[ˈpɛ]}}||width="2%" align="center"|{{IPA|[ˈku]}}||width="2%" align="center"|{{IPA|[ˈɛrrɛ]}}||width="2%" align="center"|{{IPA|[ˈɛssɛ]}}||width="2%" align="center"|{{IPA|[ˈɛʃi]}}||width="2%" align="center"|{{IPA|[ˈɛʒɛ]}}||width="2%" align="center"|{{IPA|[ˈti]}}||width="2%" align="center"|{{IPA|[ˈu]}}||width="2%" align="center"|{{IPA|[ˈvɛ]}}||width="2%" align="center"|{{IPA|[ˈdzɛda]}}
|}
|}


Notes :
Notes:
* Unlike French, there are no mute letters (and notably no mute e, even if an unstressed letter E/e may be pronounced like a [[schwa]] or mutated into another unstressed vowel);
* Unlike French, there are no mute letters (and notably no mute e, even if an unstressed letter E/e may be pronounced like a [[schwa]] or mutated into another unstressed vowel);
* the letter H/h only occurs after another consonant to form digrams or trigrams : CH/ch, CHJ/chj, DH/dh (rare for Southern dialects), GH/gh, GHJ/ghj;
* the letter H/h only occurs after another consonant to form digrams or trigrams : CH/ch, CHJ/chj, DH/dh (rare for Southern dialects), GH/gh, GHJ/ghj or alone to differentiate two homophones. Example: ''{{lang|co|è}}'' "and" and ''{{lang|co|hè}}'' "is";
* the letter J/j may be found in older transcriptions (before the adoption of a stable orthography), where today it is preferably written with the digram SG/sg; otherwise it only occurs in trigrams : CHJ/chj or GHJ/ghj;
* the letter J/j may be found in older transcriptions (before the adoption of a stable orthography), where today it is preferably written with the digram SG/sg; otherwise it only occurs in trigrams : CHJ/chj or GHJ/ghj;
* the letter Q/q only occurs in the consonantal digram QU/qu;
* the letter Q/q only occurs in the consonantal digram QU/qu;
* the letters K/k (cappa {{IPA|[ˈkappa]}}), W/w (vè dòppio {{IPA|[ˈvɛ ˈdɔppio]}}), X/x (iquèsi {{IPA|[iˈkɛzi]}}), Y/y (i grècu {{IPA|[ˈi ˈgɾɛku]}}) are not used;
* the letters K/k (cappa {{IPA|[ˈkappa]}}), W/w (vè dòppio {{IPA|[ˈvɛ ˈdɔppio]}}), X/x (iquèsi {{IPA|[iˈkɛzi]}}), Y/y (i grècu {{IPA|[ˈi ˈɡɾɛku]}}) are not used;
* for collation purpose, the digraphs and trigraphs are split into their component letters.
* for collation purpose, the digraphs and trigraphs are split into their component letters.


== Basic diacritics ==
Basic diacritics:

The Corsican language is stressed on varying syllables, even if most often the stress occurs on the penultimate syllable (monosyllabic words are most often stressed, but may be unstressed in a few cases). As the position of the stress is distinctive in many terms, the stress needs to be distinguished. The grave accent is then written above the wowel of the stressed syllable, if it's not the penultimate one. The stress is also marked on monosyllabic words.
The Corsican language is stressed on varying syllables, even if most often the stress occurs on the penultimate syllable (monosyllabic words are most often stressed, but may be unstressed in a few cases). As the position of the stress is distinctive in many terms, the stress needs to be distinguished. The grave accent is then written above the vowel of the stressed syllable, if it is not the penultimate one. The stress is also marked on monosyllabic words.


The following letters can then occur in standard Corsican orthographies:
The following letters can then occur in standard Corsican orthographies:
: À/à, È/è, Ì/ì, Ò/ò, Ù/u.
: À/à, È/è, Ì/ì, Ò/ò, Ù/ù.


In addition, Corsican includes vocalic diphthongs, that count as a single syllable. If that syllable is stressed, the first vowel is softened or reduced, and the second vowel holds the stress mark which must be written (IÀ/ià, IÈ/iè, IÒ/iò, IÙ/iù).
In addition, Corsican includes vocalic diphthongs that count as a single syllable. If that syllable is stressed, the first vowel is softened or reduced, and the second vowel holds the stress mark which must be written (IÀ/ià, IÈ/iè, IÒ/iò, IÙ/iù).


However, in other unstressed syllables, the default orthography considers vowel pairs as unstressed diphthongs counting for a single syllable (IA/ia, IE/ie, IO/io, IU/iu); if the two vowels need to be separated, and none of them are stressed, a diaeresis mark may sometimes be used on the first vowel (ÏA/ïa, ÏE/ïe, ÏO/ïo, ÏU/ïu). This case is not always followed, except for academic purpose to exhibit the absence of diphthong and the syllabic break : most writers don't use it. The diaeresis is also not needed in the more common case, where the vowel pair is stressed on the leading I/i without a diphthong, as the stress mark already marks the diaeresis (ÌA/ìa, ÌE/ìe, ÌO/ìo, ÌU/ìu). But when this vowel pair is final, the stress mark on the first vowel is most frequently not written (except for academic purpose) because such diphthongs normally don't occur on the final position. For example, ''zìu'' (uncle) {{IPA|[ˈtsi·u]}} is most often written just as ''ziu'' ; same thing about ''Bastìa'' {{IPA|[Bas·ˈti·a]}} most often written just as ''[[Bastia]]'' (even if it's ''not'' pronounced {{IPA|[Bas·ˈtja]}}).
However, in other unstressed syllables, the default orthography considers vowel pairs as unstressed diphthongs counting for a single syllable (IA/ia, IE/ie, IO/io, IU/iu); if the two vowels need to be separated, and none of them are stressed, a diaeresis mark may sometimes be used on the first vowel (ÏA/ïa, ÏE/ïe, ÏO/ïo, ÏU/ïu). This case is not always followed, except for academic purpose to exhibit the absence of diphthong and the syllabic break: most writers don't use it. The diaeresis is also not needed in the more common case, where the vowel pair is stressed on the leading I/i without a diphthong, as the stress mark already marks the diaeresis (ÌA/ìa, ÌE/ìe, ÌO/ìo, ÌU/ìu). But when this vowel pair is final, the stress mark on the first vowel is most frequently not written (except for academic purpose) because such diphthongs normally do not occur on the final position. For example, ''zìu'' (uncle) {{IPA|[ˈtsi.u]}} is most often written just as ''ziu''; similarly with ''Bastìa'' {{IPA|[basˈti.a]}}, most often written just as ''[[Bastia]]'' (even though it is ''not'' pronounced {{IPA|[ˈbas.tja]}}).


Also, the vowel U/u is also used in digraphs to form complex consonants CU/cu, GU/gu, QU/qu, before one of the vowels A/a, E/e, I/i, O/o (which may be stressed or not). If the letter U/u must still be separated to avoid the digraph of the complex consonant, a diaeresis will be used above U/u to separate the syllables. This case occurs in : CÜ/cü or GÜ/gü when those syllables are not stressed. If one of those syllables are stressed, it takes the normal grave accent, and the vowel after it is written normally without needing any diaeresis.
Also, the vowel U/u is also used in digraphs to form complex consonants CU/cu, GU/gu, QU/qu, before one of the vowels A/a, E/e, I/i, O/o (which may be stressed or not). If the letter U/u must still be separated to avoid the digraph of the complex consonant, a diaeresis will be used above U/u to separate the syllables. This case occurs in CÜ/cü or GÜ/gü when those syllables are not stressed. If one of those syllables are stressed, it takes the normal grave accent, and the vowel after it is written normally without needing any diaeresis.


Unlike Italian (where nasalized vowels have disappeared), the nasalization of vowels E/e, I/i, O/o can occur frequently in Corsican, on stressed or unstressed syllables before N/n. As this nasalization is normally mandatory, and does not mute the letter N/n (unlike French), no diacritic is needed: if more rare cases where the vowel must not be nasalized, the letter N/n is doubled.
Unlike Italian (where nasalized vowels have disappeared), the nasalization of vowels E/e, I/i, O/o can occur frequently in Corsican, on stressed or unstressed syllables before N/n. As this nasalization is normally mandatory, and does not mute the letter N/n (unlike French), no diacritic is needed; in more rare cases where the vowel must not be nasalized, the letter N/n is doubled.


The basic alphabet for standard Corsican in modern orthography is then :
The basic alphabet for standard Corsican in modern orthography is then:
: A a (À à), B b, C c, D d, E e (È è), F f, G g, H h, I i (Ì ì, Ï ï), J j, L l, M m, N n, O o (Ò ò), P p, Q q, R r, S s, T t, U u (Ù ù, Ü u), V v, Z z.
: A a (À à), B b, C c, D d, E e (È è), F f, G g, H h, I i (Ì ì, Ï ï), J j, L l, M m, N n, O o (Ò ò), P p, Q q, R r, S s, T t, U u (Ù ù, Ü ü), V v, Z z.


All these letters can be typed with the standard French keyboard.
All these letters can be typed with the standard French keyboard.
Line 57: Line 58:
Corsican also contains phonetic distinctions for the aperture of vowels E/e and O/o, which may be distinctive in some cases.
Corsican also contains phonetic distinctions for the aperture of vowels E/e and O/o, which may be distinctive in some cases.


However, given that the phonetic varies in regional dialectal variants of the language (where the distinction of aperture may also become a mutation of the vowel, notably in the southern dialects), the distinction of aperture is generally not written, even if this creates homographs whose meaning is revealed by the context. Some early Corsican transcriptions however have used the acute accent on É/é for the [[closed e]], however this is not necessary in the modern orthography because a stressed È/è is normally already meant as a closed e (IPA: {{IPA|[e]}}), and an unstressed E/e most often mutates into another vowel, instead of being pronounced as [[open e]](IPA: {{IPA|[ɛ]}}).
However, given that the phonetics varies in regional dialectal variants of the language (where the distinction of aperture may also become a mutation of the vowel, notably in the southern dialects), the distinction of aperture is generally not written, even if this creates homographs whose meaning is revealed by the context. Some early Corsican transcriptions however have used the acute accent on É/é for the [[closed e]], however this is not necessary in the modern orthography because a stressed È/è is normally already meant as a close e (IPA: {{IPA|[e]}}), and an unstressed E/e most often mutates into another vowel, instead of being pronounced as [[Latin epsilon|open e]] (IPA: {{IPA|[ɛ]}}).


As well, the combination Ô/ô has been found in older transcriptions to mean the [[closed o]] (IPA: {{IPA|[o]}}), where it is normally stressed, and it is now preferably written as Ò/ò like other stressed vowels, the absence of diacritic (except on penultimate syllables) generally implying the [[open o]] (IPA: {{IPA|[ɔ]}}).
As well, the combination Ô/ô has been found in older transcriptions to mean the [[close o]] (IPA: {{IPA|[o]}}), where it is normally stressed, and it is now preferably written as Ò/ò like other stressed vowels, the absence of diacritic (except on penultimate syllables) generally implying the [[open o]] (IPA: {{IPA|[ɔ]}}).


Finally, Corsican texts may sometimes contain words imported from French (most often proper names for people, or toponyms).
Finally, Corsican texts may sometimes contain words imported from French (most often proper names for people, or toponyms).


With these common extensions needed for modern Corsican, the extended alphabet is:
With these common extensions needed for modern Corsican, the extended alphabet is:
: A a [ â] (À à), [Æ æ], B b, C c [Ç ç], D d, E e [É é, Ê ê] (È è) [Ë ë], F f, G g, H h, I i (Ì ì) [Î î, Ï ï], J j, [K k], L l, M m, N n [Ñ ñ], O o [Ô ô] (Ò ò), [Œ œ], P p, Q q, R r, S s, T t, U u (Ù ù) [Ü ü], V v, [W w], [X x], [Y y, Ÿ ÿ], Z z.
: A a [ â] (À à), [Æ æ], B b, C c [Ç ç], D d, E e [É é, Ê ê] (È è) [Ë ë], F f, G g, H h, I i (Ì ì) [Î î, Ï ï], J j, K k, L l, M m, N n , æǽ [Ô ô] (Ò ò), [Œ œ], P p, Q q, R r, S s, T t, U u (Ù ù) [Ü ü], V v, [W w], [X x], [Y y, Ÿ ÿ], Z z.


Like French, the rare ligatured letters Æ/æ and Œ/œ are treated as a+e and o+e for collation purposes.
Like French, the rare ligatured letters Æ/æ and Œ/œ are treated as a+e and o+e for collation purposes.
Line 70: Line 71:
==External links==
==External links==
* {{cite web|title=L'alphabet – U santacroce / U salteriu|url=http://pagesperso-orange.fr/gbatti-alinguacorsa/lexiques/alphabet.htm|publisher=A Lingua Corsa|year=2008|accessdate=2008-06-19}}
* {{cite web|title=L'alphabet – U santacroce / U salteriu|url=http://pagesperso-orange.fr/gbatti-alinguacorsa/lexiques/alphabet.htm|publisher=A Lingua Corsa|year=2008|accessdate=2008-06-19}}
* {{cite web|title=Corsican (corsu)|url=http://www.omniglot.com/writing/corsican.htm|publisher=Omniglot|first=Simon|last=Ager|date=1998–2008|accessdate=2008-06-19}}
* {{cite web|title=Corsican (corsu)|url=http://www.omniglot.com/writing/corsican.htm|publisher=Omniglot|first=Simon|last=Ager|date=1998–2008|accessdate=2008-06-19|archive-url=https://web.archive.org/web/20061128105910/http://www.omniglot.com/writing/corsican.htm|archive-date=2006-11-28|url-status=dead}}
* {{cite web|title=A lingua corsa, Accolta|language=fr|first=G.|last=Batti|url=http://gbatti-alinguacorsa.pagesperso-orange.fr/|date=2003-09-24|accessdate=2011-02-26}} An extensive description of the Corsican language, with many references.
* {{cite web|title=A lingua corsa, Accolta|language=fr|first=G.|last=Batti|url=http://gbatti-alinguacorsa.pagesperso-orange.fr/|date=2003-09-24|accessdate=2011-02-26}} An extensive description of the Corsican language, with many references.



Latest revision as of 14:54, 8 April 2024

The modern Corsican alphabet (Corsican: u santacroce or u salteriu) uses twenty-two basic letters taken from the Latin alphabet with some changes, plus some multigraphs. The pronunciations of the English, French, Italian or Latin forms of these letters are not a guide to their pronunciation in Corsican, which has its own pronunciation, often the same, but frequently not. As can be seen from the table below, two of the phonemic letters are represented as trigraphs, plus some other digraphs. Nearly all the letters are allophonic; that is, a phoneme of the language might have more than one pronunciation and be represented by more than one letter. The exact pronunciation depends mainly on word order and usage and is governed by a complex set of rules, variable to some degree by dialect. These have to be learned by the speaker of the language.

Modern alphabet

[edit]
Order
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Majuscule Forms (also called uppercase or capital letters)
A B C CHJ D E F G GHJ H I J L M N O P Q R S SC SG T U V Z
Minuscule Forms (also called lowercase or small letters)
a b c chj d e f g ghj h i j l m n o p q r s sc sg t u v z
Names
à chjì è effe ghjè acca ì elle emme enne ò erre esse esci esge ù zeda
IPA transcription of names
[ˈa] [ˈbi] [ˈtʃi] [ˈci] [ˈdɛ] [ˈɛ] [ˈɛffɛ] [ˈdʒɛ] [ˈɟɛ] [ˈakka] [ˈi] [ˈji] [ˈɛllɛ] [ˈɛmmɛ] [ˈɛnnɛ] [ˈo] [ˈpɛ] [ˈku] [ˈɛrrɛ] [ˈɛssɛ] [ˈɛʃi] [ˈɛʒɛ] [ˈti] [ˈu] [ˈvɛ] [ˈdzɛda]

Notes:

  • Unlike French, there are no mute letters (and notably no mute e, even if an unstressed letter E/e may be pronounced like a schwa or mutated into another unstressed vowel);
  • the letter H/h only occurs after another consonant to form digrams or trigrams : CH/ch, CHJ/chj, DH/dh (rare for Southern dialects), GH/gh, GHJ/ghj or alone to differentiate two homophones. Example: è "and" and "is";
  • the letter J/j may be found in older transcriptions (before the adoption of a stable orthography), where today it is preferably written with the digram SG/sg; otherwise it only occurs in trigrams : CHJ/chj or GHJ/ghj;
  • the letter Q/q only occurs in the consonantal digram QU/qu;
  • the letters K/k (cappa [ˈkappa]), W/w (vè dòppio [ˈvɛ ˈdɔppio]), X/x (iquèsi [iˈkɛzi]), Y/y (i grècu [ˈi ˈɡɾɛku]) are not used;
  • for collation purpose, the digraphs and trigraphs are split into their component letters.

Basic diacritics:

The Corsican language is stressed on varying syllables, even if most often the stress occurs on the penultimate syllable (monosyllabic words are most often stressed, but may be unstressed in a few cases). As the position of the stress is distinctive in many terms, the stress needs to be distinguished. The grave accent is then written above the vowel of the stressed syllable, if it is not the penultimate one. The stress is also marked on monosyllabic words.

The following letters can then occur in standard Corsican orthographies:

À/à, È/è, Ì/ì, Ò/ò, Ù/ù.

In addition, Corsican includes vocalic diphthongs that count as a single syllable. If that syllable is stressed, the first vowel is softened or reduced, and the second vowel holds the stress mark which must be written (IÀ/ià, IÈ/iè, IÒ/iò, IÙ/iù).

However, in other unstressed syllables, the default orthography considers vowel pairs as unstressed diphthongs counting for a single syllable (IA/ia, IE/ie, IO/io, IU/iu); if the two vowels need to be separated, and none of them are stressed, a diaeresis mark may sometimes be used on the first vowel (ÏA/ïa, ÏE/ïe, ÏO/ïo, ÏU/ïu). This case is not always followed, except for academic purpose to exhibit the absence of diphthong and the syllabic break: most writers don't use it. The diaeresis is also not needed in the more common case, where the vowel pair is stressed on the leading I/i without a diphthong, as the stress mark already marks the diaeresis (ÌA/ìa, ÌE/ìe, ÌO/ìo, ÌU/ìu). But when this vowel pair is final, the stress mark on the first vowel is most frequently not written (except for academic purpose) because such diphthongs normally do not occur on the final position. For example, zìu (uncle) [ˈtsi.u] is most often written just as ziu; similarly with Bastìa [basˈti.a], most often written just as Bastia (even though it is not pronounced [ˈbas.tja]).

Also, the vowel U/u is also used in digraphs to form complex consonants CU/cu, GU/gu, QU/qu, before one of the vowels A/a, E/e, I/i, O/o (which may be stressed or not). If the letter U/u must still be separated to avoid the digraph of the complex consonant, a diaeresis will be used above U/u to separate the syllables. This case occurs in CÜ/cü or GÜ/gü when those syllables are not stressed. If one of those syllables are stressed, it takes the normal grave accent, and the vowel after it is written normally without needing any diaeresis.

Unlike Italian (where nasalized vowels have disappeared), the nasalization of vowels E/e, I/i, O/o can occur frequently in Corsican, on stressed or unstressed syllables before N/n. As this nasalization is normally mandatory, and does not mute the letter N/n (unlike French), no diacritic is needed; in more rare cases where the vowel must not be nasalized, the letter N/n is doubled.

The basic alphabet for standard Corsican in modern orthography is then:

A a (À à), B b, C c, D d, E e (È è), F f, G g, H h, I i (Ì ì, Ï ï), J j, L l, M m, N n, O o (Ò ò), P p, Q q, R r, S s, T t, U u (Ù ù, Ü ü), V v, Z z.

All these letters can be typed with the standard French keyboard.

Corsican also needs an orthographic apostrophe to mark the elision, preferably written in its curly form (’) for good typography, even though the vertical ASCII quote (') is common.

Extended diacritics

[edit]

Corsican also contains phonetic distinctions for the aperture of vowels E/e and O/o, which may be distinctive in some cases.

However, given that the phonetics varies in regional dialectal variants of the language (where the distinction of aperture may also become a mutation of the vowel, notably in the southern dialects), the distinction of aperture is generally not written, even if this creates homographs whose meaning is revealed by the context. Some early Corsican transcriptions however have used the acute accent on É/é for the closed e, however this is not necessary in the modern orthography because a stressed È/è is normally already meant as a close e (IPA: [e]), and an unstressed E/e most often mutates into another vowel, instead of being pronounced as open e (IPA: [ɛ]).

As well, the combination Ô/ô has been found in older transcriptions to mean the close o (IPA: [o]), where it is normally stressed, and it is now preferably written as Ò/ò like other stressed vowels, the absence of diacritic (except on penultimate syllables) generally implying the open o (IPA: [ɔ]).

Finally, Corsican texts may sometimes contain words imported from French (most often proper names for people, or toponyms).

With these common extensions needed for modern Corsican, the extended alphabet is:

A a [ â] (À à), [Æ æ], B b, C c [Ç ç], D d, E e [É é, Ê ê] (È è) [Ë ë], F f, G g, H h, I i (Ì ì) [Î î, Ï ï], J j, K k, L l, M m, N n , æǽ [Ô ô] (Ò ò), [Œ œ], P p, Q q, R r, S s, T t, U u (Ù ù) [Ü ü], V v, [W w], [X x], [Y y, Ÿ ÿ], Z z.

Like French, the rare ligatured letters Æ/æ and Œ/œ are treated as a+e and o+e for collation purposes.

[edit]
  • "L'alphabet – U santacroce / U salteriu". A Lingua Corsa. 2008. Retrieved 2008-06-19.
  • Ager, Simon (1998–2008). "Corsican (corsu)". Omniglot. Archived from the original on 2006-11-28. Retrieved 2008-06-19.
  • Batti, G. (2003-09-24). "A lingua corsa, Accolta" (in French). Retrieved 2011-02-26. An extensive description of the Corsican language, with many references.