Lao

 

[This document is for discussion, and should not yet be used as a reference.  It closely follows pages 116-119 of the ALA-LC Romanization Tables, with annotations and typographic corrections by Doug Cooper, January 2006.]

 

Consonants (see note 1)

 

Vernacular    Romanization When Initial    Romanization When Final

ກ             k                            k
ຂ             kh                           (k)
ຄ             kh                           (k)
ງ             ng                           ng
ຈ             ch                           (t)
ສ             s                            (t)
ຊ             s                            (t)
ຍ             ny                           -
ດ             d                            t
ຕ             t                            (t)
ຖ             th                           (t)
ທ             th                           (t)
ນ             n                            n
ບ             b                            p
ປ             p                            (p)
ຜ             ph                           (p)
ຝ             f                            (p)
ພ             ph                           (p)
ຟ             f                            (p)
ມ             m                            m
ຢ             y                            -
ຣ             r                            (n)
ລ             l (see note 3)               (n)
ວ             v (see note 4)               -
ຫ             h (see note 5)               -
ອ             ʻ (ayn)                      -
ຮ             h                            -
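
The initial/final distinction in the consonant table can be expressed as a simple lookup. What follows is a minimal sketch, not part of the LC tables: the Lao letters are assigned to rows on the assumption that the table follows standard Lao alphabetical order, and the names CONSONANTS and romanize_consonant are hypothetical. Parentheses (finals that occur only in loan-words, note 1) are dropped, and a "-" entry is represented as None.

```python
# Each Lao consonant maps to (romanization when initial, romanization when final).
# None stands for "-" in the table (no final romanization is given).
# Assumption: letters are matched to table rows in standard Lao alphabetical order.
CONSONANTS = {
    "ກ": ("k", "k"), "ຂ": ("kh", "k"), "ຄ": ("kh", "k"), "ງ": ("ng", "ng"),
    "ຈ": ("ch", "t"), "ສ": ("s", "t"), "ຊ": ("s", "t"), "ຍ": ("ny", None),
    "ດ": ("d", "t"), "ຕ": ("t", "t"), "ຖ": ("th", "t"), "ທ": ("th", "t"),
    "ນ": ("n", "n"), "ບ": ("b", "p"), "ປ": ("p", "p"), "ຜ": ("ph", "p"),
    "ຝ": ("f", "p"), "ພ": ("ph", "p"), "ຟ": ("f", "p"), "ມ": ("m", "m"),
    "ຢ": ("y", None), "ຣ": ("r", "n"), "ລ": ("l", "n"), "ວ": ("v", None),
    "ຫ": ("h", None), "ອ": ("\u02bb", None), "ຮ": ("h", None),
}


def romanize_consonant(letter: str, final: bool = False) -> str:
    """Return the table romanization of one consonant in the given position."""
    initial_form, final_form = CONSONANTS[letter]
    if not final:
        return initial_form
    if final_form is None:
        # The table gives "-": the notes (2, 4, 5, 6) govern these cases instead.
        raise ValueError(f"{letter!r} has no final romanization in the table")
    return final_form
```

For example, romanize_consonant("ດ") gives d while romanize_consonant("ດ", final=True) gives t. The special behaviour described in notes 2 to 6 is deliberately left out of this lookup.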

 

 

Obsolete Consonants

 

Vernacular

Romanization

 

s

 

t

 

t

 

tsa

 

l

 

 

Vowels

 

Vernacular

Romanization

-,‌   -

a

-

ā

-

i

-

ī

-

ư

-

ư̄

-

u

 - ຸ ຸ ,    -

ū

ເ-ະ ເ-  ັ

e

ເ-

ē

ແ-ະ    ແ-  ັ

æ

ແ-

ǣ

ໂ-ະ ,    -

o

ໂ-

ō

ເ-າະ ,    - ັອ

o̜

 -,   -

ō̜

ເ- ິະ, ເ-  ິ

œ

ເ- ີ

œ̄

ເ- ັຽະ     ັ-ຽ

ia

‌‌- ັຽ‌‌   -ຽ

īa

ເ-  ຶອະ, ເ- ຶອ

ưa

ເ- ືອ

ư̄a

- ົວະ     ັ-ວ

ua

- ົວ ,     -

ūa

‌‌- ,   ໃ- ,  - ັຍ

ai

-າຍ ,  -າຽ

āi

‌‌- ົາ

ao

-

am

 

 

Special Symbols

 

Vernacular

Romanization

(syllable repeated)

ຯລຯ

la

 

Notes

1.  Some consonants do not occur as finals, except in loan-words.  The romanization for these consonants is enclosed in parentheses in the table.

2.  In final position, ຍ follows a vowel, closing the diphthong, and is romanized as i.  See the vowel table (ai and āi).  In non-initial position, ຍ is often written in a secondary form ຽ.  See the vowel table (ia and īa).

3.  The combining form of ລ is  -ຼ.  It is not romanized except when subscript to ຫ, thus:  ຄຼິສຕຽນ (khitsatīan), ຫຼາຍ (lāi).

4.  In initial position, ວ is romanized as v, thus:  ວັດ (vat).

When medial ວ is followed by a vowel, it forms a cluster with the initial consonant, and is romanized as w, thus:  ຂວາ (khwā), ແຂວງ (khwǣng).

When medial ວ is followed by a final consonant, it is romanized ūa, thus:  ບວກ (būak), ດ້ວຍ (dūai).

When ວ is in final position following a vowel, it is romanized as o, thus:  ນາວ (nāo), ແກ້ວ (kǣo).  See also the vowel table (ua and ūa).  The forms  - ິວ  and  - ີວ are romanized as iu and īu, and the forms ຽວ and ເ- ັຽວ are romanized as īeo.
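
The four cases in note 4 amount to a choice keyed on where the letter stands in the syllable. The sketch below is illustrative only: it assumes the note concerns the letter ວ (the letter shared by all of its examples), the names Position and romanize_wo are hypothetical, and the caller is expected to have classified the position already.

```python
from enum import Enum


class Position(Enum):
    """Where the letter stands in the syllable, following note 4."""
    INITIAL = "initial"
    MEDIAL_BEFORE_VOWEL = "medial, followed by a vowel"
    MEDIAL_BEFORE_FINAL = "medial, followed by a final consonant"
    FINAL_AFTER_VOWEL = "final, following a vowel"


# Romanization for each case; "\u016ba" is u-with-macron followed by a.
WO_ROMANIZATION = {
    Position.INITIAL: "v",
    Position.MEDIAL_BEFORE_VOWEL: "w",
    Position.MEDIAL_BEFORE_FINAL: "\u016ba",
    Position.FINAL_AFTER_VOWEL: "o",
}


def romanize_wo(position: Position) -> str:
    """Return the romanization note 4 prescribes for the given position."""
    return WO_ROMANIZATION[position]
```

With the note's own examples, "kh" + romanize_wo(Position.MEDIAL_BEFORE_VOWEL) + "ā" yields khwā, and "n" + "ā" + romanize_wo(Position.FINAL_AFTER_VOWEL) yields nāo. The special forms romanized iu, īu and īeo are not covered.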

5.  ຫ is not romanized before ຍ (ny), ນ (n), ມ (m), ຣ (r), ລ (l), and ວ (v), thus: ຫຍ້າ (nyā).  ຫນ and ຫມ are often written as ligatures, thus:  ໜ (n) and ໝ (m).

6.  ອ is used in initial position for syllables beginning with a vowel, thus ອີກ (ʻīk).  When not in initial position, ອ is a vowel.  See the vowel table (o̜, ō̜, ưa, ư̄a).

7.  In the vowel columns, - shows the position of the consonant relative to the vowel.  In all cases the romanization of the vowel is intended to follow the romanization of the consonant even though parts of the vernacular representation of the vowel may be written before the consonant.

8.  The numerals are ໐ (0), ໑ (1), ໒ (2), ໓ (3), ໔ (4), ໕ (5), ໖ (6), ໗ (7), ໘ (8), and ໙ (9).
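
Because the ten Lao digits are contiguous in Unicode (U+0ED0 through U+0ED9), the numeral mapping in note 8 can be written as a translation table. This is a minimal sketch; the names LAO_TO_ASCII_DIGITS and romanize_numerals are hypothetical.

```python
# Lao digits occupy U+0ED0 (zero) through U+0ED9 (nine).
LAO_TO_ASCII_DIGITS = {0x0ED0 + n: str(n) for n in range(10)}


def romanize_numerals(text: str) -> str:
    """Replace Lao digits with ASCII digits; every other character is kept."""
    return text.translate(LAO_TO_ASCII_DIGITS)
```

Characters outside the digit range pass through unchanged, so the function can be applied to a whole string before or after the letters are romanized.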

9.  In Lao, words are not written separately.  In romanization, however, the shortest written form which can stand alone as a word is treated as such.  This applies also to Pali and Sanskrit loan-words.  Other loan-words have the same division as in the original language.  Tonal marks (-່, -້, -໊, -໋) are not romanized.
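
Since tonal marks are not romanized, a romanization routine can discard them before doing anything else. The sketch below assumes the marks meant are the four Lao tone marks at U+0EC8 through U+0ECB (mai ek, mai tho, mai ti, mai catawa); the names TONE_MARKS and strip_tone_marks are hypothetical.

```python
# Map each tone mark code point (U+0EC8..U+0ECB) to None so translate() deletes it.
TONE_MARKS = dict.fromkeys(range(0x0EC8, 0x0ECC))


def strip_tone_marks(text: str) -> str:
    """Remove Lao tone marks, leaving consonants and vowel signs in place."""
    return text.translate(TONE_MARKS)
```

Applied to ແກ້ວ, this returns ແກວ, which can then be romanized from the consonant and vowel tables alone.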

10.  When the pronunciation requires one consonant to serve a double function, at the end of one syllable and at the beginning of the next syllable, it is romanized twice according to the respective values, thus:  ຄຼິສຕຽນ (khitsatīan).

Special Characters and Character Modifiers used in Romanization
(See the MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media / CHARACTER SETS: Part 3 / Code Tables / January 2000, updated September 2004)

http://www.loc.gov/marc/specifications/specchartables.html
http://lcweb2.loc.gov/cocoon/codetables/45.html  (Extended Latin)

http://www.atla.com/tsig/LatinCharacters/Latin%20Characters%20in%20Unicode%20and%20MARC-8.pdf  (full set of examples)

 

Character                                             Æ

LC-ALA Name                                    digraph AE (upper case)

USMARC hex code                             A5

Unicode Name                                     LATIN CAPITAL LETTER AE

Unicode hex code / decimal                00C6 / 198

 

Character                                             Œ

LC-ALA Name                                    digraph OE (upper case)

USMARC hex code                             A6

Unicode Name                                     LATIN CAPITAL LIGATURE OE

Unicode hex code / decimal                0152 / 338

 

Character                                             æ

LC-ALA Name                                    digraph ae (lower case)

USMARC hex code                             B5

Unicode Name                                     LATIN SMALL LETTER AE

Unicode hex code / decimal                00E6 / 230

 

Character                                             œ

LC-ALA Name                                    digraph oe (lower case)

USMARC hex code                             B6

Unicode Name                                     LATIN SMALL LIGATURE OE

Unicode hex code / decimal                0153 / 339

 

Character                                             Ư

LC-ALA Name                                    U-hook (upper case)

USMARC hex code                             AD

Unicode Name                                     LATIN CAPITAL LETTER U WITH HORN

Unicode hex code / decimal                01AF / 431

 

Character                                             ư

LC-ALA Name                                    u-hook (lower case)

USMARC hex code                             BD

Unicode Name                                     LATIN SMALL LETTER U WITH HORN

Unicode hex code / decimal                01B0 / 432

 

Character                                             ʻ

LC-ALA Name                                    ayn

USMARC hex code                             B0

Unicode Name                                     MODIFIER LETTER TURNED COMMA

Unicode hex code / decimal                02BB / 699

 

Character                                                        ̄

LC-ALA Name                                    macron

USMARC hex code                             E5

Unicode Name                                     COMBINING MACRON

Unicode hex code / decimal                0304 / 772

Ā LATIN CAPITAL LETTER A WITH MACRON 0100 / 256

ā LATIN SMALL LETTER A WITH MACRON 0101 / 257

Ō LATIN CAPITAL LETTER O WITH MACRON 014C / 332

ō LATIN SMALL LETTER O WITH MACRON 014D / 333

Ī LATIN CAPITAL LETTER I WITH MACRON 012A / 298

ī  LATIN SMALL LETTER I WITH MACRON 012B / 299

Ē LATIN CAPITAL LETTER E WITH MACRON 0112 / 274

ē LATIN SMALL LETTER E WITH MACRON 0113 / 275

Ū LATIN CAPITAL LETTER U WITH MACRON 016A / 362

ū LATIN SMALL LETTER U WITH MACRON 016B / 363

Ǣ LATIN CAPITAL LETTER AE WITH MACRON 01E2 / 482

ǣ LATIN SMALL LETTER AE WITH MACRON 01E3 / 483
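
The precomposed letters listed above are what Unicode NFC normalization produces from a base letter followed by COMBINING MACRON. The following sketch uses Python's standard unicodedata module to show this; the helper name with_macron is hypothetical. Letters with no precomposed macron form, such as ư, remain a two-code-point sequence.

```python
import unicodedata

MACRON = "\u0304"  # COMBINING MACRON


def with_macron(base: str) -> str:
    """Append a combining macron and compose to a single code point if one exists."""
    return unicodedata.normalize("NFC", base + MACRON)


# a + macron composes to U+0101; ae + macron composes to U+01E3.
# u-with-horn + macron has no precomposed form and stays as two code points.
for letter in ("a", "\u00e6", "\u01b0"):
    composed = with_macron(letter)
    print(letter, [f"{ord(c):04X}" for c in composed])
```

Comparing romanized strings therefore works reliably only after both sides have been normalized to the same form.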

 

Character                                             ̜

LC-ALA Name                                    right cedilla

USMARC hex code                             F8

Unicode Name                                     COMBINING LEFT HALF RING BELOW

Unicode hex code / decimal                031C / 796
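
The code pairs in this section can be collected into a small conversion table. The sketch below is illustrative and far from a full MARC-8 decoder: it covers only ASCII plus the characters listed here, takes U-hook (upper case) as AD per the MARC 21 Extended Latin code table, and relies on the MARC-8 convention that a combining character precedes the letter it modifies, whereas in Unicode it follows. The function name marc8_subset_to_unicode is hypothetical.

```python
import unicodedata

# MARC-8 hex code -> Unicode, limited to the characters used in Lao romanization.
SPACING = {
    0xA5: "\u00C6", 0xA6: "\u0152",  # digraphs AE, OE (upper case)
    0xB5: "\u00E6", 0xB6: "\u0153",  # digraphs ae, oe (lower case)
    0xAD: "\u01AF", 0xBD: "\u01B0",  # U-hook, u-hook
    0xB0: "\u02BB",                  # ayn
}
COMBINING = {
    0xE5: "\u0304",  # macron
    0xF8: "\u031C",  # right cedilla
}


def marc8_subset_to_unicode(data: bytes) -> str:
    """Decode ASCII plus the subset above, moving combining marks after their base."""
    out = []
    pending = []  # combining marks seen before their base letter
    for byte in data:
        if byte in COMBINING:
            pending.append(COMBINING[byte])
            continue
        if byte in SPACING:
            char = SPACING[byte]
        elif byte < 0x80:
            char = chr(byte)
        else:
            raise ValueError(f"byte {byte:#04x} is outside this sketch's subset")
        out.append(char)
        out.extend(pending)  # Unicode order: base letter first, then its marks
        pending.clear()
    if pending:
        raise ValueError("combining mark at end of input with no base letter")
    return unicodedata.normalize("NFC", "".join(out))
```

For instance, the bytes 6B 68 E5 61 decode to khā. A production system would use a complete MARC-8 conversion library rather than a hand-built subset like this one.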