org.thdl.tib.text.tshegbar
Interface UnicodeConstants

All Known Implementing Classes:
LegalTshegBar, test, UnicodeGraphemeCluster, UnicodeUtils

public interface UnicodeConstants

Provides handy Extended Wylie-inspired names for Unicode codepoints commonly used to represent Tibetan. The consonant that the Extended Wylie text "ka" refers to is named EWC_ka as in "The Extended Wylie Consonant ka", the vowel represented in Extended Wylie by "i" is EWV_i, and so on. There is at least one exception to the naming scheme, but exceptions are well-commented.
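
As a rough illustration of the naming scheme, the sketch below spells the syllable "ki" from one consonant and one vowel sign. The class NamingSketch and its two local constants are stand-ins that only mirror the names used in this interface. Their values come from the Unicode Tibetan block and are assumed, not confirmed, to match the fields declared here.

```java
class NamingSketch {
    // Stand-ins mirroring this interface's names. The values are taken from
    // the Unicode Tibetan block and are assumed to match the real fields.
    static final char EWC_ka = '\u0F40'; // TIBETAN LETTER KA
    static final char EWV_i = '\u0F72';  // TIBETAN VOWEL SIGN I

    /** Spells an unstacked consonant followed by a vowel sign. */
    static String syllable(char consonant, char vowel) {
        return new String(new char[] { consonant, vowel });
    }

    public static void main(String[] args) {
        String ki = syllable(EWC_ka, EWV_i);
        // Print codepoints rather than glyphs so the output is font-independent.
        for (int i = 0; i < ki.length(); i++) {
            System.out.printf("U+%04X%n", (int) ki.charAt(i));
        }
    }
}
```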

Author:
David Chandler
See Also:
LegalTshegBar

Field Summary
static char EW_ABSENT
          for those times when you need a char to represent a non-existent codepoint
static char EW_achung
          In the word for father, "pA lags", there is an a-chung.
static char EWC_a
           
static char EWC_achen
          Note the irregular name.
static char EWC_ba
           
static char EWC_ca
           
static char EWC_cha
           
static char EWC_da
           
static char EWC_dza
           
static char EWC_ga
           
static char EWC_ha
           
static char EWC_ja
           
static char EWC_ka
          The first letter of the Tibetan alphabet.
static char EWC_kha
           
static char EWC_la
           
static char EWC_ma
           
static char EWC_na
           
static char EWC_nga
           
static char EWC_nya
           
static char EWC_pa
           
static char EWC_pha
           
static char EWC_ra
           
static char EWC_sa
           
static char EWC_sha
           
static char EWC_ta
           
static char EWC_tha
           
static char EWC_tsa
           
static char EWC_tsha
           
static char EWC_wa
           
static char EWC_ya
           
static char EWC_za
           
static char EWC_zha
           
static char EWSUB_la_btags
          subscribed form of EWC_la
static char EWSUB_ra_btags
          subscribed form of EWC_ra
static char EWSUB_wa_zur
          subscribed form of EWC_wa, also known as wa-btags
static char EWSUB_ya_btags
          subscribed form of EWC_ya
static char EWV_e
          "'greng bu" (also known as "'greng po", and pronounced dang-bo), the 'a' sound in the English word gate.
static char EWV_i
          "gi gu", the 'i' sound in the English word keep.
static char EWV_o
          "na ro", the 'o' sound in the English word bone.
static char EWV_u
          "zhabs kyu", the 'u' sound in the English word tune.
static byte NORM_NFC
          Refers to Unicode Normalization Form C (NFC).
static byte NORM_NFD
          Refers to Unicode Normalization Form D (NFD).
static byte NORM_NFKC
          Refers to Unicode Normalization Form KC (NFKC).
static byte NORM_NFKD
          Refers to Unicode Normalization Form KD (NFKD).
static byte NORM_NFTHDL
          Refers to Normalization Form THDL, which is NFD except for U+0F77 and U+0F79, which are normalized according to NFKD.
static byte NORM_UNNORMALIZED
          Refers to unnormalized Unicode.
 

Field Detail

NORM_UNNORMALIZED

public static final byte NORM_UNNORMALIZED
Refers to unnormalized Unicode, i.e. text to which no normalization form has been applied.

See Also:
Constant Field Values

NORM_NFC

public static final byte NORM_NFC
Refers to Unicode Normalization Form C (NFC): canonical decomposition followed by canonical composition.

See Also:
Constant Field Values

NORM_NFKC

public static final byte NORM_NFKC
Refers to Unicode Normalization Form KC (NFKC): compatibility decomposition followed by canonical composition.

See Also:
Constant Field Values

NORM_NFD

public static final byte NORM_NFD
Refers to Unicode Normalization Form D (NFD): canonical decomposition.

See Also:
Constant Field Values

NORM_NFKD

public static final byte NORM_NFKD
Refers to Unicode Normalization Form KD (NFKD): compatibility decomposition.

See Also:
Constant Field Values
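
For readers on a modern JDK, the four standard forms can be tried out with java.text.Normalizer. The sketch below dispatches on a byte flag in the spirit of the NORM_* constants. The class NormFormSketch and the numeric values given to the flags are hypothetical, since the real constant values are not shown on this page, and nothing here is claimed to be how this library normalizes text.

```java
import java.text.Normalizer;

class NormFormSketch {
    // Hypothetical flag values, chosen only for this illustration.
    static final byte NORM_UNNORMALIZED = 0;
    static final byte NORM_NFC = 1;
    static final byte NORM_NFKC = 2;
    static final byte NORM_NFD = 3;
    static final byte NORM_NFKD = 4;

    /** Applies the standard normalization form selected by the flag. */
    static String normalize(String text, byte norm) {
        switch (norm) {
            case NORM_UNNORMALIZED: return text;
            case NORM_NFC:  return Normalizer.normalize(text, Normalizer.Form.NFC);
            case NORM_NFKC: return Normalizer.normalize(text, Normalizer.Form.NFKC);
            case NORM_NFD:  return Normalizer.normalize(text, Normalizer.Form.NFD);
            case NORM_NFKD: return Normalizer.normalize(text, Normalizer.Form.NFKD);
            default:
                throw new IllegalArgumentException("unsupported normalization: " + norm);
        }
    }

    public static void main(String[] args) {
        // U+0F0C has only a compatibility decomposition, so NFD keeps it
        // while NFKD replaces it.
        String tsheg = "\u0F0C";
        System.out.println(normalize(tsheg, NORM_NFD).equals(tsheg));
        System.out.println(normalize(tsheg, NORM_NFKD).equals(tsheg));
    }
}
```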

NORM_NFTHDL

public static final byte NORM_NFTHDL
Refers to Normalization Form THDL, which is NFD except for U+0F77 and U+0F79, which are normalized according to NFKD. This is the One True Normalization Form, as it leaves no precomposed codepoints and does not normalize U+0F0C.

See Also:
Constant Field Values
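
The description of NFTHDL above can be approximated with java.text.Normalizer: expand U+0F77 and U+0F79 with NFKD, then run NFD over the whole string. This is a minimal sketch under that reading of the description, not this library's implementation, and the class and method names (NfthdlSketch, toNfthdl) are invented for the example.

```java
import java.text.Normalizer;

class NfthdlSketch {
    /** Approximates NFTHDL: NFD everywhere, NFKD for U+0F77 and U+0F79 only. */
    static String toNfthdl(String text) {
        StringBuilder expanded = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c == '\u0F77' || c == '\u0F79') {
                // These two have only compatibility decompositions, so NFD
                // alone would leave them precomposed.
                expanded.append(Normalizer.normalize(String.valueOf(c), Normalizer.Form.NFKD));
            } else {
                expanded.append(c);
            }
        }
        // NFD decomposes the remaining precomposed codepoints and fixes the
        // order of combining marks, but leaves U+0F0C untouched.
        return Normalizer.normalize(expanded.toString(), Normalizer.Form.NFD);
    }

    public static void main(String[] args) {
        String out = toNfthdl("\u0F77\u0F0C");
        for (int i = 0; i < out.length(); i++) {
            System.out.printf("U+%04X ", (int) out.charAt(i));
        }
        System.out.println();
    }
}
```

Handling the two exceptions before the NFD pass keeps the sketch short, because the final pass also takes care of canonical ordering.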

EW_ABSENT

public static final char EW_ABSENT
for those times when you need a char to represent a non-existent codepoint

See Also:
Constant Field Values
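
A sentinel like this is typically used to fill an optional slot in a fixed-shape structure. The sketch below shows that pattern for an optional suffix. The class AbsentSketch is hypothetical, and the value U+0000 used for its local EW_ABSENT is an assumption made for the example, since the real value is not shown on this page.

```java
class AbsentSketch {
    // Stand-ins for illustration. U+0000 is an assumed sentinel value.
    static final char EW_ABSENT = '\u0000';
    static final char EWC_ga = '\u0F42'; // TIBETAN LETTER GA
    static final char EWC_da = '\u0F51'; // TIBETAN LETTER DA

    /** Builds root plus suffix, skipping the suffix slot when it is absent. */
    static String rootWithSuffix(char root, char suffix) {
        StringBuilder sb = new StringBuilder().append(root);
        if (suffix != EW_ABSENT) {
            sb.append(suffix);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(rootWithSuffix(EWC_ga, EW_ABSENT).length());
        System.out.println(rootWithSuffix(EWC_ga, EWC_da).length());
    }
}
```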

EWC_ka

public static final char EWC_ka
The first letter of the Tibetan alphabet.

See Also:
Constant Field Values

EWC_kha

public static final char EWC_kha
See Also:
Constant Field Values

EWC_ga

public static final char EWC_ga
See Also:
Constant Field Values

EWC_nga

public static final char EWC_nga
See Also:
Constant Field Values

EWC_ca

public static final char EWC_ca
See Also:
Constant Field Values

EWC_cha

public static final char EWC_cha
See Also:
Constant Field Values

EWC_ja

public static final char EWC_ja
See Also:
Constant Field Values

EWC_nya

public static final char EWC_nya
See Also:
Constant Field Values

EWC_ta

public static final char EWC_ta
See Also:
Constant Field Values

EWC_tha

public static final char EWC_tha
See Also:
Constant Field Values

EWC_da

public static final char EWC_da
See Also:
Constant Field Values

EWC_na

public static final char EWC_na
See Also:
Constant Field Values

EWC_pa

public static final char EWC_pa
See Also:
Constant Field Values

EWC_pha

public static final char EWC_pha
See Also:
Constant Field Values

EWC_ba

public static final char EWC_ba
See Also:
Constant Field Values

EWC_ma

public static final char EWC_ma
See Also:
Constant Field Values

EWC_tsa

public static final char EWC_tsa
See Also:
Constant Field Values

EWC_tsha

public static final char EWC_tsha
See Also:
Constant Field Values

EWC_dza

public static final char EWC_dza
See Also:
Constant Field Values

EWC_wa

public static final char EWC_wa
See Also:
Constant Field Values

EWC_zha

public static final char EWC_zha
See Also:
Constant Field Values

EWC_za

public static final char EWC_za
See Also:
Constant Field Values

EWC_achen

public static final char EWC_achen
Note the irregular name. The Extended Wylie representation is 'a.

See Also:
Constant Field Values

EWC_ya

public static final char EWC_ya
See Also:
Constant Field Values

EWC_ra

public static final char EWC_ra
See Also:
Constant Field Values

EWC_la

public static final char EWC_la
See Also:
Constant Field Values

EWC_sha

public static final char EWC_sha
See Also:
Constant Field Values

EWC_sa

public static final char EWC_sa
See Also:
Constant Field Values

EWC_ha

public static final char EWC_ha
See Also:
Constant Field Values

EWC_a

public static final char EWC_a
See Also:
Constant Field Values

EW_achung

public static final char EW_achung
In the word for father, "pA lags", there is an a-chung. This is the constant for that little guy.

See Also:
Constant Field Values
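
To make the "pA" example concrete, the sketch below spells that syllable as a consonant followed by the lengthening mark. It assumes that EW_achung corresponds to U+0F71 (TIBETAN VOWEL SIGN AA), which is what the Extended Wylie "A" denotes in Unicode. This page does not show the actual value, and the class AchungSketch is a stand-in.

```java
class AchungSketch {
    static final char EWC_pa = '\u0F54';    // TIBETAN LETTER PA
    static final char EW_achung = '\u0F71'; // TIBETAN VOWEL SIGN AA (assumed value)

    /** Appends the a-chung to a consonant, as in the "pA" of "pA lags". */
    static String lengthen(char consonant) {
        return "" + consonant + EW_achung;
    }

    /** Reports whether the text contains an a-chung anywhere. */
    static boolean hasAchung(String text) {
        return text.indexOf(EW_achung) >= 0;
    }

    public static void main(String[] args) {
        String pA = lengthen(EWC_pa);
        System.out.println(hasAchung(pA));
    }
}
```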

EWV_i

public static final char EWV_i
"gi gu", the 'i' sound in the English word keep.

See Also:
Constant Field Values

EWV_u

public static final char EWV_u
"zhabs kyu", the 'u' sound in the English word tune.

See Also:
Constant Field Values

EWV_e

public static final char EWV_e
"'greng bu" (also known as "'greng po", and pronounced dang-bo), the 'a' sound in the English word gate.

See Also:
Constant Field Values

EWV_o

public static final char EWV_o
"na ro", the 'o' sound in the English word bone.

See Also:
Constant Field Values
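
The four vowel constants line up with the Extended Wylie vowel letters i, u, e and o. The sketch below shows one way to go from the transliteration letter to the vowel sign. VowelSketch and vowelFor are hypothetical names, and the values are the Unicode Tibetan vowel signs, assumed to match the fields above.

```java
class VowelSketch {
    // Stand-ins with values from the Unicode Tibetan block (assumed to match).
    static final char EWV_i = '\u0F72'; // gi gu
    static final char EWV_u = '\u0F74'; // zhabs kyu
    static final char EWV_e = '\u0F7A'; // 'greng bu
    static final char EWV_o = '\u0F7C'; // na ro

    /** Maps an Extended Wylie vowel letter to its vowel sign. */
    static char vowelFor(char wylie) {
        switch (wylie) {
            case 'i': return EWV_i;
            case 'u': return EWV_u;
            case 'e': return EWV_e;
            case 'o': return EWV_o;
            default:
                // The inherent vowel "a" is not written, so it has no sign here.
                throw new IllegalArgumentException("no vowel sign for: " + wylie);
        }
    }

    public static void main(String[] args) {
        System.out.printf("U+%04X%n", (int) vowelFor('o'));
    }
}
```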

EWSUB_wa_zur

public static final char EWSUB_wa_zur
subscribed form of EWC_wa, also known as wa-btags

See Also:
Constant Field Values

EWSUB_ya_btags

public static final char EWSUB_ya_btags
subscribed form of EWC_ya

See Also:
Constant Field Values

EWSUB_ra_btags

public static final char EWSUB_ra_btags
subscribed form of EWC_ra

See Also:
Constant Field Values

EWSUB_la_btags

public static final char EWSUB_la_btags
subscribed form of EWC_la

See Also:
Constant Field Values
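
The four EWSUB constants pair each full-form consonant with the form it takes beneath a root letter. The sketch below shows that pairing and builds a two-letter stack. The class SubjoinedSketch and its methods are invented for the example, and the values are the Unicode subjoined letters U+0FAD, U+0FB1, U+0FB2 and U+0FB3, assumed to match the fields above.

```java
class SubjoinedSketch {
    // Full forms (values from the Unicode Tibetan block, assumed to match).
    static final char EWC_ka = '\u0F40';
    static final char EWC_wa = '\u0F5D';
    static final char EWC_ya = '\u0F61';
    static final char EWC_ra = '\u0F62';
    static final char EWC_la = '\u0F63';
    // Subjoined forms (assumed values).
    static final char EWSUB_wa_zur = '\u0FAD';
    static final char EWSUB_ya_btags = '\u0FB1';
    static final char EWSUB_ra_btags = '\u0FB2';
    static final char EWSUB_la_btags = '\u0FB3';

    /** Returns the subjoined form of wa, ya, ra or la. */
    static char subjoinedFormOf(char consonant) {
        switch (consonant) {
            case EWC_wa: return EWSUB_wa_zur;
            case EWC_ya: return EWSUB_ya_btags;
            case EWC_ra: return EWSUB_ra_btags;
            case EWC_la: return EWSUB_la_btags;
            default:
                throw new IllegalArgumentException("no subjoined form in this sketch");
        }
    }

    /** Stacks a subjoined consonant under a root letter. */
    static String stack(char root, char below) {
        return "" + root + subjoinedFormOf(below);
    }

    public static void main(String[] args) {
        String kra = stack(EWC_ka, EWC_ra);
        for (int i = 0; i < kra.length(); i++) {
            System.out.printf("U+%04X ", (int) kra.charAt(i));
        }
        System.out.println();
    }
}
```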


These API docs were created 02/02/2003 08:19 PM.
Copyright © 2001-2002 Tibetan and Himalayan Digital Library. All Rights Reserved.