cutext/wccat.h: Unicode General Categories
[cutext: Unicode and Text Handling]

Enumerations

enum  cutext_wccat_t {
  CUTEXT_WCCAT_NONE, CUTEXT_WCCAT_LU = 8, CUTEXT_WCCAT_LL, CUTEXT_WCCAT_LT,
  CUTEXT_WCCAT_LM, CUTEXT_WCCAT_LO, CUTEXT_WCCAT_MN = 16, CUTEXT_WCCAT_MC,
  CUTEXT_WCCAT_ME, CUTEXT_WCCAT_ND = 24, CUTEXT_WCCAT_NL, CUTEXT_WCCAT_NO,
  CUTEXT_WCCAT_PC = 32, CUTEXT_WCCAT_PD, CUTEXT_WCCAT_PS, CUTEXT_WCCAT_PE,
  CUTEXT_WCCAT_PI, CUTEXT_WCCAT_PF, CUTEXT_WCCAT_PO, CUTEXT_WCCAT_SM = 40,
  CUTEXT_WCCAT_SC, CUTEXT_WCCAT_SK, CUTEXT_WCCAT_SO, CUTEXT_WCCAT_ZS = 48,
  CUTEXT_WCCAT_ZL, CUTEXT_WCCAT_ZP, CUTEXT_WCCAT_CC = 56, CUTEXT_WCCAT_CF,
  CUTEXT_WCCAT_CS, CUTEXT_WCCAT_CO, CUTEXT_WCCAT_CN
}

Functions

cutext_wccat_t cutext_wccat_by_name (char *name)
cutext_wccat_t cutext_wchar_wccat (cu_wint_t ch)
cu_bool_t cutext_wccat_is_letter (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_mark (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_number (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_punctuation (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_symbol (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_separator (cutext_wccat_t ct)
cu_bool_t cutext_wccat_is_other (cutext_wccat_t ct)
cu_bool_t cutext_wchar_is_letter (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_mark (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_number (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_punctuation (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_symbol (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_separator (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_other (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_vertical_space (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_space (cu_wchar_t ch)
cu_bool_t cutext_wchar_is_print (cu_wchar_t ch)

Enumeration Type Documentation

enum cutext_wccat_t

Enumeration of the Unicode 5.1.0 general character categories.

Enumerator:
CUTEXT_WCCAT_NONE    not a category
CUTEXT_WCCAT_LU      letter, uppercase
CUTEXT_WCCAT_LL      letter, lowercase
CUTEXT_WCCAT_LT      letter, titlecase
CUTEXT_WCCAT_LM      letter, modifier
CUTEXT_WCCAT_LO      letter, other
CUTEXT_WCCAT_MN      mark, nonspacing
CUTEXT_WCCAT_MC      mark, spacing combining
CUTEXT_WCCAT_ME      mark, enclosing
CUTEXT_WCCAT_ND      number, decimal digit
CUTEXT_WCCAT_NL      number, letter
CUTEXT_WCCAT_NO      number, other
CUTEXT_WCCAT_PC      punctuation, connector
CUTEXT_WCCAT_PD      punctuation, dash
CUTEXT_WCCAT_PS      punctuation, open
CUTEXT_WCCAT_PE      punctuation, close
CUTEXT_WCCAT_PI      punctuation, initial quote
CUTEXT_WCCAT_PF      punctuation, final quote
CUTEXT_WCCAT_PO      punctuation, other
CUTEXT_WCCAT_SM      symbol, math
CUTEXT_WCCAT_SC      symbol, currency
CUTEXT_WCCAT_SK      symbol, modifier
CUTEXT_WCCAT_SO      symbol, other
CUTEXT_WCCAT_ZS      separator, space
CUTEXT_WCCAT_ZL      separator, line
CUTEXT_WCCAT_ZP      separator, paragraph
CUTEXT_WCCAT_CC      other, control
CUTEXT_WCCAT_CF      other, format
CUTEXT_WCCAT_CS      other, surrogate
CUTEXT_WCCAT_CO      other, private use
CUTEXT_WCCAT_CN      other, not assigned


Function Documentation

cutext_wccat_t cutext_wccat_by_name ( char *  name  ) 

Returns the general category denoted by the two-letter name given in name, or CUTEXT_WCCAT_NONE if name is not a valid category name.
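The lookup described here can be pictured as a search through a small table of two-letter names. The following is only a sketch, not the culibs implementation: the sketch_ names are hypothetical, the numeric values are copied from the cutext_wccat_t enumeration, and the case-sensitive spelling ("Lu", "Zs", ...) and the handling of a NULL argument are assumptions this page does not confirm.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for CUTEXT_WCCAT_NONE. */
enum { SKETCH_WCCAT_NONE = 0 };

struct sketch_wccat_entry {
    const char *name;   /* two-letter category name (spelling assumed) */
    int cat;            /* numeric value as listed in cutext_wccat_t */
};

/* One entry per category; values follow the enumeration on this page. */
static const struct sketch_wccat_entry sketch_wccat_names[] = {
    {"Lu",  8}, {"Ll",  9}, {"Lt", 10}, {"Lm", 11}, {"Lo", 12},
    {"Mn", 16}, {"Mc", 17}, {"Me", 18},
    {"Nd", 24}, {"Nl", 25}, {"No", 26},
    {"Pc", 32}, {"Pd", 33}, {"Ps", 34}, {"Pe", 35},
    {"Pi", 36}, {"Pf", 37}, {"Po", 38},
    {"Sm", 40}, {"Sc", 41}, {"Sk", 42}, {"So", 43},
    {"Zs", 48}, {"Zl", 49}, {"Zp", 50},
    {"Cc", 56}, {"Cf", 57}, {"Cs", 58}, {"Co", 59}, {"Cn", 60}
};

/* Linear search by exact name; unknown or NULL names map to NONE. */
static int sketch_wccat_by_name(const char *name)
{
    size_t i;
    size_t n = sizeof sketch_wccat_names / sizeof sketch_wccat_names[0];

    if (name == NULL)
        return SKETCH_WCCAT_NONE;
    for (i = 0; i < n; ++i) {
        if (strcmp(name, sketch_wccat_names[i].name) == 0)
            return sketch_wccat_names[i].cat;
    }
    return SKETCH_WCCAT_NONE;
}
```

With this table, sketch_wccat_by_name("Zs") yields 48 (the value of CUTEXT_WCCAT_ZS) and an unknown name such as "Xx" yields SKETCH_WCCAT_NONE.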

cu_bool_t cutext_wccat_is_letter ( cutext_wccat_t  ct  ) 

True iff ct is in the "letter" ("L*") main category.

cu_bool_t cutext_wccat_is_mark ( cutext_wccat_t  ct  ) 

True iff ct is in the "mark" ("M*") main category.

cu_bool_t cutext_wccat_is_number ( cutext_wccat_t  ct  ) 

True iff ct is in the "number" ("N*") main category.

cu_bool_t cutext_wccat_is_other ( cutext_wccat_t  ct  ) 

True iff ct is in the "other" ("C*") main category.

cu_bool_t cutext_wccat_is_punctuation ( cutext_wccat_t  ct  ) 

True iff ct is in the "punctuation" ("P*") main category.

cu_bool_t cutext_wccat_is_separator ( cutext_wccat_t  ct  ) 

True iff ct is in the "separator" ("Z*") main category.

cu_bool_t cutext_wccat_is_symbol ( cutext_wccat_t  ct  ) 

True iff ct is in the "symbol" ("S*") main category.
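The enumeration values are laid out in blocks of eight (LU = 8, MN = 16, ND = 24, PC = 32, SM = 40, ZS = 48, CC = 56), so a main-category test can be written as a comparison on the value divided by eight. Whether the library's predicates actually work this way is an assumption; the sketch below only illustrates what the numbering permits, and every sketch_ name is hypothetical.

```c
#include <stdbool.h>

/* Block index of each main category, derived from the enum values above:
 * 8/8 = 1 for L*, 16/8 = 2 for M*, and so on. NONE (0) falls in block 0. */
enum sketch_main_block {
    SKETCH_BLOCK_NONE        = 0,
    SKETCH_BLOCK_LETTER      = 1,
    SKETCH_BLOCK_MARK        = 2,
    SKETCH_BLOCK_NUMBER      = 3,
    SKETCH_BLOCK_PUNCTUATION = 4,
    SKETCH_BLOCK_SYMBOL      = 5,
    SKETCH_BLOCK_SEPARATOR   = 6,
    SKETCH_BLOCK_OTHER       = 7
};

/* Maps a category value (as listed in cutext_wccat_t) to its block index. */
static int sketch_main_block_of(int ct)
{
    return ct / 8;
}

static bool sketch_is_letter(int ct)
{
    return sketch_main_block_of(ct) == SKETCH_BLOCK_LETTER;
}

static bool sketch_is_punctuation(int ct)
{
    return sketch_main_block_of(ct) == SKETCH_BLOCK_PUNCTUATION;
}

static bool sketch_is_other(int ct)
{
    return sketch_main_block_of(ct) == SKETCH_BLOCK_OTHER;
}
```

For instance, 12 (CUTEXT_WCCAT_LO) lands in the letter block and 38 (CUTEXT_WCCAT_PO) in the punctuation block, while 0 (CUTEXT_WCCAT_NONE) belongs to no main category.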

cutext_wccat_t cutext_wchar_wccat ( cu_wint_t  ch  ) 

Returns the general category of ch.
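To make the mapping from character to category concrete, here is an ASCII-only stand-in. It is not the library's lookup, which presumably covers the full Unicode range; the sketch_ names are hypothetical, the code assumes an ASCII execution character set, and anything above U+007F is simply reported as SKETCH_WCCAT_NONE because the sketch has no data for it.

```c
#include <string.h>

/* Values copied from the cutext_wccat_t enumeration; names are hypothetical. */
enum {
    SKETCH_WCCAT_NONE = 0,
    SKETCH_WCCAT_LU = 8,  SKETCH_WCCAT_LL = 9,
    SKETCH_WCCAT_ND = 24,
    SKETCH_WCCAT_PC = 32, SKETCH_WCCAT_PD = 33, SKETCH_WCCAT_PS = 34,
    SKETCH_WCCAT_PE = 35, SKETCH_WCCAT_PO = 38,
    SKETCH_WCCAT_SM = 40, SKETCH_WCCAT_SC = 41, SKETCH_WCCAT_SK = 42,
    SKETCH_WCCAT_ZS = 48,
    SKETCH_WCCAT_CC = 56
};

/* General category of an ASCII code point (U+0000..U+007F). */
static int sketch_ascii_wccat(unsigned long ch)
{
    if (ch > 0x7F)
        return SKETCH_WCCAT_NONE;            /* outside this sketch's range */
    if (ch < 0x20 || ch == 0x7F)
        return SKETCH_WCCAT_CC;              /* C0 controls and DEL */
    if (ch == ' ')
        return SKETCH_WCCAT_ZS;
    if (ch >= 'A' && ch <= 'Z')
        return SKETCH_WCCAT_LU;
    if (ch >= 'a' && ch <= 'z')
        return SKETCH_WCCAT_LL;
    if (ch >= '0' && ch <= '9')
        return SKETCH_WCCAT_ND;
    if (ch == '_')
        return SKETCH_WCCAT_PC;
    if (ch == '-')
        return SKETCH_WCCAT_PD;
    if (strchr("([{", (int)ch) != NULL)
        return SKETCH_WCCAT_PS;
    if (strchr(")]}", (int)ch) != NULL)
        return SKETCH_WCCAT_PE;
    if (strchr("+<=>|~", (int)ch) != NULL)
        return SKETCH_WCCAT_SM;
    if (ch == '$')
        return SKETCH_WCCAT_SC;
    if (ch == '^' || ch == '`')
        return SKETCH_WCCAT_SK;
    return SKETCH_WCCAT_PO;                  /* remaining ASCII punctuation */
}
```

Here 'A' maps to the value of CUTEXT_WCCAT_LU, '7' to CUTEXT_WCCAT_ND, and '+' to CUTEXT_WCCAT_SM.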

Generated 2009-11-23 for culibs-0.25 using Doxygen. Maintained by Petter Urkedal.