factor/basis/unicode/normalize/normalize-docs.factor

USING: help.syntax help.markup strings ;
IN: unicode.normalize

ABOUT: "unicode.normalize"

ARTICLE: "unicode.normalize" "Unicode normalization"
"The " { $vocab-link "unicode.normalize" "unicode.normalize" } " vocabulary defines words for normalizing Unicode strings."
$nl
"In Unicode, it is often possible to have multiple sequences of characters which really represent exactly the same thing. For example, to represent e with an acute accent above, there are two possible strings: " { $snippet "\"e\\u000301\"" } " (the e character, followed by the combining acute accent character) and " { $snippet "\"\\u0000e9\"" } " (a single character, e with an acute accent)."
$nl
"There are four normalization forms: NFD, NFC, NFKD, and NFKC. Basically, in NFD and NFKD, everything is expanded, whereas in NFC and NFKC, everything is contracted. In NFKD and NFKC, more things are expanded and contracted. This is a process which loses some information, so it should be done only with care."
$nl
"Most of the world uses NFC to communicate, but for many purposes, NFD/NFKD is easier to process. For more information, see Unicode Standard Annex #15 and section 3 of the Unicode standard."
{ $subsections
    nfc
    nfd
    nfkc
    nfkd
} ;

HELP: nfc
{ $values { "string" string } { "nfc" "a string in NFC" } }
{ $description "Converts a string to Normalization Form C." } ;

HELP: nfd
{ $values { "string" string } { "nfd" "a string in NFD" } }
{ $description "Converts a string to Normalization Form D." } ;

HELP: nfkc
{ $values { "string" string } { "nfkc" "a string in NFKC" } }
{ $description "Converts a string to Normalization Form KC." } ;

HELP: nfkd
{ $values { "string" string } { "nfkd" "a string in NFKD" } }
{ $description "Converts a string to Normalization Form KD." } ;
Unicode docs 2009-01-07 18:59:01 -05:00			`USING: help.syntax help.markup strings ;`
			`IN: unicode.normalize`

			`ABOUT: "unicode.normalize"`

			`ARTICLE: "unicode.normalize" "Unicode normalization"`
Update Unicode docs 2009-01-26 00:03:49 -05:00			`"The " { $vocab-link "unicode.normalize" "unicode.normalize" } " vocabulary defines words for normalizing Unicode strings."`
			`$nl`
			`"In Unicode, it is often possible to have multiple sequences of characters which really represent exactly the same thing. For example, to represent e with an acute accent above, there are two possible strings: " { $snippet "\"e\\u000301\"" } " (the e character, followed by the combining acute accent character) and " { $snippet "\"\\u0000e9\"" } " (a single character, e with an acute accent)."`
			`$nl`
			`"There are four normalization forms: NFD, NFC, NFKD, and NFKC. Basically, in NFD and NFKD, everything is expanded, whereas in NFC and NFKC, everything is contracted. In NFKD and NFKC, more things are expanded and contracted. This is a process which loses some information, so it should be done only with care."`
			`$nl`
			`"Most of the world uses NFC to communicate, but for many purposes, NFD/NFKD is easier to process. For more information, see Unicode Standard Annex #15 and section 3 of the Unicode standard."`
docs: change $subsection to $subsections 2009-10-01 15:56:36 -04:00			`{ $subsections`
			`nfc`
			`nfd`
			`nfkc`
			`nfkd`
			`} ;`
Unicode docs 2009-01-07 18:59:01 -05:00
			`HELP: nfc`
			`{ $values { "string" string } { "nfc" "a string in NFC" } }`
Update Unicode docs 2009-01-26 00:03:49 -05:00			`{ $description "Converts a string to Normalization Form C." } ;`
Unicode docs 2009-01-07 18:59:01 -05:00
			`HELP: nfd`
			`{ $values { "string" string } { "nfd" "a string in NFD" } }`
Update Unicode docs 2009-01-26 00:03:49 -05:00			`{ $description "Converts a string to Normalization Form D." } ;`
Unicode docs 2009-01-07 18:59:01 -05:00
			`HELP: nfkc`
			`{ $values { "string" string } { "nfkc" "a string in NFKC" } }`
Update Unicode docs 2009-01-26 00:03:49 -05:00			`{ $description "Converts a string to Normalization Form KC." } ;`
Unicode docs 2009-01-07 18:59:01 -05:00
			`HELP: nfkd`
Fixing doc errors 2009-01-08 16:38:03 -05:00			`{ $values { "string" string } { "nfkd" "a string in NFKD" } }`
Update Unicode docs 2009-01-26 00:03:49 -05:00			`{ $description "Converts a string to Normalization Form KD." } ;`