php-src/ext/mbstring
Alex Dowad eda73a5f6f Don't mangle non-Japanese chars which appear after a 'combining' kana in SJIS-2004
Unicode has 'combining' characters which join with another following character.
Japanese hiragana and katakana with the 'two dots' voice mark can be represented
in this way, with one Unicode character for the 'base' kana and another one which
adds the voice mark.

In SJIS-2004, however, there are dedicated characters for voiced and unvoiced
kana. So some special checks are done to identify sequences of Unicode characters
which need to be 'collapsed' into a single SJIS-2004 character.

If a kana is immediately followed by some other unrelated character, like a
Cyrillic letter, then the cached kana should be output 'as is' and we
proceed with encoding the unrelated character. When doing this, though,
we need to re-initialize local variables, or else the unrelated character
will be mangled in some cases.
2020-11-11 11:18:58 +02:00
..
libmbfl Don't mangle non-Japanese chars which appear after a 'combining' kana in SJIS-2004 2020-11-11 11:18:58 +02:00
tests Test EUC-JP and Shift-JIS more thoroughly 2020-11-11 11:18:58 +02:00
ucgendat [ci skip] Move OpenLDAP license to redistributable info file 2019-05-06 23:02:46 +02:00
config.m4 Remove mbstring identify filters 2020-11-09 13:45:17 +02:00
config.w32 Remove mbstring identify filters 2020-11-09 13:45:17 +02:00
CREDITS
mb_gpc.c Don't guard mbstring code with #ifdef HAVE_MBSTRING 2020-08-31 23:18:13 +02:00
mb_gpc.h Don't guard mbstring code with #ifdef HAVE_MBSTRING 2020-08-31 23:18:13 +02:00
mbstring.c Remove mbstring identify filters 2020-11-09 13:45:17 +02:00
mbstring.h Don't guard mbstring code with #ifdef HAVE_MBSTRING 2020-08-31 23:18:13 +02:00
mbstring.stub.php Normalize mb_ereg() return value 2020-10-13 20:40:55 +02:00
mbstring_arginfo.h Normalize mb_ereg() return value 2020-10-13 20:40:55 +02:00
php_mbregex.c Normalize mb_ereg() return value 2020-10-13 20:40:55 +02:00
php_mbregex.h Fix [-Wundef] warning in MBString extension 2020-05-16 15:31:20 +02:00
php_onig_compat.h
php_unicode.c Don't guard mbstring code with #ifdef HAVE_MBSTRING 2020-08-31 23:18:13 +02:00
php_unicode.h Don't guard mbstring code with #ifdef HAVE_MBSTRING 2020-08-31 23:18:13 +02:00
unicode_data.h Update Unicode tables to 13.0.0 2020-03-12 11:29:51 +01:00