Stig Venaas
e08bc054c7
Adding support for compatibility decomposition to ucdata lib, and switch
...
from NFC to NFKC in UTF8bvnormalize() and UTF8bvnormcmp()
2002-04-19 12:59:57 +00:00
Kurt Zeilenga
319440033f
Patch: ucdata 2.4 bugs (ITS#1751)
...
================
Written by Hallvard B. Furuseth and placed into the public domain.
This software is not subject to any license of the University of Oslo.
================
ucgendat.c accessed unallocated memory when i == ncodes_size.
The changes others are trivial, I just include them since I'm patching
ucdata anyway:
ucdata.c had some pointless '0 <= unsigned' comparisons.
ucstr.c assigned a long* to an unsigned long*. Since malloc()
returns void*, the result need not be cast at all.
I'll send the ucgendat.c and ucdata.c patches to Mark Leisher
<mleisher@crl.nmsu.edu>.
Hallvard B. Furuseth <h.b.furuseth@usit.uio.no>, April 2002.
2002-04-15 20:39:22 +00:00
Pierangelo Masarati
6b8e16efec
fix ambiguous use of && and || (please check :)
2002-04-06 09:30:24 +00:00
Stig Venaas
891f987928
Changed to use lower case for case folding
2002-04-04 12:50:46 +00:00
Stig Venaas
c476878fde
Using new UTF8bv* all over, getting rid of UTF8normalize() and
...
UTF8normcmp().
2002-02-27 16:11:24 +00:00
Stig Venaas
94983da942
Added code for approximate matching in UTF8bvnormalize() and changed to use
...
this in approxMatch etc in schema_init.c
2002-02-26 18:38:40 +00:00
Stig Venaas
d6c3ccadc4
Optimizing UTF8bvnormcmp() by adding flags that tells which arguments are
...
in NFC. By default no arguments are assumed to be NFC.
2002-02-25 13:20:09 +00:00
Kurt Zeilenga
6d9307b534
Silence a warning
2002-02-23 23:26:48 +00:00
Stig Venaas
9e8f3053cb
Added UTF8bvnormcmp(), should replace UTF8normcmp()
2002-02-20 15:51:10 +00:00
Stig Venaas
217103b138
Added UTF8bvnormalize which is like UTF8normalize but returns berval. It
...
can be passed a pointer to the berval to be used for the return value. If
this is NULL, it will allocate a new one.
2002-02-14 14:03:27 +00:00
Howard Chu
e21e900392
Use LDAP_UTF8_CHARLEN2 to check shortest possible encoding
2002-01-15 08:27:19 +00:00
Kurt Zeilenga
0e2af54a3f
Update copyright statements
2002-01-04 21:17:25 +00:00
Kurt Zeilenga
c8d7b334e6
Add copyright notices
2002-01-03 16:36:46 +00:00
Howard Chu
8c04993cdf
Use <lber.h> ber_memalloc, not malloc. Shortcuts for pure-ASCII strings.
...
Still need to add some kind of isNormalized() check to avoid rework on
UTF8 strings that are already properly normalized.
2002-01-03 16:22:53 +00:00
Kurt Zeilenga
763c0de59b
Rework filter code
...
Misc cleanup / lint removal
2002-01-02 17:06:56 +00:00
Kurt Zeilenga
c1a257a83f
stdlib.h should be included as <ac/stdlib.h>
2001-12-18 01:24:31 +00:00
Mark Adamson
156c9b3177
typo in declaration of function.
2001-11-09 19:22:03 +00:00
Mark Adamson
f05420c8d8
use berval's instead of strings with UTF8normalize()
2001-11-09 19:01:23 +00:00
Kurt Zeilenga
978e417699
Make some additional UTF8 public
...
Remove lint
2001-07-21 23:13:04 +00:00
Kurt Zeilenga
5ac196e3f9
Update headers
2001-06-15 00:02:12 +00:00
Stig Venaas
7a3c92f2a7
Adding UTF8normcmp() for normalizing and comparing two UTF8 strings
2001-01-31 15:45:30 +00:00
Stig Venaas
899357d702
Including ac/ctype.h to get TOUPPER macro
2001-01-26 14:35:20 +00:00
Stig Venaas
8c61bd279e
Added UTF8normalize() that does Unicode canonical normalization and
...
optionally case folding
2001-01-19 15:52:58 +00:00
Kurt Zeilenga
a4fac87687
Use ucdata (very experimental)
2000-09-07 23:19:10 +00:00