Commit Graph

24 Commits

Author SHA1 Message Date
Stig Venaas
e08bc054c7 Adding support for compatibility decomposition to ucdata lib, and switch
from NFC to NFKC in UTF8bvnormalize() and UTF8bvnormcmp()
2002-04-19 12:59:57 +00:00
Kurt Zeilenga
319440033f Patch: ucdata 2.4 bugs (ITS#1751)
================
Written by Hallvard B. Furuseth and placed into the public domain.
This software is not subject to any license of the University of Oslo.
			================

ucgendat.c accessed unallocated memory when i == ncodes_size.

The changes others are trivial, I just include them since I'm patching
ucdata anyway:

ucdata.c   had some pointless '0 <= unsigned' comparisons.

ucstr.c    assigned a long* to an unsigned long*.  Since malloc()
           returns void*, the result need not be cast at all.

I'll send the ucgendat.c and ucdata.c patches to Mark Leisher
<mleisher@crl.nmsu.edu>.

Hallvard B. Furuseth <h.b.furuseth@usit.uio.no>, April 2002.
2002-04-15 20:39:22 +00:00
Pierangelo Masarati
6b8e16efec fix ambiguous use of && and || (please check :) 2002-04-06 09:30:24 +00:00
Stig Venaas
891f987928 Changed to use lower case for case folding 2002-04-04 12:50:46 +00:00
Stig Venaas
c476878fde Using new UTF8bv* all over, getting rid of UTF8normalize() and
UTF8normcmp().
2002-02-27 16:11:24 +00:00
Stig Venaas
94983da942 Added code for approximate matching in UTF8bvnormalize() and changed to use
this in approxMatch etc in schema_init.c
2002-02-26 18:38:40 +00:00
Stig Venaas
d6c3ccadc4 Optimizing UTF8bvnormcmp() by adding flags that tells which arguments are
in NFC. By default no arguments are assumed to be NFC.
2002-02-25 13:20:09 +00:00
Kurt Zeilenga
6d9307b534 Silence a warning 2002-02-23 23:26:48 +00:00
Stig Venaas
9e8f3053cb Added UTF8bvnormcmp(), should replace UTF8normcmp() 2002-02-20 15:51:10 +00:00
Stig Venaas
217103b138 Added UTF8bvnormalize which is like UTF8normalize but returns berval. It
can be passed a pointer to the berval to be used for the return value. If
this is NULL, it will allocate a new one.
2002-02-14 14:03:27 +00:00
Howard Chu
e21e900392 Use LDAP_UTF8_CHARLEN2 to check shortest possible encoding 2002-01-15 08:27:19 +00:00
Kurt Zeilenga
0e2af54a3f Update copyright statements 2002-01-04 21:17:25 +00:00
Kurt Zeilenga
c8d7b334e6 Add copyright notices 2002-01-03 16:36:46 +00:00
Howard Chu
8c04993cdf Use <lber.h> ber_memalloc, not malloc. Shortcuts for pure-ASCII strings.
Still need to add some kind of isNormalized() check to avoid rework on
UTF8 strings that are already properly normalized.
2002-01-03 16:22:53 +00:00
Kurt Zeilenga
763c0de59b Rework filter code
Misc cleanup / lint removal
2002-01-02 17:06:56 +00:00
Kurt Zeilenga
c1a257a83f stdlib.h should be included as <ac/stdlib.h> 2001-12-18 01:24:31 +00:00
Mark Adamson
156c9b3177 typo in declaration of function. 2001-11-09 19:22:03 +00:00
Mark Adamson
f05420c8d8 use berval's instead of strings with UTF8normalize() 2001-11-09 19:01:23 +00:00
Kurt Zeilenga
978e417699 Make some additional UTF8 public
Remove lint
2001-07-21 23:13:04 +00:00
Kurt Zeilenga
5ac196e3f9 Update headers 2001-06-15 00:02:12 +00:00
Stig Venaas
7a3c92f2a7 Adding UTF8normcmp() for normalizing and comparing two UTF8 strings 2001-01-31 15:45:30 +00:00
Stig Venaas
899357d702 Including ac/ctype.h to get TOUPPER macro 2001-01-26 14:35:20 +00:00
Stig Venaas
8c61bd279e Added UTF8normalize() that does Unicode canonical normalization and
optionally case folding
2001-01-19 15:52:58 +00:00
Kurt Zeilenga
a4fac87687 Use ucdata (very experimental) 2000-09-07 23:19:10 +00:00