Unicode Utilities Change Log



Version 2.27
This release updates the character data to Unicode 5.1 and fixes a bug in the -V option of uniname as well as a couple of other minor bugs.
Version 2.26
This release adds unifuzz, a utility that generates test input for programs expecting Unicode. It can generate random strings of characters, tokens consisting of various potentially problematic characters and sequences, very long lines, strings with embedded nulls, and ill-formed UTF-8. unirev has been renamed unireverse.
Version 2.25
Adds to unidesc the option -r which causes it to list the ranges detected after reading all input rather than listing them as they are encountered, and adds to uniname the option -B which causes it to ignore characters within the Basic Multilingual Plane.
Version 2.24
Adds the utility unirev, a filter which reverses UTF-8 strings.
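To illustrate what reversing a UTF-8 string involves, here is a minimal sketch in C. It is not taken from the unirev source; the function names utf8_reverse and reverse_bytes are hypothetical, and the input is assumed to be well-formed, NUL-terminated UTF-8. The sketch reverses by code point, so a base character and a following combining mark would end up in swapped order.

```c
#include <stddef.h>
#include <string.h>

/* Reverse n bytes in place. Hypothetical helper for this sketch. */
static void reverse_bytes(unsigned char *s, size_t n)
{
    size_t i;
    for (i = 0; i < n / 2; i++) {
        unsigned char t = s[i];
        s[i] = s[n - 1 - i];
        s[n - 1 - i] = t;
    }
}

/* Reverse a well-formed UTF-8 string in place, code point by code point.
   Assumption: str is valid UTF-8; ill-formed input is not detected. */
void utf8_reverse(char *str)
{
    unsigned char *s = (unsigned char *)str;
    size_t len = strlen(str);
    size_t i = 0;

    /* Step 1: reverse the bytes inside each encoded character. */
    while (i < len) {
        size_t j = i + 1;
        /* Continuation bytes have the bit pattern 10xxxxxx. */
        while (j < len && (s[j] & 0xC0) == 0x80)
            j++;
        reverse_bytes(s + i, j - i);
        i = j;
    }

    /* Step 2: reverse the whole buffer. This puts the characters in
       reverse order and restores the byte order within each one. */
    reverse_bytes(s, len);
}
```

A naive byte-wise reversal would scramble multi-byte sequences; the two-pass approach avoids that without needing a second buffer.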
Version 2.23
uniname and unidesc now provide information about the unofficial ranges within the Private Use Area registered with the ConScript Unicode Registry.
Version 2.22
Adds to uniname the command line option -A, proposed and implemented by H. Merijn Brand, which causes ASCII whitespace to be skipped.
Version 2.21
unidesc and unihist now print detailed error information on encountering ill-formed UTF-8.
Version 2.20
Added to uniname the -a option, which causes it to ignore ASCII characters. Also fixes bugs in unihist and unidesc introduced by a bad version of the UTF-8 reader.
Version 2.19
The data has been updated to Unicode version 5.0.
Version 2.18
A bug was fixed that incorrectly treated a read interrupted in the middle of a UTF-8 sequence as a truncated sequence.
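The distinction behind this fix can be sketched as follows. This is not the code used in the package; utf8_incomplete_tail is a hypothetical helper. It reports how many bytes at the end of a buffer belong to a UTF-8 sequence that has started but not yet finished, so that a caller could carry those bytes over to the next read instead of reporting a truncated sequence.

```c
#include <stddef.h>

/* Return the number of trailing bytes in buf that start a UTF-8
   sequence which is not yet complete, or 0 if the buffer ends on a
   character boundary. Hypothetical helper; it only inspects lead-byte
   lengths and does not validate the sequence itself. */
size_t utf8_incomplete_tail(const unsigned char *buf, size_t len)
{
    size_t back;

    /* A sequence is at most 4 bytes, so look back at most 4 bytes. */
    for (back = 1; back <= 4 && back <= len; back++) {
        unsigned char b = buf[len - back];
        size_t expect;

        if ((b & 0xC0) == 0x80)
            continue;               /* continuation byte, keep looking */

        if (b < 0x80)
            expect = 1;             /* ASCII */
        else if ((b & 0xE0) == 0xC0)
            expect = 2;
        else if ((b & 0xF0) == 0xE0)
            expect = 3;
        else if ((b & 0xF8) == 0xF0)
            expect = 4;
        else
            return 0;               /* invalid lead byte: not our job */

        /* Incomplete only if the lead byte promises more bytes
           than are present after it. */
        return expect > back ? back : 0;
    }
    return 0;
}
```

A reader built along these lines would treat a nonzero result as truncation only when end of file has actually been reached, and otherwise prepend the leftover bytes to the next chunk.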
Version 2.17
The data has been updated to Unicode version 4.1.
Version 2.16
Added -l option to print line number. Also added the shell script utf8lookup, which invokes uniname to provide an easy way to look up the character name corresponding to a codepoint from the command line. In addition to uniname it requires the utility ascii2binary.
Version 2.15.1
Added ifdefs to unihist.c to allow compilation on systems lacking locale.h or libintl.h.
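Guards of this kind typically look something like the sketch below. It is not copied from unihist.c: the macro names HAVE_LOCALE_H and HAVE_LIBINTL_H follow the usual autoconf convention and are assumed here, and the function init_i18n, the _() shorthand, and the text domain name are hypothetical.

```c
#include <stdio.h>

/* Assumed feature macros, normally defined by a configure script. */
#ifdef HAVE_LOCALE_H
#include <locale.h>
#endif

#ifdef HAVE_LIBINTL_H
#include <libintl.h>
#define _(s) gettext(s)     /* translate messages when gettext exists */
#else
#define _(s) (s)            /* otherwise pass the string through */
#endif

/* Set up localization where the platform supports it; do nothing
   on systems that lack the headers. Hypothetical function. */
void init_i18n(void)
{
#ifdef HAVE_LOCALE_H
    setlocale(LC_ALL, "");
#endif
#ifdef HAVE_LIBINTL_H
    textdomain("uniutils");  /* hypothetical text domain name */
#endif
}
```

With neither macro defined, the code compiles without either header and messages are printed untranslated.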
Version 2.15
Options were added to unidesc that list the Unicode ranges in numerical order and alphabetical order respectively.
Version 2.14
The code has been thoroughly cleaned up so that gcc -Wall -pedantic produces no warnings, and GNU autoconfiguration has been set up.
Version 2.13
uniname can now be asked merely to validate its input as UTF-8. If the -V flag is given, uniname reads its input and does nothing unless it encounters invalid UTF-8. If so, it reports its location, explains why it is invalid, and exits.
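The kind of check described here can be sketched in C as follows. This is an illustration only, not uniname's implementation: the function utf8_first_error and its reason strings are hypothetical. It returns NULL for valid input and otherwise a short explanation, storing the byte offset of the problem.

```c
#include <stddef.h>

/* Scan len bytes for the first ill-formed UTF-8 sequence.
   Returns NULL if the input is valid; otherwise returns a reason
   string and stores the offending byte offset in *offset.
   Hypothetical helper, not uniname's actual code. */
const char *utf8_first_error(const unsigned char *s, size_t len,
                             size_t *offset)
{
    /* Smallest code point that legitimately needs 2, 3, or 4 bytes. */
    static const unsigned long min_cp[4] = { 0, 0x80, 0x800, 0x10000 };
    size_t i = 0;

    while (i < len) {
        unsigned char b = s[i];
        unsigned long cp;
        size_t need, k;

        if (b < 0x80) { i++; continue; }            /* ASCII */
        else if ((b & 0xE0) == 0xC0) { need = 1; cp = b & 0x1F; }
        else if ((b & 0xF0) == 0xE0) { need = 2; cp = b & 0x0F; }
        else if ((b & 0xF8) == 0xF0) { need = 3; cp = b & 0x07; }
        else {
            *offset = i;
            return (b & 0xC0) == 0x80 ? "unexpected continuation byte"
                                      : "invalid lead byte";
        }

        for (k = 1; k <= need; k++) {
            if (i + k >= len) {
                *offset = i;
                return "truncated sequence";
            }
            if ((s[i + k] & 0xC0) != 0x80) {
                *offset = i + k;
                return "missing continuation byte";
            }
            cp = (cp << 6) | (s[i + k] & 0x3F);
        }

        *offset = i;
        if (cp < min_cp[need])
            return "overlong encoding";
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return "encoded surrogate";
        if (cp > 0x10FFFF)
            return "code point beyond U+10FFFF";

        i += need + 1;
    }
    return NULL;
}
```

A caller in the spirit of the -V flag would stay silent on a NULL result and otherwise print the offset and reason before exiting.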
Version 2.12
A couple of minor errors in the endpoints of Unicode ranges were corrected. (The errors were in unirange.c and affect uniname and unidesc.)
Version 2.11
Added unihist to package.
Version 2.10
uniname now explains why ill-formed UTF-8 is ill-formed when it encounters it and (optionally) prints column headers at the top of every screenful.