Unicode Utilities Change Log

Version 2.27: This release updates the character data to Unicode 5.1 and fixes a bug in the -V option of uniname as well as couple of other minor bugs.
Version 2.26: This release adds unifuzz, a utility that generates test input for programs expecting Unicode. Unifuzz can generate a random string of characters, tokens of various potentially problematic characters and sequences, very long lines, strings with embedded nulls, and ill-formed UTF-8. unirev is renamed unireverse.
Version 2.25: Adds to unidesc the option -r which causes it to list the ranges detected after reading all input rather than listing them as they are encountered, and adds to uniname the option -B which causes it to ignore characters within the Basic Multilingual Plane.
Version 2.24: Adds the utility Unirev, a filter which reverses UTF-8 strings.
Version 2.23: uniname and unidesc now provide information about the unofficial ranges within the Private Use Area registered with the ConScript Unicode Registry.
Version 2.22: Adds to uniname the command line option -A, proposed and implemented by H. Merijn Brand, which causes ASCII whitespace to be skipped.
Version 2.21: unidesc and unihist now print detailed error information on encountering ill-formed UTF-8.
Version 2.20: Added to uniname the -a option, which causes it to ignore ASCII characters. Also fixes bugs introduced into unihist and unidesc caused by bad version of UTF-8 reader.
Version 2.19: The data has been updated to Unicode version 5.0.
Version 2.18: A bug was fixed that incorrectly treated a read interrupted in the middle of a UTF-8 sequence as a truncated sequence.
Version 2.17: The data has been updated to Unicode version 4.1.
Version 2.16: Added -l option to print line number. Also added the shell script utf8lookup, which invokes uniname to provide an easy way to look up the character name corresponding to a codepoint from the command line. In addition to uniname it requires the utility Ascii2binary.
Version 2.15.1: Added ifdefs to unihist.c to allow compilation on systems lacking locale.h or libintl.h.
Version 2.15: Options were added to unidesc that list the Unicode ranges in numerical order and alphabetical order respectively.
Version 2.14: The code has been thoroughly cleaned up, so that gcc -Wall -pedantic produces no warnings, and GNU autoconfiguration set up.
Version 2.13: uniname can now be asked merely to validate its input as UTF-8. If the -V flag is given, uniname reads its input and does nothing unless it encounters invalid UTF-8. If so, it reports its location, explains why it is invalid, and exits.
Version 2.12: A couple of minor errors in the endpoints of Unicode ranges were corrected. (The errors were in unirange.c and affect uniname and unidesc.)
Version 2.11: Added unihist to package.
Version 2.10: uniname now explains why ill-formed UTF-8 is ill-formed when it encounters it and (optionally) prints column headers at the top of every screenfull.