[A-Z]* oddities

Alan Cox alan at redhat.com
Mon Feb 2 21:46:42 UTC 2004


On Mon, Feb 02, 2004 at 07:04:22PM +0000, Luciano Miguel Ferreira Rocha wrote:
> On Mon, Feb 02, 2004 at 01:15:01PM -0500, Behdad Esfahbod wrote:
> > This is totally expected :).
> > 
> > For what you want try LANG=C echo [A-Z]* ...
> > 
> > In LANG=en_US.UTF-8 which is the default, both 'A' and 'a' sort
> > before 'B' and 'b'.
> 
> But 'A' is *after* 'a'?
> 
> And I thought utf-8 was supposed to be compatible with ascii...

utf-8 is a character encoding. It doesn't relate to sort (or more properly
"collation") order.

> Well, if that's the standard, then it isn't broken...

The standard depends on the actual language/location. Some sort AaBbCc others
have more complex rules (a b c ch d dd e f ff g ng ...) etc

Alan





More information about the fedora-devel-list mailing list