The samIT project was shut down in January 2011, and these pages are no longer updated!


Sun and Unicode


"Unicode is the only practical character set option for applications that support multilingual documents. However, applications do have several options for how they encode Unicode. An encoding is the mapping of Unicode code points to a stream of storable code units or octets. The most common encodings include UTF-8, UTF-16 and UTF-32. Each encoding has advantages and drawbacks. However, one encoding in particular has gained widespread acceptance. That encoding is UTF-8. UTF-8 is an important encoding because it is:
  • ASCII compatible
  • easily supported
  • compact and efficient for most scripts
  • easily processed, unlike other multibyte encodings

One complaint often aimed at Unicode is that it requires so much more space than legacy encodings for Latin-based scripts. However, UTF-8 stores the ASCII subset of all these charsets in as little as one byte. The ASCII subset is definitely the most used set of characters for Western European and American languages.

Unlike some legacy character encodings, UTF-8 is fairly easy to parse and manipulate. The bit patterns of the encoding allow you to quickly determine whether your character index points to a character's beginning or somewhere else. Moving backward or forward within a string is easy."
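
As an illustration of the bit-pattern point in the quotation above, here is a minimal Python sketch. It relies on the fact that every UTF-8 continuation byte has the form 10xxxxxx, while ASCII bytes (0xxxxxxx) and lead bytes (11xxxxxx) do not. The helper names is_continuation_byte, char_start and next_char_start, and the sample string, are hypothetical and chosen only for this example. The sketch assumes the input is valid UTF-8.

```python
def is_continuation_byte(b: int) -> bool:
    """Return True if b matches the bit pattern 10xxxxxx."""
    return (b & 0xC0) == 0x80


def char_start(data: bytes, index: int) -> int:
    """Step backward from index to the first byte of the character containing it."""
    while index > 0 and is_continuation_byte(data[index]):
        index -= 1
    return index


def next_char_start(data: bytes, index: int) -> int:
    """Return the index of the first byte of the character following index."""
    index += 1
    while index < len(data) and is_continuation_byte(data[index]):
        index += 1
    return index


# Sample text "Sámi čálli", written with escapes so the code points are unambiguous.
text = "S\u00e1mi \u010d\u00e1lli"
data = text.encode("utf-8")

# ASCII characters take one byte each; á and č take two bytes each.
ascii_len = len("Sun".encode("utf-8"))   # 3 bytes for 3 characters
sample_len = len(data)                   # 13 bytes for 10 characters

# Byte 2 is the second byte of á, so the character starts at byte 1.
start = char_start(data, 2)              # 1
following = next_char_start(data, 1)     # 3, the byte for "m"
```

Because the check inspects only the top two bits of a single byte, finding a character boundary needs no decoding state and never has to scan back to the start of the string.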


Contact hal@fad.dep.no if you have questions about this website.
Last modified: 2007-09-03 11:19