Universal Coded Character Set

The Universal Coded Character Set (UCS) is a standard set of characters defined by the International Standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings. ^[1]

53 relations: ASCII, Bi-directional text, Binary Ordered Compression for Unicode, Bit, Byte, C0 and C1 control codes, Character (computing), Character encoding, China, Code point, Collation, Comparison of Unicode encodings, Devanagari, Euro sign, GB 18030, Georgian lari, Graphical user interface, Hentaigana, Hexadecimal, Hugh McGregor Ross, Indian rupee sign, Integer, International Electrotechnical Commission, International Organization for Standardization, ISO 14651, ISO 15924, ISO/IEC 2022, ISO/IEC 646, ISO/IEC 8859, ISO/IEC JTC 1/SC 2, Ken Thompson, Language, List of International Organization for Standardization standards, List of XML and HTML character entity references, Plan 9 from Bell Labs, Plane (Unicode), Right-to-left, Rob Pike, Specials (Unicode block), Standard Compression Scheme for Unicode, Text normalization, Turkish lira sign, Unicode, Unicode character property, Unicode Consortium, Universal Character Set characters, UTF-1, UTF-16, UTF-32, UTF-8, ..., Working group, Writing system, Xterm. Expand index (3 more) » « Shrink index

ASCII

ASCII, abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication.

New!!: Universal Coded Character Set and ASCII · See more »

Bi-directional text

Bi-directional text is text containing text in both text directionalities, both right-to-left (RTL or dextrosinistral) and left-to-right (LTR or sinistrodextral).

New!!: Universal Coded Character Set and Bi-directional text · See more »

Binary Ordered Compression for Unicode

Binary Ordered Compression for Unicode (BOCU) is a MIME compatible Unicode compression scheme.

New!!: Universal Coded Character Set and Binary Ordered Compression for Unicode · See more »

Bit

The bit (a portmanteau of binary digit) is a basic unit of information used in computing and digital communications.

New!!: Universal Coded Character Set and Bit · See more »

Byte

The byte is a unit of digital information that most commonly consists of eight bits, representing a binary number.

New!!: Universal Coded Character Set and Byte · See more »

C0 and C1 control codes

The C0 and C1 control code or control character sets define control codes for use in text by computer systems that use the ISO/IEC 2022 system of specifying control and graphic characters.

New!!: Universal Coded Character Set and C0 and C1 control codes · See more »

Character (computing)

In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language.

New!!: Universal Coded Character Set and Character (computing) · See more »

Character encoding

Character encoding is used to represent a repertoire of characters by some kind of encoding system.

New!!: Universal Coded Character Set and Character encoding · See more »

China

China, officially the People's Republic of China (PRC), is a unitary one-party sovereign state in East Asia and the world's most populous country, with a population of around /1e9 round 3 billion.

New!!: Universal Coded Character Set and China · See more »

Code point

In character encoding terminology, a code point or code position is any of the numerical values that make up the code space.

New!!: Universal Coded Character Set and Code point · See more »

Collation

Collation is the assembly of written information into a standard order.

New!!: Universal Coded Character Set and Collation · See more »

Comparison of Unicode encodings

This article compares Unicode encodings.

New!!: Universal Coded Character Set and Comparison of Unicode encodings · See more »

Devanagari

Devanagari (देवनागरी,, a compound of "''deva''" देव and "''nāgarī''" नागरी; Hindi pronunciation), also called Nagari (Nāgarī, नागरी),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group,, page 83 is an abugida (alphasyllabary) used in India and Nepal.

New!!: Universal Coded Character Set and Devanagari · See more »

Euro sign

The euro sign (€) is the currency sign used for the euro, the official currency of the Eurozone in the European Union (EU).

New!!: Universal Coded Character Set and Euro sign · See more »

GB 18030

GB 18030 is a Chinese government standard, described as Information technology — Chinese coded character set and defines the required language and character support necessary for software in China.

New!!: Universal Coded Character Set and GB 18030 · See more »

Georgian lari

The lari (ლარი; ISO 4217: GEL) is the currency of Georgia.

New!!: Universal Coded Character Set and Georgian lari · See more »

Graphical user interface

The graphical user interface (GUI), is a type of user interface that allows users to interact with electronic devices through graphical icons and visual indicators such as secondary notation, instead of text-based user interfaces, typed command labels or text navigation.

New!!: Universal Coded Character Set and Graphical user interface · See more »

Hentaigana

In the Japanese writing system, are obsolete or nonstandard hiragana.

New!!: Universal Coded Character Set and Hentaigana · See more »

Hexadecimal

In mathematics and computing, hexadecimal (also base, or hex) is a positional numeral system with a radix, or base, of 16.

New!!: Universal Coded Character Set and Hexadecimal · See more »

Hugh McGregor Ross

Hugh McGregor Ross (31 August 1917 – 1 September 2014) was an early pioneer in the history of British computing.

New!!: Universal Coded Character Set and Hugh McGregor Ross · See more »

Indian rupee sign

The Indian rupee sign (sign:; code: INR) is the currency sign for the Indian rupee, the official currency of India.

New!!: Universal Coded Character Set and Indian rupee sign · See more »

Integer

An integer (from the Latin ''integer'' meaning "whole")Integer 's first literal meaning in Latin is "untouched", from in ("not") plus tangere ("to touch").

New!!: Universal Coded Character Set and Integer · See more »

International Electrotechnical Commission

The International Electrotechnical Commission (IEC; in French: Commission électrotechnique internationale) is an international standards organization that prepares and publishes International Standards for all electrical, electronic and related technologies – collectively known as "electrotechnology".

New!!: Universal Coded Character Set and International Electrotechnical Commission · See more »

International Organization for Standardization

The International Organization for Standardization (ISO) is an international standard-setting body composed of representatives from various national standards organizations.

New!!: Universal Coded Character Set and International Organization for Standardization · See more »

ISO 14651

, Information technology -- International string ordering and comparison -- Method for comparing character strings and description of the common template tailorable ordering, is an ISO Standard specifying an algorithm that can be used when comparing two strings.

New!!: Universal Coded Character Set and ISO 14651 · See more »

ISO 15924

ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts).

New!!: Universal Coded Character Set and ISO 15924 · See more »

ISO/IEC 2022

ISO/IEC 2022 Information technology—Character code structure and extension techniques, is an ISO standard (equivalent to the ECMA standard ECMA-35) specifying.

New!!: Universal Coded Character Set and ISO/IEC 2022 · See more »

ISO/IEC 646

ISO/IEC 646 is the name of a set of ISO standards, described as Information technology — ISO 7-bit coded character set for information interchange and developed in cooperation with ASCII at least since 1964.

New!!: Universal Coded Character Set and ISO/IEC 646 · See more »

ISO/IEC 8859

ISO/IEC 8859 is a joint ISO and IEC series of standards for 8-bit character encodings.

New!!: Universal Coded Character Set and ISO/IEC 8859 · See more »

ISO/IEC JTC 1/SC 2

ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that develops and facilitates standards within the field of coded character sets.

New!!: Universal Coded Character Set and ISO/IEC JTC 1/SC 2 · See more »

Ken Thompson

Kenneth Lane "Ken" Thompson (born February 4, 1943), commonly referred to as ken in hacker circles, is an American pioneer of computer science.

New!!: Universal Coded Character Set and Ken Thompson · See more »

Language

Language is a system that consists of the development, acquisition, maintenance and use of complex systems of communication, particularly the human ability to do so; and a language is any specific example of such a system.

New!!: Universal Coded Character Set and Language · See more »

List of International Organization for Standardization standards

This is a list of publishedThis list generally excludes draft versions.

New!!: Universal Coded Character Set and List of International Organization for Standardization standards · See more »

List of XML and HTML character entity references

In SGML, HTML and XML documents, the logical constructs known as character data and attribute values consist of sequences of characters, in which each character can manifest directly (representing itself), or can be represented by a series of characters called a character reference, of which there are two types: a numeric character reference and a character entity reference.

New!!: Universal Coded Character Set and List of XML and HTML character entity references · See more »

Plan 9 from Bell Labs

Plan 9 from Bell Labs is a distributed operating system, originating in the Computing Sciences Research Center (CSRC) at Bell Labs in the mid-1980s, and building on UNIX concepts first developed there in the late 1960s; until the Labs' final release at the start of 2015.

New!!: Universal Coded Character Set and Plan 9 from Bell Labs · See more »

Plane (Unicode)

In the Unicode standard, a plane is a continuous group of 65,536 (216) code points.

New!!: Universal Coded Character Set and Plane (Unicode) · See more »

Right-to-left

In a right-to-left, top-to-bottom script (commonly shortened to right to left or abbreviated RTL), writing starts from the right of the page and continues to the left.

New!!: Universal Coded Character Set and Right-to-left · See more »

Rob Pike

Robert "Rob" C. Pike (born 1956) is a Canadian programmer and author.

New!!: Universal Coded Character Set and Rob Pike · See more »

Specials (Unicode block)

Specials is a short Unicode block allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF.

New!!: Universal Coded Character Set and Specials (Unicode block) · See more »

Standard Compression Scheme for Unicode

The Standard Compression Scheme for Unicode (SCSU) is a Unicode Technical Standard for reducing the number of bytes needed to represent Unicode text, especially if that text uses mostly characters from one or a small number of per-language character blocks.

New!!: Universal Coded Character Set and Standard Compression Scheme for Unicode · See more »

Text normalization

Text normalization is the process of transforming text into a single canonical form that it might not have had before.

New!!: Universal Coded Character Set and Text normalization · See more »

Turkish lira sign

The Turkish lira sign (symbol: ₺; image) is the currency symbol used for the Turkish lira, the official currency of Turkey and Northern Cyprus.

New!!: Universal Coded Character Set and Turkish lira sign · See more »

Unicode

Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems.

New!!: Universal Coded Character Set and Unicode · See more »

Unicode character property

The Unicode Standard assigns character properties to each code point.

New!!: Universal Coded Character Set and Unicode character property · See more »

Unicode Consortium

The Unicode Consortium (Unicode Inc.) is a 501(c)(3) non-profit organization that coordinates the development of the Unicode standard, based in Mountain View, California.

New!!: Universal Coded Character Set and Unicode Consortium · See more »

Universal Character Set characters

No description.

New!!: Universal Coded Character Set and Universal Character Set characters · See more »

UTF-1

UTF-1 is one way of transforming ISO 10646/Unicode into a stream of bytes.

New!!: Universal Coded Character Set and UTF-1 · See more »

UTF-16

UTF-16 (16-bit Unicode Transformation Format) is a character encoding capable of encoding all 1,112,064 valid code points of Unicode.

New!!: Universal Coded Character Set and UTF-16 · See more »

UTF-32

UTF-32 stands for Unicode Transformation Format in 32 bits.

New!!: Universal Coded Character Set and UTF-32 · See more »

UTF-8

UTF-8 is a variable width character encoding capable of encoding all 1,112,064 valid code points in Unicode using one to four 8-bit bytes.

New!!: Universal Coded Character Set and UTF-8 · See more »

Working group

A working group or working party is a group of experts working together to achieve specified goals.

New!!: Universal Coded Character Set and Working group · See more »

Writing system

A writing system is any conventional method of visually representing verbal communication.

New!!: Universal Coded Character Set and Writing system · See more »

Xterm

In computing, xterm is the standard terminal emulator for the X Window System.

New!!: Universal Coded Character Set and Xterm · See more »

Redirects here:

10646-1:1993, IEC 10646, ISO 10646, ISO-10646, ISO/CEI 10646, ISO/CEI 10646-1, ISO/CEI 10646-1:1993, ISO/CEI 10646-1:2000, ISO/CEI 10646-2, ISO/CEI 10646-2:2001, ISO/CEI 10646:1993, ISO/CEI 10646:2000, ISO/CEI 10646:2001, ISO/CEI 10646:2003, ISO/CEI 10646:2011, ISO/CEI 10646:2012, ISO/CEI 10646:2014, ISO/IEC 10646, ISO/IEC 10646-1, ISO/IEC 10646-1:1993, ISO/IEC 10646-1:2000, ISO/IEC 10646-1:2000(E), ISO/IEC 10646-2, ISO/IEC 10646-2:2001, ISO/IEC 10646:1993, ISO/IEC 10646:2000, ISO/IEC 10646:2001, ISO/IEC 10646:2003, ISO/IEC 10646:2011, ISO/IEC 10646:2012, ISO/IEC 10646:2014, ISO/IEC JTC1/SC2/WG2, ISO10646, Iso 10646-1, List of Unicode entities, UCS-16, UCS-2, Universal Character Set, Universal Code (Typography), Universal character set, Universal code (typography).

References

[1] https://en.wikipedia.org/wiki/Universal_Coded_Character_Set

Unionpedia is a concept map or semantic network organized like an encyclopedia – dictionary. It gives a brief definition of each concept and its relationships.

This is a giant online mental map that serves as a basis for concept diagrams. It's free to use and each article or document can be downloaded. It's a tool, resource or reference for study, research, education, learning or teaching, that can be used by teachers, educators, pupils or students; for the academic world: for school, primary, secondary, high school, middle, technical degree, college, university, undergraduate, master's or doctoral degrees; for papers, reports, projects, ideas, documentation, surveys, summaries, or thesis. Here is the definition, explanation, description, or the meaning of each significant on which you need information, and a list of their associated concepts as a glossary. Available in English, Spanish, Portuguese, Japanese, Chinese, French, German, Italian, Polish, Dutch, Russian, Arabic, Hindi, Swedish, Ukrainian, Hungarian, Catalan, Czech, Hebrew, Danish, Finnish, Indonesian, Norwegian, Romanian, Turkish, Vietnamese, Korean, Thai, Greek, Bulgarian, Croatian, Slovak, Lithuanian, Filipino, Latvian, Estonian and Slovenian. More languages soon.

All the information was extracted from Wikipedia, and it's available under the Creative Commons Attribution-ShareAlike License.

Unionpedia is not endorsed by or affiliated with the Wikimedia Foundation.

Google Play, Android and the Google Play logo are trademarks of Google Inc.

Universal Coded Character Set

Redirects here:

References

Languages