78 relations: Apache OpenOffice, ASCII, AWK, Backslash, Byte order mark, Character (computing), Character encoding, Comma, Comparison of data serialization formats, Computer file, Computing, Cut (Unix), Data exchange, Database, De facto, De jure, Decimal separator, Delimiter, Delimiter-separated values, Demographic and Health Surveys, Document, EBCDIC, Emacs, Endianness, Escape character, Field (computer science), File format, Filename extension, Foreign key, Fortran, Grapheme, Hierarchy, HTML, IBM, Internet Assigned Numbers Authority, Internet Engineering Task Force, Join (Unix), LibreOffice Calc, Locale (computer software), Media type, Metadata, Microsoft Excel, Newline, Numbers (spreadsheet), Object-oriented programming, Open Knowledge International, OS/360 and successors, Osborne Computer Corporation, Paste (Unix), Plain text, ..., Programming language, Proprietary format, Punched card, Quotation mark, Record (computer science), Relation (database), Relational database, Row (database), Scripting language, Semantic Web, Shift JIS, Sort (Unix), Spreadsheet, Statistics, SuperCalc, Sybase, Tab-separated values, Table (information), Text editor, Text file, Unicode, Uniq, Unix, UTF-16, UTF-8, Word (computer architecture), World Wide Web Consortium, XML. Expand index (28 more) » « Shrink index
Apache OpenOffice (AOO) is an open-source office productivity software suite.
ASCII, abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication.
AWK is a programming language designed for text processing and typically used as a data extraction and reporting tool.
The backslash (\) is a typographical mark (glyph) used mainly in computing and is the mirror image of the common slash (/).
The byte order mark (BOM) is a Unicode character,, whose appearance as a magic number at the start of a text stream can signal several things to a program consuming the text.
In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language.
Character encoding is used to represent a repertoire of characters by some kind of encoding system.
The comma is a punctuation mark that appears in several variants in different languages.
This is a comparison of data serialization formats, various ways to convert complex objects to sequences of bits.
A computer file is a computer resource for recording data discretely in a computer storage device.
Computing is any goal-oriented activity requiring, benefiting from, or creating computers.
In computing, cut is a Unix command line utility which is used to extract sections from each line of input — usually from a file.
Data exchange is the process of taking data structured under a source schema and transforming it into data structured under a target schema, so that the target data is an accurate representation of the source data.
A database is an organized collection of data, stored and accessed electronically.
In law and government, de facto (or;, "in fact") describes practices that exist in reality, even if not legally recognised by official laws.
In law and government, de jure (lit) describes practices that are legally recognised, whether or not the practices exist in reality.
A decimal separator is a symbol used to separate the integer part from the fractional part of a number written in decimal form.
A delimiter is a sequence of one or more characters used to specify the boundary between separate, independent regions in plain text or other data streams.
Formats that use delimiter-separated values (also DSV)DSV stands for Delimiter Separated Values store two-dimensional arrays of data by separating the values in each row with specific delimiter characters.
The Demographic and Health Surveys (DHS) Program is responsible for collecting and disseminating accurate, nationally representative data on health and population in developing countries.
A document is a written, drawn, presented, or memorialized representation of thought.
Extended Binary Coded Decimal Interchange Code (EBCDIC) is an eight-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems.
Emacs is a family of text editors that are characterized by their extensibility.
Endianness refers to the sequential order in which bytes are arranged into larger numerical values when stored in memory or when transmitted over digital links.
In computing and telecommunication, an escape character is a character which invokes an alternative interpretation on subsequent characters in a character sequence.
In computer science, data that has several parts, known as a record, can be divided into fields.
A file format is a standard way that information is encoded for storage in a computer file.
A filename extension is an identifier specified as a suffix to the name of a computer file.
In the context of relational databases, a foreign key is a field (or collection of fields) in one table that uniquely identifies a row of another table or the same table.
Fortran (formerly FORTRAN, derived from Formula Translation) is a general-purpose, compiled imperative programming language that is especially suited to numeric computation and scientific computing.
In linguistics, a grapheme is the smallest unit of a writing system of any given language.
A hierarchy (from the Greek hierarchia, "rule of a high priest", from hierarkhes, "leader of sacred rites") is an arrangement of items (objects, names, values, categories, etc.) in which the items are represented as being "above", "below", or "at the same level as" one another A hierarchy can link entities either directly or indirectly, and either vertically or diagonally.
Hypertext Markup Language (HTML) is the standard markup language for creating web pages and web applications.
The International Business Machines Corporation (IBM) is an American multinational technology company headquartered in Armonk, New York, United States, with operations in over 170 countries.
The Internet Assigned Numbers Authority (IANA) is a function of ICANN, a nonprofit private American corporation that oversees global IP address allocation, autonomous system number allocation, root zone management in the Domain Name System (DNS), media types, and other Internet Protocol-related symbols and Internet numbers.
The Internet Engineering Task Force (IETF) develops and promotes voluntary Internet standards, in particular the standards that comprise the Internet protocol suite (TCP/IP).
join is a command in Unix-like operating systems that merges the lines of two sorted text files based on the presence of a common field.
LibreOffice Calc is the spreadsheet component of the LibreOffice software package.
In computing, a locale is a set of parameters that defines the user's language, region and any special variant preferences that the user wants to see in their user interface.
A media type (formerly known as MIME type) is a two-part identifier for file formats and format contents transmitted on the Internet.
Metadata is "data that provides information about other data".
Microsoft Excel is a spreadsheet developed by Microsoft for Windows, macOS, Android and iOS.
Newline (frequently called line ending, end of line (EOL), line feed, or line break) is a control character or sequence of control characters in a character encoding specification, e.g. ASCII or EBCDIC.
Numbers is a spreadsheet application developed by Apple Inc. as part of the iWork productivity suite alongside Keynote and Pages.
Object-oriented programming (OOP) is a programming paradigm based on the concept of "objects", which may contain data, in the form of fields, often known as attributes; and code, in the form of procedures, often known as methods. A feature of objects is that an object's procedures can access and often modify the data fields of the object with which they are associated (objects have a notion of "this" or "self").
Open Knowledge International (OKI) (known as the Open Knowledge Foundation (OKF) until April 2014, then Open Knowledge until May 2016) is a global non-profit network that promotes and shares information at no charge, including both content and data.
OS/360, officially known as IBM System/360 Operating System, is a discontinued batch processing operating system developed by IBM for their then-new System/360 mainframe computer, announced in 1964; it was heavily influenced by the earlier IBSYS/IBJOB and Input/Output Control System (IOCS) packages.
The Osborne Computer Corporation (OCC) was a pioneering maker of portable computers.
paste is a Unix command line utility which is used to join files horizontally (parallel merging) by outputting lines consisting of the sequentially corresponding lines of each file specified, separated by tabs, to the standard output.
In computing, plain text is the data (e.g. file contents) that represent only characters of readable material but not its graphical representation nor other objects (images, etc.). It may also include a limited number of characters that control simple arrangement of text, such as line breaks or tabulation characters.
A programming language is a formal language that specifies a set of instructions that can be used to produce various kinds of output.
A proprietary format is a file format of a company, organization, or individual that contains data that is ordered and stored according to a particular encoding-scheme, designed by the company or organization to be secret, such that the decoding and interpretation of this stored data is easily accomplished only with particular software or hardware that the company itself has developed.
A punched card or punch card is a piece of stiff paper that can be used to contain digital data represented by the presence or absence of holes in predefined positions.
Quotation marks, also called quotes, quote marks, quotemarks, speech marks, inverted commas or talking marks, are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase.
In computer science, a record (also called a structure, struct, or compound data) is a basic data structure.
In relational database theory, a relation, as originally defined by E. F. Codd, is a set of tuples (d1, d2,..., dn), where each element dj is a member of Dj, a data domain.
A relational database is a digital database based on the relational model of data, as proposed by E. F. Codd in 1970.
In the context of a relational database, a row—also called a tuple—represents a single, implicitly structured data item in a table.
A scripting or script language is a programming language that supports scripts: programs written for a special run-time environment that automate the execution of tasks that could alternatively be executed one-by-one by a human operator.
The Semantic Web is an extension of the World Wide Web through standards by the World Wide Web Consortium (W3C).
--> Shift JIS (Shift Japanese Industrial Standards, also SJIS, MIME name Shift_JIS) is a character encoding for the Japanese language, originally developed by a Japanese company called ASCII Corporation in conjunction with Microsoft and standardized as JIS X 0208 Appendix 1.
In Unix-like operating systems, sort is a standard command line program that prints the lines of its input or concatenation of all files listed in its argument list in sorted order.
A spreadsheet is an interactive computer application for organization, analysis and storage of data in tabular form.
Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data.
SuperCalc is a CP/M-80 spreadsheet application published by Sorcim in 1981.
Sybase was an enterprise software and services company that produced software to manage and analyze information in relational databases.
A tab-separated values (TSV) file is a simple text format for storing data in a tabular structure, e.g., database table or spreadsheet data, and a way of exchanging information between databases.
A table is an arrangement of data in rows and columns, or possibly in a more complex structure.
A text editor is a type of computer program that edits plain text.
A text file (sometimes spelled "textfile"; an old alternative name is "flatfile") is a kind of computer file that is structured as a sequence of lines of electronic text.
Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems.
uniq is a Unix utility which, when fed a text file, outputs the file with adjacent identical lines collapsed to one.
Unix (trademarked as UNIX) is a family of multitasking, multiuser computer operating systems that derive from the original AT&T Unix, development starting in the 1970s at the Bell Labs research center by Ken Thompson, Dennis Ritchie, and others.
UTF-16 (16-bit Unicode Transformation Format) is a character encoding capable of encoding all 1,112,064 valid code points of Unicode.
UTF-8 is a variable width character encoding capable of encoding all 1,112,064 valid code points in Unicode using one to four 8-bit bytes.
In computing, a word is the natural unit of data used by a particular processor design.
The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web (abbreviated WWW or W3).
In computing, Extensible Markup Language (XML) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable.
.csv, COMMA-SEPARATED VALUES, CSV (file format), CSV file, CSV-1203, Comma Separated Value, Comma Separated Values, Comma delimited, Comma delimited format, Comma separated list, Comma separated lists, Comma separated value, Comma separated values, Comma seperated values, Comma-Separated Values, Comma-separated list, Comma-separated value, Comma-seperated values, Csv-schema.