Get it on Google Play
New! Download Unionpedia on your Android™ device!
Faster access than browser!

Fault tolerance

Index Fault tolerance

Fault tolerance is the property that enables a system to continue operating properly in the event of the failure (or one or more faults within) some of its components. [1]

72 relations: ACM Computing Surveys, Airbag, Availability, Chernobyl disaster, Computer, Computer hardware, Control reconfiguration, Corrosion, Crossbar switch, Damage tolerance, Data redundancy, Data storage, Defence in depth, Dual modular redundancy, Ecological resilience, Elegant degradation, Error-tolerant design, Fail-deadly, Fail-fast, Fail-safe, Failing badly, Failover, Failure, Failure semantics, Failure transparency, Fatigue (material), Fault detection and isolation, Fault-tolerant computer system, Firewall (computing), Forward compatibility, Graceful exit, Gravity, High availability, Hot swapping, HTML, Human error, Human spaceflight, John von Neumann, List of system quality attributes, Lockstep (computing), Mean time between failures, Mean time to repair, National Institute of Standards and Technology, NonStop (server computers), Nuclear reactor, Parallel computing, Peter J. Denning, Progressive enhancement, Quorum, RAID, ..., Redundancy (engineering), Replication (computing), Resilience (engineering and construction), Resilience (network), Response time (technology), Reversion (software development), Robustness (computer science), Safe-life design, Safety-critical system, Seat belt, Self-stabilization, Single point of failure, Software, Software brittleness, Synchronization (computer science), System, Tandem Computers, Throughput, Triple modular redundancy, Uptime, Web browser, Western Electric. Expand index (22 more) »

ACM Computing Surveys

ACM Computing Surveys (CSUR) is a peer reviewed scientific journal published by the Association for Computing Machinery.

New!!: Fault tolerance and ACM Computing Surveys · See more »


An airbag is a type of vehicle safety device and is an occupant restraint system.

New!!: Fault tolerance and Airbag · See more »


In reliability theory and reliability engineering, the term availability has the following meanings.

New!!: Fault tolerance and Availability · See more »

Chernobyl disaster

The Chernobyl disaster, also referred to as the Chernobyl accident, was a catastrophic nuclear accident.

New!!: Fault tolerance and Chernobyl disaster · See more »


A computer is a device that can be instructed to carry out sequences of arithmetic or logical operations automatically via computer programming.

New!!: Fault tolerance and Computer · See more »

Computer hardware

Computer hardware includes the physical parts or components of a computer, such as the central processing unit, monitor, keyboard, computer data storage, graphic card, sound card and motherboard.

New!!: Fault tolerance and Computer hardware · See more »

Control reconfiguration

Control reconfiguration is an active approach in control theory to achieve fault-tolerant control for dynamic systems.

New!!: Fault tolerance and Control reconfiguration · See more »


Corrosion is a natural process, which converts a refined metal to a more chemically-stable form, such as its oxide, hydroxide, or sulfide.

New!!: Fault tolerance and Corrosion · See more »

Crossbar switch

In electronics, a crossbar switch (cross-point switch, matrix switch) is a collection of switches arranged in a matrix configuration.

New!!: Fault tolerance and Crossbar switch · See more »

Damage tolerance

Damage tolerance is a property of a structure relating to its ability to sustain defects safely until repair can be effected.

New!!: Fault tolerance and Damage tolerance · See more »

Data redundancy

In computer main memory, auxiliary storage and computer buses, data redundancy is the existence of data that is additional to the actual data and permits correction of errors in stored or transmitted data.

New!!: Fault tolerance and Data redundancy · See more »

Data storage

Data storage is the recording (storing) of information (data) in a storage medium.

New!!: Fault tolerance and Data storage · See more »

Defence in depth

Defence in depth (also known as deep or elastic defence) is a military strategy that seeks to delay rather than prevent the advance of an attacker, buying time and causing additional casualties by yielding space.

New!!: Fault tolerance and Defence in depth · See more »

Dual modular redundancy

In reliability engineering, dual modular redundancy (DMR) is when components of a system are duplicated, providing redundancy in case one should fail.

New!!: Fault tolerance and Dual modular redundancy · See more »

Ecological resilience

In ecology, resilience is the capacity of an ecosystem to respond to a perturbation or disturbance by resisting damage and recovering quickly.

New!!: Fault tolerance and Ecological resilience · See more »

Elegant degradation

Elegant degradation is a term used in engineering to describe what occurs to machines which are subject to constant, repetitive stress.

New!!: Fault tolerance and Elegant degradation · See more »

Error-tolerant design

An error-tolerant design (also: human-error-tolerant design) is one that does not unduly penalize user or human errors.

New!!: Fault tolerance and Error-tolerant design · See more »


Fail-deadly is a concept in nuclear military strategy that encourages deterrence by guaranteeing an immediate, automatic, and overwhelming response to an attack.

New!!: Fault tolerance and Fail-deadly · See more »


In systems design, a fail-fast system is one which immediately reports at its interface any condition that is likely to indicate a failure.

New!!: Fault tolerance and Fail-fast · See more »


A fail-safe in engineering is a design feature or practice that in the event of a specific type of failure, inherently responds in a way that will cause no or minimal harm to other equipment, the environment or to people.

New!!: Fault tolerance and Fail-safe · See more »

Failing badly

Failing badly and failing well are concepts in systems security and network security (and engineering in general) describing how a system reacts to failure.

New!!: Fault tolerance and Failing badly · See more »


In computing and related technologies such as networking, failover is switching to a redundant or standby computer server, system, hardware component or network upon the failure or abnormal termination of the previously active application, server, system, hardware component, or network.

New!!: Fault tolerance and Failover · See more »


Failure is the state or condition of not meeting a desirable or intended objective, and may be viewed as the opposite of success.

New!!: Fault tolerance and Failure · See more »

Failure semantics

In distributed computing, failure semantics is used to describe and classify errors that distributed systems can experience.

New!!: Fault tolerance and Failure semantics · See more »

Failure transparency

In a distributed system, failure transparency refers to the extent to which errors and subsequent recoveries of hosts and services within the system are invisible to users and applications.

New!!: Fault tolerance and Failure transparency · See more »

Fatigue (material)

In materials science, fatigue is the weakening of a material caused by repeatedly applied loads.

New!!: Fault tolerance and Fatigue (material) · See more »

Fault detection and isolation

Fault detection, isolation, and recovery (FDIR) is a subfield of control engineering which concerns itself with monitoring a system, identifying when a fault has occurred, and pinpointing the type of fault and its location.

New!!: Fault tolerance and Fault detection and isolation · See more »

Fault-tolerant computer system

Fault-tolerant computer systems are systems designed around the concepts of fault tolerance.

New!!: Fault tolerance and Fault-tolerant computer system · See more »

Firewall (computing)

In computing, a firewall is a network security system that monitors and controls incoming and outgoing network traffic based on predetermined security rules.

New!!: Fault tolerance and Firewall (computing) · See more »

Forward compatibility

Forward compatibility or upward compatibility is a design characteristic that allows a system to accept input intended for a later version of itself.

New!!: Fault tolerance and Forward compatibility · See more »

Graceful exit

A graceful exit (or graceful handling) is a simple programming idiom wherein a program detects a serious error condition and "exits gracefully" in a controlled manner as a result.

New!!: Fault tolerance and Graceful exit · See more »


Gravity, or gravitation, is a natural phenomenon by which all things with mass or energy—including planets, stars, galaxies, and even light—are brought toward (or gravitate toward) one another.

New!!: Fault tolerance and Gravity · See more »

High availability

High availability is a characteristic of a system, which aims to ensure an agreed level of operational performance, usually uptime, for a higher than normal period.

New!!: Fault tolerance and High availability · See more »

Hot swapping

Hot swapping (frequently inaccurately called hot plugging) is replacing or adding components without stopping or shutting down the system.

New!!: Fault tolerance and Hot swapping · See more »


Hypertext Markup Language (HTML) is the standard markup language for creating web pages and web applications.

New!!: Fault tolerance and HTML · See more »

Human error

Human error has been cited as a primary cause contributing factor in disasters and accidents in industries as diverse as nuclear power (e.g., the Three Mile Island accident), aviation (see pilot error), space exploration (e.g., the Space Shuttle Challenger Disaster and Space Shuttle Columbia disaster), and medicine (see medical error).

New!!: Fault tolerance and Human error · See more »

Human spaceflight

Human spaceflight (also referred to as crewed spaceflight or manned spaceflight) is space travel with a crew or passengers aboard the spacecraft.

New!!: Fault tolerance and Human spaceflight · See more »

John von Neumann

John von Neumann (Neumann János Lajos,; December 28, 1903 – February 8, 1957) was a Hungarian-American mathematician, physicist, computer scientist, and polymath.

New!!: Fault tolerance and John von Neumann · See more »

List of system quality attributes

Within systems engineering, quality attributes are realized non-functional requirements used to evaluate the performance of a system.

New!!: Fault tolerance and List of system quality attributes · See more »

Lockstep (computing)

Lockstep systems are fault-tolerant computer systems that run the same set of operations at the same time in parallel.

New!!: Fault tolerance and Lockstep (computing) · See more »

Mean time between failures

Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a mechanical or electronic system, during normal system operation.

New!!: Fault tolerance and Mean time between failures · See more »

Mean time to repair

Mean Time To Repair (MTTR) is a basic measure of the maintainability of repairable items.

New!!: Fault tolerance and Mean time to repair · See more »

National Institute of Standards and Technology

The National Institute of Standards and Technology (NIST) is one of the oldest physical science laboratories in the United States.

New!!: Fault tolerance and National Institute of Standards and Technology · See more »

NonStop (server computers)

NonStop is a series of server computers introduced to market in 1976 by Tandem Computers Inc., beginning with the NonStop product line, which was followed by the Hewlett-Packard Integrity NonStop product line extension.

New!!: Fault tolerance and NonStop (server computers) · See more »

Nuclear reactor

A nuclear reactor, formerly known as an atomic pile, is a device used to initiate and control a self-sustained nuclear chain reaction.

New!!: Fault tolerance and Nuclear reactor · See more »

Parallel computing

Parallel computing is a type of computation in which many calculations or the execution of processes are carried out concurrently.

New!!: Fault tolerance and Parallel computing · See more »

Peter J. Denning

Peter James Denning (born January 6, 1942) is an American computer scientist and writer.

New!!: Fault tolerance and Peter J. Denning · See more »

Progressive enhancement

Progressive enhancement is a strategy for web design that emphasizes core webpage content first.

New!!: Fault tolerance and Progressive enhancement · See more »


A quorum is the minimum number of members of a deliberative assembly (a body that uses parliamentary procedure, such as a legislature) necessary to conduct the business of that group.

New!!: Fault tolerance and Quorum · See more »


RAID (Redundant Array of Independent Disks, originally Redundant Array of Inexpensive Disks) is a data storage virtualization technology that combines multiple physical disk drive components into one or more logical units for the purposes of data redundancy, performance improvement, or both.

New!!: Fault tolerance and RAID · See more »

Redundancy (engineering)

In engineering, redundancy is the duplication of critical components or functions of a system with the intention of increasing reliability of the system, usually in the form of a backup or fail-safe, or to improve actual system performance, such as in the case of GNSS receivers, or multi-threaded computer processing.

New!!: Fault tolerance and Redundancy (engineering) · See more »

Replication (computing)

Replication in computing involves sharing information so as to ensure consistency between redundant resources, such as software or hardware components, to improve reliability, fault-tolerance, or accessibility.

New!!: Fault tolerance and Replication (computing) · See more »

Resilience (engineering and construction)

The resilience concept originated from ecology and then gradually applied to other fields.

New!!: Fault tolerance and Resilience (engineering and construction) · See more »

Resilience (network)

In computer networking: resilience is the ability to provide and maintain an acceptable level of service in the face of faults and challenges to normal operation.” Threats and challenges for services can range from simple misconfiguration over large scale natural disasters to targeted attacks.

New!!: Fault tolerance and Resilience (network) · See more »

Response time (technology)

In technology, response time is the time a system or functional unit takes to react to a given input.

New!!: Fault tolerance and Response time (technology) · See more »

Reversion (software development)

In software development (and, by extension, in content-editing environments, especially wikis, that make use of the software development process of revision control), reversion or reverting is the abandonment of one or more recent changes in favor of a return to a previous version of the material at hand (typically software source code in the context of application development; HTML, CSS or script code in the context of web development; or content and formatting thereof in the context of wikis).

New!!: Fault tolerance and Reversion (software development) · See more »

Robustness (computer science)

In computer science, robustness is the ability of a computer system to cope with errors during execution1990.

New!!: Fault tolerance and Robustness (computer science) · See more »

Safe-life design

In safe-life design products are designed to survive a specific design life with a chosen reserve.

New!!: Fault tolerance and Safe-life design · See more »

Safety-critical system

A safety-critical system or life-critical system is a system whose failure or malfunction may result in one (or more) of the following outcomes.

New!!: Fault tolerance and Safety-critical system · See more »

Seat belt

A seat belt (also known as a seatbelt or safety belt) is a vehicle safety device designed to secure the occupant of a vehicle against harmful movement that may result during a collision or a sudden stop.

New!!: Fault tolerance and Seat belt · See more »


Self-stabilization is a concept of fault-tolerance in distributed computing.

New!!: Fault tolerance and Self-stabilization · See more »

Single point of failure

A single point of failure (SPOF) is a part of a system that, if it fails, will stop the entire system from working.

New!!: Fault tolerance and Single point of failure · See more »


Computer software, or simply software, is a generic term that refers to a collection of data or computer instructions that tell the computer how to work, in contrast to the physical hardware from which the system is built, that actually performs the work.

New!!: Fault tolerance and Software · See more »

Software brittleness

In computer programming and software engineering, software brittleness is the increased difficulty in fixing older software that may appear reliable, but fails badly when presented with unusual data or altered in a seemingly minor way.

New!!: Fault tolerance and Software brittleness · See more »

Synchronization (computer science)

In computer science, synchronization refers to one of two distinct but related concepts: synchronization of processes, and synchronization of Data.

New!!: Fault tolerance and Synchronization (computer science) · See more »


A system is a regularly interacting or interdependent group of items forming an integrated whole.

New!!: Fault tolerance and System · See more »

Tandem Computers

Tandem Computers, Inc. was the dominant manufacturer of fault-tolerant computer systems for ATM networks, banks, stock exchanges, telephone switching centers, and other similar commercial transaction processing applications requiring maximum uptime and zero data loss.

New!!: Fault tolerance and Tandem Computers · See more »


In general terms, throughput is the maximum rate of production or the maximum rate at which something can be processed.

New!!: Fault tolerance and Throughput · See more »

Triple modular redundancy

In computing, triple modular redundancy, sometimes called triple-mode redundancy, (TMR) is a fault-tolerant form of N-modular redundancy, in which three systems perform a process and that result is processed by a majority-voting system to produce a single output.

New!!: Fault tolerance and Triple modular redundancy · See more »


Uptime is a measure of the time a machine, typically a computer, has been working and available.

New!!: Fault tolerance and Uptime · See more »

Web browser

A web browser (commonly referred to as a browser) is a software application for accessing information on the World Wide Web.

New!!: Fault tolerance and Web browser · See more »

Western Electric

Western Electric Company (WE, WECo) was an American electrical engineering and manufacturing company that served as the primary supplier to AT&T from 1881 to 1996.

New!!: Fault tolerance and Western Electric · See more »

Redirects here:

Damage tolerant design, Degrade gracefully, Degrades gracefully, Fail gracefully, Fail soft, Fail-soft operation, Failure resistance, Failure tolerance, Fault Tolerance, Fault tolerant, Fault tolerant design, Fault tolerant designs, Fault tolerant system, Fault tolerant systems, Fault-tolerance, Fault-tolerant, Fault-tolerant computing, Fault-tolerant design, Fault-tolerant designs, Fault-tolerant system, Fault-tolerant systems, Graceful degradation, Graceful failure.


[1] https://en.wikipedia.org/wiki/Fault_tolerance

Hey! We are on Facebook now! »