Definition

Windows Server Failover Clustering (WSFC)

By Gavin Wright

Published: Mar 08, 2022

What is Windows Server Failover Clustering (WSFC)?

Windows Server Failover Clustering (WSFC) is a feature of the Microsoft Windows Server operating system that provides fault tolerance and high availability (HA) for applications and services. It enables several computers to host a service; if one has a fault, the remaining computers automatically take over hosting the service. WSFC is included with Windows Server 2022, Windows Server 2019, Windows Server 2016 and Azure Stack HCI.

In WSFC, each individual server is called a node. The nodes can be physical computers or virtual machines, and are connected through physical connections and through software. Two or more nodes are combined to form a cluster, which hosts the service. The cluster and nodes are constantly monitored for faults. If a fault is detected, the nodes with issues are removed from the cluster and the services may be restarted or moved to another node.

Capabilities of Windows Server Failover Clustering (WSFC)

Windows Server Failover Clustering performs several functions, including:

Unified cluster management. The configuration of the cluster and service is stored on each node within the cluster. Changes to the configuration of the service or cluster are automatically sent to each node. This allows for a single update to change the configuration on all participating nodes.
Resource management. Each node in the cluster may have access to resources such as networking and storage. These resources can be shared by the hosted application to increase the cluster performance beyond what a single node can accomplish. The application can be configured to have startup dependencies on these resources. The nodes can work together to ensure resource consistency.
Health monitoring. The health of each node and the overall cluster is monitored. Each node uses heartbeat and service notifications to determine health. The cluster health is voted on by the quorum of participating nodes.
Automatic and manual failover. Resources have a primary node and one or more secondary nodes. If the primary node fails a health check or a failover is manually triggered, ownership and use of the resource are transferred to a secondary node. Nodes and the hosted application are notified of the failover. This provides fault tolerance and allows rolling updates to proceed without affecting overall service health.
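
The health monitoring and failover behavior described above can be illustrated with a small Python sketch. This is a simplified model, not how WSFC is implemented: the `Resource` class, the `healthy_nodes` and `fail_over` functions, the node names and the five-second timeout are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Resource:
    """A clustered resource with an ordered list of possible owners (hypothetical model)."""
    name: str
    owners: list[str]      # primary node first, then secondary nodes
    current_owner: str


def healthy_nodes(last_heartbeat: dict[str, float], now: float, timeout: float) -> set[str]:
    """Treat a node as healthy if its last heartbeat arrived within `timeout` seconds."""
    return {node for node, seen in last_heartbeat.items() if now - seen <= timeout}


def fail_over(resource: Resource, healthy: set[str]) -> Optional[str]:
    """Keep the current owner if it is healthy; otherwise move to the first healthy owner."""
    if resource.current_owner in healthy:
        return resource.current_owner
    for candidate in resource.owners:
        if candidate in healthy:
            resource.current_owner = candidate
            return candidate
    return None  # no healthy owner is available


# node1 last sent a heartbeat 10 seconds ago, so it misses the 5-second timeout.
heartbeats = {"node1": 0.0, "node2": 9.0, "node3": 8.5}
disk = Resource("shared-disk", owners=["node1", "node2", "node3"], current_owner="node1")

healthy = healthy_nodes(heartbeats, now=10.0, timeout=5.0)
new_owner = fail_over(disk, healthy)
print(new_owner)  # node2
```

In this sketch the primary node misses its heartbeat window, so ownership moves to the first healthy secondary node in the list.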

Common applications that use WSFC

A number of different applications can use WSFC, including:

Database Server
Windows Distributed File System (DFS) Namespace Server
File Server
Hyper-V
Microsoft Exchange Server
Microsoft SQL Server
Namespace Server
Windows Internet Name Service (WINS) Server

Figure: Failover cluster configuration wizard.

WSFC voting, quorum and witnesses

Every cluster must account for the possibility that individual nodes lose communication with the cluster but can still serve requests or access resources. If this happens, the service could become corrupt and serve bad responses, or data stores could fall out of sync. This is known as a split-brain condition.

WSFC uses a voting system with quorum to determine failover and to prevent a split-brain condition. In the cluster, a quorum is defined as more than half of the total votes. After a fault, the nodes vote to stay online. If a group of nodes holds fewer votes than the quorum, those nodes are removed. For example, suppose a fault splits a five-node cluster so that three nodes stay in communication in one segment and two in the other. The group of three has a quorum and stays online, while the other two do not have a quorum and go offline.
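
The five-node example can be checked with a few lines of Python. This is a minimal sketch that assumes each node holds one vote and that a quorum means a strict majority of all votes; the function name `has_quorum` is hypothetical.

```python
def has_quorum(votes_present: int, total_votes: int) -> bool:
    """A partition has quorum when it holds a strict majority of all votes."""
    return 2 * votes_present > total_votes


total_votes = 5  # five nodes, one vote each (assumption)

# A fault splits the cluster into a segment of three nodes and a segment of two.
for segment_size in (3, 2):
    state = "stays online" if has_quorum(segment_size, total_votes) else "goes offline"
    print(f"segment of {segment_size}: {state}")
```

Because the two segments cannot both hold a strict majority, at most one of them keeps running, which is what prevents a split-brain condition.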

In small clusters, an extra witness vote should be added. The witness is an extra vote that acts as a tiebreaker in clusters with an even number of nodes. Without a witness, if half of the nodes go offline at one time, the whole service is stopped. A witness is required in clusters with only two nodes and recommended for three- and four-node clusters. In clusters of five or more nodes, a witness does not provide benefits and is not needed. The witness information is stored in a witness.log file. It can be hosted as a File Share Witness, an Azure Cloud Witness or a Disk Witness (also known as a custom quorum disk).
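
The tiebreaker effect of a witness can be sketched in Python. This is an illustrative model only: it assumes one vote per node, one vote for the witness, that the surviving nodes can still reach the witness, and that quorum requires a strict majority of votes. The function name `keeps_quorum` is hypothetical.

```python
def keeps_quorum(nodes_up: int, total_nodes: int, witness: bool = False) -> bool:
    """Return True if the surviving nodes (plus a reachable witness) hold a majority."""
    extra_vote = 1 if witness else 0
    total_votes = total_nodes + extra_vote
    votes_up = nodes_up + extra_vote  # assumes survivors can reach the witness
    return 2 * votes_up > total_votes


# Half of the nodes fail at once in a two-node and a four-node cluster.
for total_nodes in (2, 4):
    survivors = total_nodes // 2
    print(
        f"{total_nodes} nodes, {survivors} up:",
        "no witness ->", keeps_quorum(survivors, total_nodes),
        "| witness ->", keeps_quorum(survivors, total_nodes, witness=True),
    )
```

In both cases the surviving half lacks a majority on its own, and the witness vote is what tips it over the threshold.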

Dynamic quorum allows the number of votes required to constitute a quorum to adjust as faults occur. This way, as long as more than half of the voting nodes don't go offline at one time, the cluster can keep losing nodes without going offline. This allows a single node to run the services as the "last man standing."
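
The difference between a fixed quorum and a dynamic quorum can be illustrated with a toy simulation in Python. This is a deliberately simplified model, not the actual WSFC algorithm: it assumes one vote per node, no witness, nodes failing one at a time, and a hypothetical tie-avoidance rule that drops the vote of the last surviving node in the list whenever the number of survivors is even. The function name `run_failures` and the node names are made up for the example.

```python
def run_failures(nodes: list[str], failure_order: list[str], dynamic: bool) -> list[str]:
    """Return the nodes still running the cluster, or [] if quorum was lost."""
    alive = list(nodes)
    voters = set(nodes)  # nodes whose votes currently count toward quorum
    for failed in failure_order:
        alive.remove(failed)
        surviving_votes = len(voters & set(alive))
        if 2 * surviving_votes <= len(voters):
            return []  # survivors lack a strict majority: cluster goes offline
        if dynamic:
            # Recalculate the vote set from the survivors. When the count is even,
            # drop one vote so a tie cannot occur (illustrative assumption).
            voters = set(alive)
            if len(alive) % 2 == 0:
                voters.discard(alive[-1])
    return alive


nodes = ["A", "B", "C", "D", "E"]
failures = ["E", "D", "C", "B"]  # nodes fail one at a time

print("static quorum: ", run_failures(nodes, failures, dynamic=False))  # []
print("dynamic quorum:", run_failures(nodes, failures, dynamic=True))   # ['A']
```

With a fixed quorum of three out of five votes, the cluster goes offline once the third node fails. With the recalculated vote set, the same sequence of single failures leaves node A running alone. In this toy model the outcome depends on which node holds the remaining vote when only two nodes are left.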

Windows Server Failover Clustering and Microsoft SQL Server Always On

SQL Server Always On is a high-availability and disaster recovery product for Microsoft SQL Server that takes advantage of WSFC. SQL Server Always On has two configurations that can be used separately or in tandem. A Failover Cluster Instance (FCI) is a SQL Server instance that is installed across several nodes in a WSFC. An Availability Group (AG) is a set of one or more databases that fail over together to replicated copies. Both register components with WSFC as cluster resources.

Figure: Windows Server Failover Clustering PowerShell cmdlets.

Windows Server Failover Clustering Setup Steps

See Microsoft for full documentation on how to deploy a failover cluster using WSFC.

1. Verify prerequisites:
   - All nodes run the same Windows Server version.
   - All nodes use supported hardware.
   - All nodes are members of the same Active Directory domain.
2. Install the Failover Clustering feature using the Add Roles and Features wizard in Server Manager.
3. Validate the failover cluster configuration.
4. Create the failover cluster in Server Manager.
5. Create the cluster roles and services using Microsoft Failover Cluster Manager (MSFCM).

