Definition

FMEA (Failure Mode and Effects Analysis)

By

Alexander S. Gillis, Technical Writer and Editor
Wendy Schuchart, TechTarget

Published: Dec 20, 2022

What is FMEA (Failure Mode and Effects Analysis)?

FMEA (failure mode and effects analysis) is a step-by-step approach for collecting knowledge about possible points of failure in a design, manufacturing process, product or service.

Failure mode (FM) refers to the way in which something might break down. It includes potential errors that might occur, especially errors that could affect the customer. Effects analysis (EA) involves deciphering the consequences of those breakdowns. It does this by ensuring all failures can be detected, by determining how frequently a failure might occur and by identifying which potential failures should be prioritized. Business analysts typically use FMEA templates to help them complete analyses.

As a risk assessment tool, FMEA uses a scoring scale of 1 to 10. A score of 1 indicates low risk, while a score of 10 indicates very high risk.

For development and manufacturing organizations, FMEA is an effective method of reducing the number of possible failures across the phases of the product lifecycle.

Types of FMEA analyses

There are three main types of failure mode and effects analysis.

Design FMEA (DFMEA). This focuses on how to prevent or mitigate possible system, product or process failures. DFMEA is used to determine potential failures, how severe their effects could be, and how to prevent and mitigate them. This process helps engineers detect failures early on, so they can be corrected before they become costly.
Process FMEA (PFMEA). This focuses on identifying potential risks to a process. PFMEA helps identify process functions, failure modes and effects so that organizations understand the possible risks of each process step as early as possible.
Functional FMEA (FFMEA). This focuses on avoiding possible failures before corrective actions must be taken. FFMEA identifies and prioritizes potential functional failure modes.

When to use FMEA

A business analyst might perform an FMEA when a product or service is being designed or fixed, or when an existing product or service is being used in a new way. FMEA can also be used before developing control plans for a new process or following a quality function deployment. Lean production methodology uses FMEA periodically throughout the lifecycle of a product or service. FMEA can be used to identify and mitigate potential hardware risks as well.

FMEA is generally used when improvement goals are set, or when new designs, changes, features, regulations or feedback are introduced, as these are the points where potential failures can arise and be detected.

Benefits of using FMEA

FMEA offers organizations the following benefits:

gives them an early way to identify and mitigate potential modes of failure;
minimizes the need to make late changes to a project due to potential issues;
reduces the risk of a problem happening more than once;
provides prompts for employees to follow when facing a potential failure mode;
promotes more collaboration among teams that handle areas such as design, manufacturing, quality, testing and sales; and
reduces costs by catching issues early instead of fixing them later in development.

Figure: Eight general steps to follow while implementing FMEA. Procedures may differ depending on the organization.

FMEA procedure

Failure mode and effects analysis might be implemented differently, depending on the organization. As such, the number of steps involved may also differ by organization. As a general process, FMEA steps include the following:

Create a team of employees who have collective knowledge or experience with the system, design or process and customer needs. This includes employees with experience in customer service, design, maintenance, manufacturing, quality, reliability, testing and sales.
Identify the scope of the system, design, process, product or service, and define its purpose.
Break down a system, design or process into its different components.
Go through system, design or process elements to determine each possible issue or single point of failure.
Analyze the potential causes of those failures as well as the effects the failures would have.
Rank each potential failure effect based on decided criteria such as severity, likelihood of occurrence and probability of being detected. Organizations can use a risk priority number to score a system, design or process for risk potential.
Determine how to detect, minimize, mitigate and solve the most critical failures. This helps keep failure effect risks low by creating a list of potential failures and corrective actions to take.
Revise risk levels as needed.
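
The ranking step above can be sketched in code. The article does not give a formula for the risk priority number (RPN); the sketch below assumes the commonly used convention of multiplying the severity, occurrence and detection ratings, each on the 1 to 10 scale, with a higher detection rating meaning the failure is harder to detect. The class name, function name and sample failure modes are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FailureMode:
    """One row of a hypothetical FMEA worksheet."""
    name: str
    severity: int    # 1 (negligible effect) to 10 (most severe effect)
    occurrence: int  # 1 (very unlikely) to 10 (almost certain)
    detection: int   # assumed convention: 1 (easily detected) to 10 (hard to detect)

    def __post_init__(self) -> None:
        # Reject ratings outside the 1-10 scoring scale.
        for field_name in ("severity", "occurrence", "detection"):
            value = getattr(self, field_name)
            if not 1 <= value <= 10:
                raise ValueError(f"{field_name} must be between 1 and 10, got {value}")

    @property
    def rpn(self) -> int:
        """Risk priority number, assumed here to be severity x occurrence x detection."""
        return self.severity * self.occurrence * self.detection


def rank_by_rpn(modes):
    """Return failure modes sorted from highest to lowest RPN."""
    return sorted(modes, key=lambda m: m.rpn, reverse=True)


if __name__ == "__main__":
    # Made-up ratings for illustration only.
    worksheet = [
        FailureMode("Seal leaks under pressure", severity=8, occurrence=3, detection=4),
        FailureMode("Label misprinted", severity=2, occurrence=6, detection=2),
        FailureMode("Sensor drifts out of calibration", severity=6, occurrence=4, detection=7),
    ]
    for mode in rank_by_rpn(worksheet):
        print(f"{mode.rpn:4d}  {mode.name}")
```

Under these assumptions, the failure modes at the top of the sorted list are the first candidates for corrective action. After a corrective action is taken, the ratings can be updated and the list re-ranked, which corresponds to revising risk levels as needed.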

