How to find cyber-risk data sources for a FAIR analysis
Cyber-risk quantification with FAIR can change the game for CISOs -- but sourcing enough accurate data for analysis can feel impossible. Learn how and where to find it.
The easiest and fastest -- but also least reliably accurate -- way to assess relative cyber-risk is qualitatively. A qualitative analysis uses subjective data, such as a rating of excellent, good, fair or poor; a rating from 1 to 5, where 1 is excellent and 5 is poor; or a rating of blue, green, yellow, orange or red, where blue is excellent and red is poor.
Quantitative risk analysis is more challenging but also generally more substantive and useful than qualitative analysis. Cyber-risk quantification (CRQ) requires data that reflects reality as closely as possible and is objectively accurate, if not precise. For example, if the precise but unknown value is 63%, a range -- say, between 60% and 70% -- is imprecise yet accurate.
The Factor Analysis of Information Risk (FAIR) model is a widely respected, mathematically based open standard for CRQ that enables CISOs to translate cyber-risk into financial risk. One of the biggest challenges of using the FAIR model, however, is that its analytical output is only as good as its data inputs -- and finding accurate data to feed the model is not always easy or intuitive.
Don't aim for certainty -- aim for less uncertainty
According to the FAIR Institute, most FAIR analyses start with incomplete and imperfect data, which CISOs should not view as a barrier to success. Even without much or any empirical data, CRQ results can still be highly credible, useful and defensible -- if practitioners transparently and consistently document their sources, assumptions, estimations and confidence levels.
The organization also notes that the goal of CRQ is not to predict the future with certainty, but "to reduce uncertainty to a level that supports informed decision-making." With that in mind, informed, calibrated estimates -- based on structured interviews with internal or external subject matter experts (SMEs), for example -- can be as useful as empirical data.
In identifying data for a FAIR analysis, the goal is often to arrive at a reasonable range rather than a single data point. "There is literally nothing we will likely ever need to measure where our only bounds are negative infinity to positive infinity," CRQ expert Douglas Hubbard wrote in his book How to Measure Anything: Finding the Value of "Intangibles" in Business.
There is literally nothing we will likely ever need to measure where our only bounds are negative infinity to positive infinity.
Douglas HubbardOwner, Hubbard Decision Research
In a FAIR Institute blog post, Jack Jones, creator of the FAIR methodology, offered the following tips for estimating an accurate range:
Start with an absurd estimate -- e.g, the person is likely taller than an inch and shorter than 10 feet.
Use references and logical reasoning to continually narrow the range.
Challenge your team's reasoning throughout the calibration process.
Remember that the goal is accuracy, not precision.
Where to find data for a FAIR analysis
Every risk calculation depends on the following fundamental pieces of data:
The likelihood of an event occurring. The FAIR model uses the term loss event frequency.
The severity or impact of the event if it does occur. The FAIR model uses the term loss event magnitude.
How the FAIR model works
While FAIR is conceptually straightforward, it is complex in practice. Practitioners can learn how to use the model themselves through the FAIR Institute's free training resources. Alternatively, they can partner with a vendor that offers FAIR CRQ capabilities, such as Safe, Black Kite or CyberSaint.
Whether in a DIY Excel spreadsheet or a third-party SaaS platform, every FAIR analysis includes the following basic steps:
1. Identify a risk scenario. Establishes the relevant asset -- e.g., servers -- and threat -- e.g., malicious hackers. A risk scenario might also include attack vectors -- e.g., malware -- and possible outcomes -- e.g., system outage.
2. Calculate loss event frequency. Based on the likelihood of an event occurring during a given period and vulnerabilities that increase the likelihood.
3. Evaluate loss event magnitude. Establishes likely financial losses -- primary and secondary -- if a given event occurs.
4.Calculate financial risk. Multiplies loss event frequency by loss event magnitude to calculate overall risk to the business in dollars.
Where to find data for loss event frequency
Loss event frequency represents the number of times a disruptive operational event is likely to occur in a designated timeframe, typically a year.
Practitioners can either estimate loss event frequency using empirical data or derive it by multiplying the following factors:
Threat event frequency. The statistical likelihood of an event. For example, the odds of a home in a particular ZIP code being robbed, based on recent crime data.
Susceptibility. Vulnerabilities that increase the event's likelihood. For example, how often residents of the home leave doors unlocked.
The FAIR Institute suggests practitioners use the following data sources to inform loss event frequency, as well as its contributing factors, threat event frequency and susceptibility.
Data sources for loss event frequency:
Internal data sources:
Incident response (IR) logs from past security events.
Security operations center logs detailing successful exploits.
Historical loss event logs from risk registers or ticketing systems.
External data sources:
Industry-specific information sharing and analysis centers (ISACs).
Loss event magnitude reflects the operational and financial effects of a given event. It might factor in both direct or primary losses, such as ransomware payments and lost productivity, and indirect or secondary losses, such as regulatory fines and reputational damage.
The loss event magnitude value should be computed in financial terms -- e.g., lost revenue.
The FAIR Institute suggests practitioners use the following data sources to inform loss event magnitude.
Data sources for loss event magnitude:
Internal data sources:
Financial and accounting records related to past security incidents.
Securities and Exchange Commission (SEC) disclosures.
Crisis reports.
Regulatory disclosures and enforcement databases -- e.g., General Data Protection Regulation and the SEC.
Public breach databases.
Breach follow-on reports from Cyentia, Deloitte and legal analysis firms.
Industry loss studies from Ponemon, Cyentia and Forrester.
Publicly disclosed fines or class-action settlements.
Market research on brand impact and consumer trust.
SME interviews with PR, crisis management, law and insurance firms.
Alissa Irei is senior site editor of Informa TechTarget Security.
Paul Kirvan, FBCI, CISA, is an independent consultant and technical writer with more than 35 years of experience in business continuity, disaster recovery, resilience, cybersecurity, GRC, telecom and technical writing.