After reading our post, Unemployment as a Commercial Loan Loss Predictor, someone asked, “…what would be a better leading indicator than unemployment rate in your opinion?”

Bottom Line
National unemployment can be a blunt instrument. Industry and geographic segmentation often enables simpler, more intuitive regressors that better explain and forecast C&I losses.

Why Banks Default to Unemployment

At a national level, there are economic variables that lead bank C&I credit losses—they’re available for free at FRED[1]—and we won’t completely discard unemployment, just yet, but the more interesting question is why many banks use the unemployment rate in the first place.

From our experience, banks that use national unemployment as a C&I loss regressor tend also to be firms that don’t segment or partition industries, and, therefore, use it as a one-size-fits-all regressor.[2]

While there are a variety of reasons that developers might be attracted to unemployment, we think it’s frequently related to model choice. For example, the data requirements of using the transition matrix method[3] are tough. There need to be sufficient observations to fill cells with meaningful and reliable transition rates, and that makes it nearly impossible for all but a handful of giant banks to treat industries or segments separately.

A Better Approach: Strategic Segmentation

However, if other regression methods are used, it’s possible to analyze partitions or grouping of industries—either by industry or geography or both—and then it’s possible to use more intuitive variables that are related to those segments. There is no theory, per se, but folks who have been around and have learned from their expertise can easily provide parameters to investigate when the industries are grouped logically. Sometimes they work, sometimes they don’t. An obvious, and usually correct example, is oil prices and the energy industry defaults.[4]

For both types of credit loss forecasts—CECL and stress testing[5]—we would encourage banks to try to use segments similar to how they run the business and manage risks, including C&I portfolio risks and industry or geographic concentrations and limits.

There are instances where it’s not feasible to be at that level of detail, but the goal is to partition industries/geographies into groups that are homogeneous within and heterogeneous across those groups. Broadly put, when there’s an intuitive reason to combine industries, and those industries behave similarly with respect to the explanatory variables, the segment makes sense. Likewise, when other industries behave differently to those same variables, then they should be in other segments. While we write, “industries,” we really mean the industries’ default rates, LGDs or other borrower or loan characteristics that need to be investigated and modeled.

Why bother? Because such segmentation allows for much more meaningful analysis and discussion at ALLL/CECL committee meetings, as well as much more meaningful and useful sensitivity analysis. This type of development work—by either confirming or denying suspicions of relationships among variables—provides valuable information about the loan portfolio. It’s basic research of the bank’s C&I portfolio.

Testing Multiple Approaches

That being said, it’s likely that developers will need to test many different partitions of industries. For example, we once tested 19 different ways to partition a C&I portfolio with 21 different “industries” or collections of NAICS[6] (as regularly reported by its portfolio management group) into homogeneous groups, each with its own regressors and coefficients.

Of the 19 tried, one was selected as best, but we recommended that the bank continue to run the other 18 schemes as a way to show the robustness (or not) of the chosen one. All modeling requires arbitrary choices, and by running all these different schemes through time, we were able to show that anticipated losses didn’t vary more than 10%; so, even if an executive didn’t like our recommendation of the “best” one, he or she could feel comfortable that his or her preferred scheme would have generated similar results. Of course, you can only do that IF you have knowledgeable developers, and good model governance, including well-documented developmental evidence.

Our Recommendation

All else equal, we discourage our clients from using the national unemployment rate for C&I loss forecasting, and with partitioning, its use is often unnecessary—except possibly for that last segment that always exists: “everything else.” There are frequently better, more-specific regressors that are simpler to explain and understand and more relevant to the segment, especially for a regional bank’s footprint.[7]

In that regard, we frequently ask the rhetorical question, “if you can’t forecast your losses, how can you possibly manage your portfolio?” If all you can say is that you estimate losses through the unemployment rate, then you haven’t told me much, and based on the longer time series graph in the earlier post, you’re probably late, too.


Notes

  1. FRED (Federal Reserve Economic Data) is the Federal Reserve Bank of St. Louis’s database of economic data, providing free access to hundreds of thousands of economic time series from various sources. ↑ Back to text
  2. Before proceeding, please note that we’re generalizing. We know it, but we’d rather mention it once rather than qualify each of our statements below. ↑ Back to text
  3. Transition matrix methods track the migration of loans between risk ratings over time, requiring substantial historical data to populate each cell with statistically meaningful transition probabilities. ↑ Back to text
  4. The relationship between oil prices and energy sector defaults became particularly evident during the 2014-2016 oil price collapse, when energy-focused banks saw significant increases in charge-offs. ↑ Back to text
  5. CECL (Current Expected Credit Losses) is the FASB accounting standard requiring banks to estimate lifetime expected losses at origination, while stress testing evaluates portfolio performance under adverse scenarios. ↑ Back to text
  6. NAICS (North American Industry Classification System) codes provide standardized industry classifications used by statistical agencies and banks for portfolio segmentation and reporting purposes. ↑ Back to text
  7. Regional banks often have geographic concentration that enables more targeted economic variables than national unemployment—such as local employment rates, regional commodity prices, or metro area economic indicators. ↑ Back to text
  8. The CARES Act (Coronavirus Aid, Relief, and Economic Security Act) provided unprecedented government support including PPP loans, forbearance programs, and other measures that disrupted traditional loss patterns. ↑ Back to text