Definition

model card in machine learning

Stephen J. Bigelow

By

Stephen J. Bigelow, Senior Technology Editor

Published: Mar 25, 2024

What is a model card in machine learning?

A model card is a type of documentation that is created for, and provided with, machine learning models. A model card functions as a type of data sheet, similar in principle to the consumer safety labels, food nutritional labels, a material safety data sheet or product spec sheets.

There has been a dramatic rise in the development and adoption of machine learning (ML) and artificial intelligence (AI) during recent years. Further advances in generative AI employ large language models (LLMs) as a core component. However, the many models used in those platforms are increasingly complex and difficult to understand. Even model developers sometimes struggle to fully understand and describe the ways a given model behaves. This complexity has created serious questions about core business values such as transparency, ethics and accountability. Common questions include the following:

What is a particular ML model used for?
How does a particular ML model work or perform?
What training data was used to train the model (and what were the results)?

First proposed by Google in 2018, the model card is a means of documenting vital elements of a ML model so users -- including AI designers, business leaders and ML end users -- can readily understand the intended use cases, characteristics, behaviors, ethical considerations, and the biases and limitations of a particular ML model.

As late as 2024, there are no current legislative or regulatory requirements to produce or provide model card documentation with ML models. Similarly, there are no currently established standards in model card format or content. However, major ML developers have spearheaded the adoption of model card documentation as a way of demonstrating responsible AI development, and adopters can find model cards for major platforms such as Meta Llama, Google face detection and OpenAI GPT-3.

Benefits of using model cards for AI, ML and LLM projects

The rise of ML and AI is driving the need for transparency and responsible governance. Businesses must understand what ML models are for, how they work, how they compare to other competitive models, how they're trained and their suitability for intended tasks.

Model cards are a tool that can address such concerns which readily impact governance and regulatory issues for the business. Model cards can provide a range of important benefits to ML and AI projects including the following:

Model selection. By adopting a consistent standard for format and content, model cards can help ML and AI developers evaluate and select the most appropriate ML model for the project. Model cards can help whittle down an initial list of ML candidates, focus testing and evaluation on the most appropriate models, and accelerate final selection of the best ML model for the task at hand.
Model behavior and performance. ML models are not perfect, and limitations in the training data sets or biases in the training methodologies can impact the model's behavior when dealing with real-world data. Model cards typically include known limitations in data and training, allowing ML model users to see and address such limitations through additional training and data. This can help the business reduce errors and biases in the ML model and improve AI project performance and outcomes.
Continued AI improvements. The details and information provided by ML model cards help businesses select the most appropriate models and allow ML model developers to compare their models to others to make platforms more capable and competitive. This leads to more innovation and improvements to ML models and the resulting ML and AI platforms that use those models.
Business governance. The old axiom "you can't manage what you can't see" is entirely appropriate for modern ML models. A business that employs opaque or incomprehensible ML models cannot understand or explain the behavior of their developing ML or AI project-- risking serious business continuance, business governance and regulatory compliance violations. ML model cards are a means for the business to demonstrate an understanding of the underlying ML models that they select and use.
Business transparency and ethics. ML and AI technologies have an impact on society -- both in the collected and used data and in the ways that data is processed to make decisions. The documentation and details provided in ML model cards can help a business manage responsible AI development and address social concerns regarding the organization's use of AI and underlying data.

7 key sections of a model card

Labels and other informational summaries are generally most effective when they allow comparing similar products side by side using comparable content and formats. However, the information presented on an ML model card can vary. Unlike highly regulated informational displays -- such as food nutritional labeling -- there are no current standards to govern the information or formatting included on ML model cards.

ML models can vary dramatically in their scope, purpose and capabilities, so this makes it hard to regulate. For example, an ML model developed to aid in medical diagnosis can be distinctly different than an ML model created to run analytics on retail sales operations, or a complex LLM used in an AI construct. Consequently, ML model developers largely use their own discretion to determine what information to include, and how that information should be presented. Yet, as leading technology firms develop ML/AI platforms and document those offerings through model cards, some de facto documentation standards are taking shape. Model cards should include the following:

1. Basic details

This first section of a model card is typically the introduction to the model which can outline the model's essential details including the model's name, version, revision list, a brief general description of the model, business or developer details and contact information, and licensing details or limits.

2. Use case details

This section describes the intended uses, use cases and users for the model. For example, a section on use cases may describe uses in object detection, facial detection or medical diagnoses. This section may also include caveats, use limitations or uses deemed out of scope. For example, a model intended for object detection may detail input from photos or video; output including detection of a specified number of object classes; and other output data such as object bounding box coordinates, knowledge graph ID, object description and confidence score.

3. Architectural details

This section describes the overall design of the model and any underlying hardware back end that runs the model and hosts related data. Readers can refer to the model card to understand the design elements or underlying technologies that make the model work. For the object detection model example, the model card may describe an architecture including a single image detector model with a Resnet 101 backbone and a feature pyramid network feature map.

4. Training details

This section outlines, describes or summarizes the data used in model training; where and when the data was obtained; and any statistical distribution of key factors in the data which may allow for inadvertent bias. Since training data may be proprietary to the model's developers, training details may be deliberately limited or protected by a separate confidentiality agreement. Training details may also describe training methodologies employed with the model.

5. Performance details

This section outlines details related to the model's performance measured against a test data set, not a training data set, as well as details about the test data set itself. For the object detection model example, performance metrics included on the model card may note the use of both Google's internal image data set as well as an open source image set as test data and the number of object classes the model can detect in each data set. Additionally, performance details may outline reported metrics including the precision and accuracy of the object detection. More sophisticated models may utilize other detailed metrics to measure performance.

6. Limitation details

A key segment of any model card is the section describing limitations, possible biases or variable factors that might affect the model's performance or output. For the object detection model example, known limitations may include factors such as object size, clutter, lighting, blur, resolution and object type since the model can't recognize everything.

7. Business details

This final segment of a model card is often dedicated to business-related details including information about the model's developers, detailed contact, support and licensing information, fairness/privacy and usage information, suggestions for model monitoring, any relevant assessment of impacts to individuals or society, and other ethical or potential legal concerns related to the model's usage.

Examples of model cards

As leading technology organizations build ML and AI platforms, their work on model cards and other documentation has provided a standard for other ML firms to follow. Today there are many examples of ML model cards to review including the following major examples:

Google's face detection model card.
Meta's Llama model card.
OpenAI's GPT-3 model card.

There are also more standardized tools for model card creation, as well as model card repositories, such as these examples:

GitHub hosts a template for creating ML model cards.
Google offers a model card toolkit available through GitHub.
Hugging Face offers a tool to generate model cards.

Both GitHub and Hugging Face provide a repository of model cards which are available for review and study, offering model card examples across many different model types, purposes and industry segments.

Continue Reading About model card in machine learning

The best large language models

AI regulation: What businesses need to know

How do LLMs like ChatGPT work?

Generative AI ethics: The biggest concerns and risks

Compare large language models vs. generative AI

Search Networking

What is fiber to the home (FTTH)?
Fiber to the home (FTTH) is the installation and use of optical fiber from a central point to individual buildings to provide ...
What is an SDN controller (software-defined networking controller)?
A software-defined networking controller is an application in SDN architecture that manages Flow control for improved network ...
What is a network service provider (NSP)?
A network service provider (NSP), also known as a backbone provider, is a company that owns, operates and sells access to ...

Search Security

What is integrated risk management (IRM)?
Integrated risk management (IRM) is a set of proactive, businesswide practices that contribute to an organization's security, ...
What is COMSEC (communications security)?
Communications security (COMSEC) is the prevention of unauthorized access to telecommunications traffic or to any written ...
What is the Mitre ATT&CK framework?
The Mitre ATT&CK -- pronounced miter attack -- framework is a free, globally accessible knowledge base that describes the latest ...

Search CIO

What is the three lines model and what is its purpose?
The three lines model is a risk management approach to help organizations identify and manage risks effectively by creating three...
What is enterprise risk management (ERM)?
Enterprise risk management (ERM) is the process of planning, organizing, directing and controlling the activities of an ...
What is a procurement plan?
A procurement plan -- also called a procurement management plan -- is a document that is used to manage the process of finding ...

Search HRSoftware

What is a talent pool?
A talent pool is a database of job candidates who have the potential to meet an organization's immediate and long-term needs.
What is a 360 review?
A 360 review, or 360-degree review, is a continuous performance management strategy aimed at helping employees at all levels ...
What is a talent pipeline?
A talent pipeline is a pool of candidates who are ready to fill a position.

Search Customer Experience

What is direct marketing?
Direct marketing is a type of advertising campaign that seeks to elicit an action (such as an order, a visit to a store or ...
What is mobile CRM?
Mobile CRM, or mobile customer relationship management, enables those working in the field or remote employees to use mobile ...
What is field service management (FSM)?
Field service management (FSM) is a system of managing off-site workers and the resources they require to do their jobs ...

Close