X
Definition

What is automated machine learning (AutoML)?

Automated machine learning (AutoML) is the automated process that applies machine learning models to real-world problems. AutoML automates the selection, composition and tuning of algorithms. It simplifies complex tasks, such as model selection, feature engineering and hyperparameter optimization. Automating the machine learning process makes it more user-friendly and provides faster, more accurate outputs than hand-coded algorithms.

AutoML software platforms make machine learning more user-friendly and give organizations without a specialized data scientist or machine learning expert access to machine learning. These platforms can be built in-house, or they can be acquired from a third-party vendor and accessed through open source repositories such as GitHub.

AutoML platforms and tools are available as open source software and commercial offerings. They can be deployed on premises or in the cloud. These tools let businesses create and implement predictive models using streamlined, automated workflows.

How does the automated machine learning work?

AutoML is typically a platform or open source library that simplifies each step in the machine learning process, from handling a raw data set to deploying a practical machine learning model. In traditional machine learning, models are developed by hand, and each step in the process is handled as a separate task.

AutoML automatically locates and uses the optimal type of machine learning algorithm for a given task. Two concepts help achieve this:

  • Neural architecture search. This automates the design of neural networks. It helps AutoML models discover new architectures for problems that require them.
  • Transfer learning. Pretrained models apply what they've learned to new data sets, applying existing architectures to new problems that require them.

Neural architecture search and transfer learning enhance AutoML's adaptability by creating new model architectures and applying pretrained ones to new data. Users with minimal machine learning and deep learning knowledge can interface with the models through a coding language such as Python.

Generally, AutoML works through the following steps:

  • Problem definition. The user defines the problem, such as classification, regression or clustering.
  • Data preparation. This includes data cleaning, formatting and splitting the project into training and testing sets.
  • Feature engineering and selection. AutoML extracts meaningful features from raw data and selects the most relevant ones.
  • Model selection. The platform identifies the best-suited machine learning algorithms for the task.
  • Hyperparameter tuning. Hyperparameters are adjusted for better performance, optimizing the model.
  • Model training and evaluation. The model is trained, validated and assessed using selected performance metrics.
  • Model deployment. The final model is deployed, often using application programming interfaces or embedding it into applications.
  • Monitoring and maintenance. AutoML tools can track model performance and trigger retraining as needed.
Five steps in the machine learning process
Five AutoML steps speed the machine learning process.

AutoML use cases

AutoML use cases are similar to those of traditional machine learning. They span the finance, healthcare, agriculture, cybersecurity and marketing industries. Some of the most important use cases are the following:

  • Classification. This is where AutoML is used for image recognition and sentiment analysis.
  • Regression. AutoML is applied to pricing models, weather prediction and inventory forecasting.
  • Clustering. This approach is used in customer segmentation and market research.
  • Time-series forecasting. AutoML is used for stock prediction and sales forecasting.
  • Natural language processing. Chatbots, summarization and translation applications use AutoML.

Components of AutoML

The following are several important components of AutoML:

  • Data preprocessing. This automates handling of missing data values, encoding and normalization.
  • Feature engineering. This selects and transforms input variables into useful features.
  • Model selection. This chooses from a pool of candidate machine learning models.
  • Hyperparameter optimization. This tunes model parameters for optimal performance.
  • Ensembling. This combines multiple models for improved accuracy.
  • Deployment. This makes a model usable in production systems.
  • Monitoring. This tracks and evaluates ongoing model performance.

Benefits of AutoML

The main benefits of autoML are the following:

  • Efficiency. AutoML speeds up and simplifies the machine learning process and reduces the training time step when building machine learning models.
  • Cost savings. A faster, more efficient machine learning process means a company devotes less of its budget to maintaining that process.
  • Performance. AutoML algorithms tend to be more efficient than hand-coded models.
  • Visibility and accessibility. In some cases, AutoML makes it easier to track how an algorithm delivers its output, providing more visibility and accessibility than traditional machine learning.

AutoML challenges

There are several challenges to AutoML. The most important include the following:

  • Overreliance. The main challenge of AutoML is the temptation to view it as a replacement for human knowledge and judgment. AutoML is designed to perform rote tasks efficiently, freeing employees for more complex and novel tasks. AutoML handles tasks such as monitoring, analysis and problem detection faster than if a human did them. However, people still must assess and supervise the model.
  • Novelty. AutoML is a new field, and some of the most popular tools aren't yet fully developed. They might not support all algorithms or advanced learning techniques either.
  • Limited customization. Advanced users may find customization constrained by predefined templates.
  • Lack of transparency. Some platforms function as black boxes, offering limited interpretability.

AutoML tools and platforms

Tools such as Auto-sklearn and AutoKeras are open source. They can be run on local infrastructure, and users avoid the costs of proprietary cloud services. They rely on known architectures and data they've already seen, and they support classification and regression techniques, among other tasks.

Tools such as Google Cloud AutoML and Microsoft Azure Machine Learning are proprietary cloud platforms that offer scale. They use recurrent neural networks, convolutional neural networks, long short-term memory and other machine learning models. However, they incur costs associated with cloud services.

The future of AutoML

AutoML is expected to evolve rapidly, with a focus on greater transparency, scalability and usability across industries. One key development is increased explainability, which will let data scientists and business users better understand how automated machine learning models make decisions. AutoML platforms will also expand their capabilities to handle unstructured data, including text, images and video. This will broaden their applicability in the healthcare, legal and media industries.

Edge deployment is another growing area, where lightweight AutoML models can be trained and run on edge devices for real-time analytics. AutoML is also being integrated into enterprise environments, providing end-to-end approaches that align with existing infrastructure and workflows. As large businesses increasingly adopt AI for predictive tasks, the role of AutoML as a bridge between data and decision-making is expected to continue growing in ways that support both expert users and those with minimal machine learning background.

Learn more about the top machine learning and AI trends.

Continue Reading About What is automated machine learning (AutoML)?

Dig Deeper on AI technologies