Metalearning using structure-rich pipeline representations for improved AutoML Online publication date: Mon, 27-Feb-2023
by Brandon Schoenfeld; Kevin Seppi; Christophe Giraud-Carrier
International Journal of Data Analysis Techniques and Strategies (IJDATS), Vol. 14, No. 4, 2022
Abstract: Automatic machine learning (AutoML) systems have been shown to perform better when they learn from past experience. Examples include Auto-sklearn, which warm-starts the ML pipeline search using existing programs known to perform well on 'similar' tasks, and AlphaD3M, which uses online reinforcement learning to search the ML pipeline space. These metalearning approaches, as well as many others, depend on simplifying assumptions about the pipeline search space and/or the pipeline representation. Here, we attempt to extend the applicability of AutoML by relaxing such simplifications. Using a sizable metadataset of 194 classification tasks and 4,592 pipelines, we show that using pipeline metadata, including the underlying DAG structure, leads to better estimates of pipeline performance and to more robust rankings of pipelines.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Analysis Techniques and Strategies (IJDATS):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com