Intelligent Modeling of AA-UHPC: Active Learning-Enhanced Stacked ML for Sustainable Concrete Design

Significance 

Concrete remains the backbone of modern infrastructure, yet it now faces growing environmental scrutiny. While ultra-high-performance concrete (UHPC) has opened up new frontiers in structural engineering with its exceptional mechanical properties and durability, it’s impossible to ignore the environmental consequences that come bundled with its use. Much of UHPC’s strength is rooted in its high content of ordinary Portland cement (OPC), a material whose production accounts for a significant portion of global CO₂ emissions. One alternative that has drawn increasing interest is alkali-activated concrete, especially alkali-activated UHPC (AA-UHPC), which substitutes OPC with industrial byproducts like fly ash and ground granulated blast-furnace slag (GGBS). These materials don’t just reduce the carbon footprint; they also perform surprisingly well when properly formulated. That said, designing with AA-UHPC isn’t as straightforward as swapping ingredients. Its behavior is highly sensitive to shifts in mix composition and processing, from activator ratios to curing temperatures. Even experienced materials scientists often find it difficult to predict outcomes without extensive and often tedious lab work.

To address this, a recent paper published in Archives of Civil and Mechanical Engineering, led by Professor Doo-Yeol Yoo from the Department of Architecture and Architectural Engineering at Yonsei University together with Dr. Farzin Kazemi and Professor Robert Jankowski from Gdańsk University of Technology and Dr. Torkan Shafighfard from the Polish Academy of Sciences, starts from the observation that while there is a fair amount of experimental data on AA-UHPC, the field still lacks reliable tools to extract predictive conclusions from it. Rather than continuing down the path of labor-intensive trial and error, the team explored whether machine learning, specifically a stacked model augmented with active learning, could offer a more efficient way forward. Their idea was to create a system that could identify the most informative data points, adapt its internal logic, and improve as it learned. The team compiled 284 distinct AA-UHPC mix designs from 26 peer-reviewed studies. These weren’t cherry-picked examples but a broad and intentionally messy collection, reflecting diverse material combinations, different curing strategies, and varied dosages of key ingredients like fly ash, GGBS, silica fume, and alkaline activators. What unified these data points was a common endpoint: compressive strength. Everything else (fiber aspect ratio, water-to-binder ratio, even curing temperature) varied widely, and that variability was central to the study’s purpose. Instead of repeating these tests experimentally, the authors used machine learning both to build a predictive tool and to see whether they could train a system that learned where its own uncertainties were. They used a stacked model architecture with active learning layered on top, which meant the model wasn’t just fed a static dataset. It was allowed to iteratively select the most “informative” samples, those that would best improve its predictions, and retrain itself over time. In materials science, where data is often scarce and expensive to generate, that approach provides a practical edge.
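
The select-and-retrain loop described above can be illustrated with a small, self-contained sketch. This is not the authors' stacked model: a committee of bootstrap-trained straight-line fits stands in for the stacked learner, only one feature (water-to-binder ratio) is used, and `measured_strength` is a hypothetical synthetic function standing in for a lab test. The point is only to show pool-based active learning, where the next sample to label is the candidate on which the committee disagrees most.

```python
import random
import statistics


def measured_strength(w_b, rng):
    """Hypothetical stand-in for a lab test: strength (MPa) vs water-to-binder ratio."""
    return 200.0 - 250.0 * w_b + rng.gauss(0.0, 2.0)


def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; returns (a, b)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    # Guard against a degenerate resample in which every x is the same.
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx if sxx > 1e-12 else 0.0
    return my - b * mx, b


def fit_committee(xs, ys, n_members, rng):
    """Fit each committee member on a bootstrap resample of the labeled data."""
    members = []
    for _ in range(n_members):
        idx = [rng.randrange(len(xs)) for _ in range(len(xs))]
        members.append(fit_line([xs[i] for i in idx], [ys[i] for i in idx]))
    return members


def predict(members, x):
    """Return (mean prediction, disagreement) across committee members."""
    preds = [a + b * x for a, b in members]
    return statistics.fmean(preds), statistics.pstdev(preds)


def run_active_learning(n_initial=5, n_queries=10, n_members=10, seed=0):
    rng = random.Random(seed)
    # Unlabeled candidate mixes: 40 water-to-binder ratios between 0.15 and 0.40.
    pool = [0.15 + 0.25 * i / 39 for i in range(40)]
    rng.shuffle(pool)
    labeled_x = [pool.pop() for _ in range(n_initial)]
    labeled_y = [measured_strength(x, rng) for x in labeled_x]

    for _ in range(n_queries):
        members = fit_committee(labeled_x, labeled_y, n_members, rng)
        # Query the candidate on which the committee disagrees most.
        query = max(pool, key=lambda x: predict(members, x)[1])
        pool.remove(query)
        labeled_x.append(query)
        labeled_y.append(measured_strength(query, rng))  # "run the experiment"

    # Final retraining on everything labeled so far.
    members = fit_committee(labeled_x, labeled_y, n_members, rng)
    return members, labeled_x, pool


members, labeled_x, pool = run_active_learning()
mean, spread = predict(members, 0.25)
print(f"Predicted strength at w/b = 0.25: {mean:.1f} MPa (committee spread {spread:.2f})")
```

In a real workflow the committee would be replaced by the trained model's own uncertainty estimate, and each query would correspond to an actual mix that has to be cast and tested.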

Of course, before any model could be trained, the data had to be cleaned. They used scaling and imputation to manage missing values and outliers—an unavoidable reality when pulling from multiple sources. Then, to interpret how the model made its decisions, they applied Shapley values. What came out wasn’t entirely surprising: flowability, NaOH content, curing duration, and water content had the strongest influence on strength predictions. Still, some subtleties emerged. For instance, flow had a clearly positive effect, likely tied to better compaction. But the role of NaOH was less straightforward—too much seemed to weaken the structure, possibly by disturbing the setting reactions. When tested, the model delivered. Their best configuration—AL-Stacked ML-3—reached an accuracy close to 99%. That’s impressive on its own, but what really stood out was the external validation. On a completely independent dataset, one it hadn’t seen before, the model performed with the same confidence. That consistency suggests it’s not overfitting noise but capturing real, generalizable patterns. For researchers working on sustainable concrete design, this kind of tool could meaningfully cut down the trial-and-error cycle—and that’s no small win.
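
To make the Shapley-value idea concrete, here is a minimal sketch that computes exact Shapley values for a toy strength model. Everything in it is an assumption for illustration: `toy_strength`, its coefficients, the three features, and the baseline mix are hypothetical and are not taken from the paper. The negative quadratic term on NaOH simply mimics the reported pattern that too much activator can reduce strength.

```python
from itertools import combinations
from math import factorial


def toy_strength(mix):
    """Hypothetical strength model (MPa); coefficients are illustrative only."""
    flow, naoh, curing_days = mix["flow"], mix["naoh"], mix["curing_days"]
    return 60.0 + 0.2 * flow + 8.0 * naoh - 0.5 * naoh ** 2 + 1.5 * curing_days


def shapley_values(model, instance, baseline):
    """Exact Shapley values; features outside a coalition take their baseline value."""
    features = list(instance)
    n = len(features)

    def value(coalition):
        mix = {f: (instance[f] if f in coalition else baseline[f]) for f in features}
        return model(mix)

    phi = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude f.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = set(subset)
                total += weight * (value(coalition | {f}) - value(coalition))
        phi[f] = total
    return phi


baseline = {"flow": 200.0, "naoh": 4.0, "curing_days": 7.0}   # reference mix
instance = {"flow": 250.0, "naoh": 10.0, "curing_days": 28.0}  # mix to explain
phi = shapley_values(toy_strength, instance, baseline)

for name, contribution in phi.items():
    print(f"{name}: {contribution:+.2f} MPa")
# The contributions sum to the gap between the two predictions.
print(sum(phi.values()), toy_strength(instance) - toy_strength(baseline))
```

Exact enumeration like this scales exponentially with the number of features, which is why practical tools approximate Shapley values for models with many inputs.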

What stands out most in the work of Professor Doo-Yeol Yoo and colleagues, at least from a research perspective, is its potential to change how we approach the design of advanced concretes, especially those in the alkali-activated category. The field has, for some time, acknowledged the benefits of AA-UHPC in terms of performance and sustainability. Yet its complexity has been a persistent barrier. You can’t just swap in a few ingredients and expect reliable results. The material is sensitive, and even small changes in mix proportions or curing conditions can shift its behavior dramatically. That unpredictability makes it tough to scale. What this study offers is a practical workaround, not by simplifying the material but by tackling the design process itself. Rather than relying entirely on traditional experimental methods, which are expensive and slow, the researchers turned to machine learning, more specifically a stacked model guided by active learning. In doing so, they developed a system that doesn’t just predict compressive strength; it learns which variables matter most and adapts as more data becomes available. That’s a big step toward computationally assisted mix design. In materials labs, it’s easy to underestimate how much time is spent on trial and error: preparing batches, waiting on curing cycles, testing, and then tweaking based on partial intuition. And yet, even after all that effort, the outcome isn’t always conclusive. What the team demonstrates here is that if we mine the data we already have with enough precision and context, we can reduce that guesswork significantly. There are wider implications too, especially when it comes to sustainability. AA-UHPC has enormous potential to reduce emissions, but its adoption has been mostly limited to research labs or high-end projects. By creating an accessible predictive tool, this work effectively lowers the entry threshold for engineers who may not have the resources to run dozens of test mixes. That kind of accessibility could accelerate the shift toward greener infrastructure, particularly in regions where cost and material availability are constraints. Lastly, the study does a commendable job addressing one of the main critiques of ML in engineering: interpretability. Through Shapley values and visualization tools, it becomes possible not only to see the outcome but also to understand why a particular prediction was made. In safety-critical fields, that kind of clarity isn’t just useful; it’s essential.

About the author

Doo-Yeol Yoo is a Professor of Architecture and Architectural Engineering at Yonsei University in Seoul, Korea. He earned his B.S. and Ph.D. degrees in the Department of Civil, Environmental, and Architectural Engineering from Korea University in Seoul, South Korea. He also served as a post-doctoral researcher at the University of British Columbia (UBC), Vancouver, BC, Canada. His research interests include the development of cementless eco-friendly ultra-high-performance concrete, performance enhancement through novel fiber developments, and the achievement of ultra-high ductility in cement-based composites with a strain capacity exceeding 10%.

His total number of citations is 14,742 and his h-index is 66 (Scopus). Over the past decade, he has published over 275 peer-reviewed international journal papers and 1 book chapter and holds 8 domestic patents in the fields of construction materials and structures. He has 9 highly cited papers (WoS), and 13 of his papers in Elsevier journals have been selected as among the most cited and downloaded papers. He serves on the editorial board of six international journals, including Cement and Concrete Composites with an impact factor of 10.8. He is ranked #53 in the world and #1 in Korea in the field of Building & Construction in terms of c-score (Elsevier). He has received several prestigious awards: World’s Top 2% Scientists 2022~2024 (Stanford University-Elsevier); Member of Young Korean Academy of Science and Technology; The 25th Young Scientist Award (Ministry of Science and ICT, Korea); HYU Young Researcher Awards (Hanyang University); Best Paper Award, Int. J. Concr. Struct. Mater. (Springer); Ministry’s Commendation (Ministry of Education, Korea).

About the author

Robert Jankowski is a Professor of Civil Engineering at Gdansk University of Technology, Poland. He was a student of Gdansk University of Technology, Poland (MSc studies, 1987-1991 and 1992-1993), University of Sheffield, England (BSc studies, 1991-1992), University of Roskilde, Denmark (MSc course, 1993) and University of Tokyo, Japan (PhD studies, 1994-1997). His research interests are mainly related to earthquake engineering, dynamics of metal structures and artificial intelligence in civil engineering.

His total number of citations is 5,058 and h-index is 43 (Scopus). He is the author/co-author of over 350 scientific publications and holds 3 patents in the field of civil engineering. He holds 6 highly cited papers (WoS). He serves on the editorial board of a number of international journals. He has received several prestigious awards: World’s Top 2% Scientists 2022~2024 (Stanford University-Elsevier); Award of the Ministry of Environmental Protection, Natural Resources and Forestry of Poland (1994); Research scholarship of the Foundation for Polish Science (1999); Award of the Department of Technical Sciences of the Polish Academy of Sciences (2008); Prize of the Warsaw Branch of the Polish Society of Theoretical and Applied Mechanics (2015).

About the author

Farzin Kazemi earned his Ph.D. in the Department of Building Engineering, Faculty of Civil and Environmental Engineering, Gdansk University of Technology, Gdansk, Poland. He also served as a visiting researcher at University College London (UCL), London, UK, the University of Naples Federico II (UNINA), Naples, Italy, and the National Technical University of Athens (NTUA), Athens, Greece. His research interests include seismic retrofitting of steel and reinforced concrete (RC) structures, seismic performance and failure probability assessment, machine learning methods, data science, and the development of novel predictive tools for estimating different engineering demands. His total number of citations is 1,439 and his h-index is 22 (Scopus). Over the past decade, he has published 64 peer-reviewed international journal papers and holds 9 highly cited papers (WoS).

About the author

Torkan Shafighfard is a research, insight, and data analyst in the Science System Investment Performance branch of the Ministry of Business, Innovation and Employment in Wellington, New Zealand.

He earned his B.Sc. in Mechanical Engineering from the University of Tabriz, Iran, followed by an M.Sc. in Manufacturing Engineering from Sabanci University, Istanbul, Turkey, and a Ph.D. in Mechanical/Material Engineering from the Polish Academy of Sciences, Gdansk, Poland. He also served as a researcher at the Kordsa Composite Technologies Center of Excellence in Istanbul. His research interests include the development of machine learning models, engineering applications of artificial intelligence, data analysis, finite element analysis, composite structures, and additive manufacturing. He has published 18 peer-reviewed international journal papers and 1 book chapter. He also serves as a reviewer for various prestigious journals.

Reference

Kazemi, F., Shafighfard, T., Jankowski, R. et al. Active learning on stacked machine learning techniques for predicting compressive strength of alkali-activated ultra-high-performance concrete. Arch. Civ. Mech. Eng. 25, 24 (2025). https://doi.org/10.1007/s43452-024-01067-5
