Learning to Hear Anomalies: A Frequency-Driven Framework for Explainable Machine Fault Detection

Significance 

In the landscape of industrial machinery monitoring, anomaly detection serves as the crucial first line of defense against mechanical failure. As systems become increasingly complex and data-rich, traditional condition monitoring strategies—typically reliant on handcrafted features or static thresholds—have struggled to keep pace. These conventional methods often falter when faced with high-dimensional signals and nonlinear behaviors. To address these limitations, researchers have turned toward deep learning models, particularly autoencoders (AEs), for their ability to learn representations directly from raw data without the need for labeled anomalies. Yet, as effective as they are in capturing signal patterns, these models carry a major drawback: they operate largely as black boxes. Their internal workings, especially the latent transformations and learned features, remain opaque to users, making it difficult to validate or interpret the basis of an anomaly score—an issue that is increasingly problematic in high-stakes engineering environments. Recognizing this tension between performance and interpretability, the authors of this study were motivated to rethink how deep learning could be merged with a more physically grounded framework. They noted a structural parallel between autoencoders and the discrete wavelet transform (DWT): both perform decomposition and reconstruction of signals. But unlike AEs, wavelet transforms come with the benefit of mathematical transparency. Every operation—filtering, downsampling, upsampling—is well understood, with a clear frequency-domain interpretation. This insight prompted a question: could the frequency-domain interpretability of wavelets be combined with the learning capacity of neural networks to develop a more explainable approach to anomaly detection?

However, the integration is not straightforward. Wavelets inherently aim for perfect signal reconstruction, which runs counter to the core idea behind AE-based anomaly detection—namely, that a model trained only on normal signals should fail to reconstruct abnormal ones. To resolve this contradiction, a new research paper published in Mechanical Systems and Signal Processing—conducted by PhD candidate Zuogang Shang, Associate Professor Zhibin Zhao, Professor Ruqiang Yan, and Professor Xuefeng Chen from the Xi’an Jiaotong University—proposed a novel deep architecture grounded in wavelet theory but intentionally designed to sacrifice universal reconstruction in favor of class-specific fidelity. The goal was to create a model that reconstructs normal signals well, while systematically discarding frequency components irrelevant—or unique—to abnormal conditions.

The researchers tested against conventional 2-band wavelet transforms, the outcome was consistent with expectations: the traditional method struggled to produce a refined representation, with frequency components smearing across scales and lacking temporal precision. In contrast, M-band wavelet network (MWNet)—even without thresholding enabled—generated a sharper and more coherent time-frequency decomposition. The team also reinforced this observation through a denoising benchmark, where MWNet yielded lower mean squared error than a fixed 3-band wavelet setup, highlighting its data-driven adaptability and improved quantitative performance. The study then advanced to test MWNet’s central claim: its capacity to distinguish normal from abnormal signals based purely on reconstruction error. To do this, they designed two illustrative cases using synthetic data. The first involved what they called an “in-band” anomaly, where both normal and abnormal signals resided within the same frequency band but had different detailed content. Initially, without any sparsity constraint, MWNet treated both inputs similarly—reconstructing each with comparable accuracy, and thus failing to differentiate them. But when sparsity regularization was introduced, the model began prioritizing frequency components that were truly characteristic of normal signals. This selective focus caused the reconstruction of the abnormal input to degrade which confirmed that sparsity played a key role in refining MWNet’s sensitivity to signal deviations that would otherwise be masked. The second scenario explored “inter-band” anomalies—cases where the normal and abnormal signals occupied entirely different parts of the frequency spectrum. This is where the learnable thresholding mechanism came into play. Without it, MWNet reconstructed both signals reasonably well, offering little insight into which was anomalous. Once the threshold learning and its associated maximization constraint were activated, however, MWNet began selectively discarding frequency bands not essential to reconstructing normal signals. This change had a dramatic effect: abnormal signals relying on these suppressed bands were poorly reconstructed, resulting in high error. This experiment was more than a technical win—it crystallized the idea that anomaly detection doesn’t just hinge on what is learned, but on what is intentionally forgotten.

Afterward, the researchers tested MWNet on real-world data. One of the key datasets came from a fuel control system, where high-frequency pressure signals were gathered under varying degrees of wear. MWNet consistently outperformed both traditional and state-of-the-art models, including AE-MLP, AE-CNN, and DeSpaWN. Its advantage was particularly evident under mild wear conditions, where other models often fail to detect subtle changes. Here, MWNet’s frequency-based design allowed it to remain sensitive to minute deviations while discarding irrelevant spectral content. What made this result especially compelling was the interpretability of the filters themselves—many of which evolved into forms closely resembling classical low-pass or band-pass filters, reinforcing the sense that the network was learning in ways aligned with signal processing theory.

In another test using the planetary gear dataset, MWNet handled more complex fault types, including cracks, broken teeth, and surface spalling. Analysis of the model’s behavior showed that it consistently learned to amplify abnormal spectral regions while suppressing irrelevant ones. The learned thresholds reflected this dynamic, increasing in bands unique to fault signatures. As a result, the reconstruction of faulty signals deteriorated, creating clear anomalies in the error distribution and enhancing detection robustness across multiple fault modes. Lastly, the authors evaluated MWNet’s performance in a prognostic setting using the IMS bearing dataset, which simulates a real-life degradation process over time. Here, MWNet didn’t just detect the onset of failure—it mapped out the progression. Rather than producing erratic alarms, the model yielded a smooth, interpretable increase in reconstruction error, effectively mirroring the bearing’s physical deterioration. Notably, it identified early degradation well before any visible failure occurred.

The significance of the research work of Professors Ruqiang Yan and Xuefeng Chen and their colleagues in its thoughtful reimagining of how machines can be taught to recognize what’s normal—and, more importantly, what isn’t—by listening not just to the signals they produce, but to the underlying frequencies that define them. At a time when deep learning is often criticized for being opaque and difficult to trust in critical engineering applications, this work offers an alternative path—one that doesn’t abandon performance for interpretability, but weaves both together through a principled, wavelet-based framework.

What makes this contribution especially compelling is its ability to offer clarity in a field that has long leaned on black-box models. The MWNet does more than flag anomalies with high accuracy—it gives insight into why a signal was flagged. It achieves this not by inventing new forms of abstraction, but by grounding itself in the frequency domain, where engineers already have intuition and decades of domain knowledge. This alignment between model behavior and human understanding is rare and valuable. It allows practitioners to trace back reconstruction errors to specific frequency bands and thresholds, revealing whether an anomaly was due to a missing component, a foreign signature, or a shift in the signal’s energy distribution. Beyond its explanatory power, the study also introduces a flexible architecture that adapts to various forms of signal irregularities—whether subtle, in-band distortions or clear spectral deviations. This makes MWNet applicable across a wide range of real-world scenarios, from early-stage bearing faults to complex wear patterns in fuel injection systems. The fact that it generalizes well, across both synthetic and experimental datasets, reinforces its utility not just as a research prototype but as a tool with potential for deployment in production-grade monitoring systems. Another key implication is the introduction of learnable hard thresholding as a trainable mechanism—not only as a technical solution, but as a conceptual contribution. Thresholding, long used in signal denoising, is reinterpreted here as a dynamic, data-driven decision gate that determines which parts of the signal matter for defining “normal.” This rethinking could inspire future work across domains, from speech processing to medical diagnostics, where signal fidelity and anomaly sensitivity must co-exist.

Learning to Hear Anomalies: A Frequency-Driven Framework for Explainable Machine Fault Detection - Advances in Engineering

 

About the author

Zuogang Shang received the B.S. degree in mechanical engineering from Xi’an Jiaotong University, Xi’an, China, in 2020. He is currently working toward the Ph.D degree mechanical engineering in the Department of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China.

His current research is focused on signal processing, explainable deep learning, and machinery health monitoring. 

webpage linkshttps://github.com/albertszg/WaveletEnabledDeepLearning

About the author

Zhibin Zhao received the B.S. and the Ph.D. degrees in mechanical engineering from Xi’an Jiaotong University, Xi’an, China, in 2015 and 2020, respectively.

He is currently a Lecturer in mechanical engineering with the Department of Mechanical Engineering, Xi’an Jiaotong University.

His research interests include sparse signal processing and machine learning algorithms for machinery health monitoring and healthcare.

webpage linkshttps://gr.xjtu.edu.cn/en/web/zhaozhibin0124

About the author

Ruqiang Yan received the Ph.D. degree in mechanical engineering from the University of Massachusetts at Amherst, MA, USA, in 2007. He is currently a Professor of mechanical engineering with Xi’an Jiaotong University.

His research interests include data analytics, machine learning, and energy-efficient sensing and health diagnosis of large-scale, complex, dynamical systems.

Dr. Yan is a Fellow of ASME (2019) and IEEE (2022). His honors and awards include the IEEE Instrumentation and Measurement Society Technical Award in 2019 and Outstanding Service Award in 2022, and multiple best paper awards, such as Andrew P. Sage Best Transactions Paper Award. Dr. Yan serves as an Associate Editor-in-Chief of Chinese Journal of Mechanical Engineering. He was also the Editor-in-Chief of the IEEE Transactions on Instrumentation and Measurement (2022-2024).

webpage linkshttps://gr.xjtu.edu.cn/en/web/yanruqiang

About the author

Xuefeng Chen received the Ph.D. degree from Xi’an Jiaotong University, Xi’an, China, in 2004. He is currently a Professor of Mechanical Engineering with Xi’an Jiaotong University.

His current research interests include finite-element method, mechanical system and signal processing, diagnosis and prognosis for complicated industrial systems, smart structures, aero-engine fault diagnosis, and wind turbine system monitoring.

Dr. Chen was a recipient of the National Excellent Doctoral Dissertation of China in 2007, the National Science Fund for Distinguished Young Scholars in 2012, and a Chief Scientist of the National Key Basic Research Program of China (973 Program) in 2015, and the Second Award of Technology Invention of China in 2009, 2018 and 2023, respectively.

webpage linkshttps://gr.xjtu.edu.cn/en/web/chenxf

Reference

Zuogang Shang, Zhibin Zhao, Ruqiang Yan, Xuefeng Chen, M-band wavelet network for machine anomaly detection from a frequency perspective, Mechanical Systems and Signal Processing, Volume 216, 2024, 111489,

Go to Mechanical Systems and Signal Processing

Check Also

Redefining Strength and Lightness: Carbon Nanolattices Optimized Through Bayesian Design - Advances in Engineering

Redefining Strength and Lightness: Carbon Nanolattices Optimized Through Bayesian Design