Fusing Log and Metric Streams Through Contrastive Representation Learning for System Anomaly Detection

Sébastien Laurent; Anja Bergström

doi:10.71465/ajml3565

Vol. 7 No. 1 (2026), Articles

Vol. 7 No. 1 (2026)

Fusing Log and Metric Streams Through Contrastive Representation Learning for System Anomaly Detection

Articles

Published 2026-01-31

Sébastien Laurent⁺⁻
Anja Bergström ⁺⁻

https://doi.org/10.71465/ajml3565

Sébastien Laurent

Department of Information Technology, Uppsala University, Sweden

Anja Bergström

Department of Information Technology, Uppsala University, Sweden

PDF

Keywords

System Anomaly Detection
Log Analysis
Performance Metrics
Contrastive Learning
Multimodal Fusion
Deep Learning

Abstract

Modern distributed systems generate massive volumes of heterogeneous monitoring data, primarily consisting of unstructured log messages and structured performance metrics. Traditional anomaly detection approaches analyze these data streams independently, failing to capture critical cross-modal correlations that indicate system failures. This paper proposes a novel multimodal fusion framework that leverages contrastive representation learning to unify log and metric analysis forcomprehensive system anomaly detection. Our approach employs dual encoders to extract semantic representations from log sequences and temporal patterns from metric time series, then aligns these representations in a shared embedding space through contrastive learning objectives. The framework learns to maximize agreement between temporally correlated log-metric pairs while distinguishing anomalous patterns from normal system behavior. Extensive experiments on three production datasets including HDFS, OpenStack, and real-world AIOps systems demonstrate that our method achieves F1-scores exceeding 96%, outperforming single-modality baselines by substantial margins ranging from 12% to 18%. The learned representations enable early anomaly detection with improved interpretability, providing operators with actionable insights for rapid incident response. Our framework processes monitoring data with an average latency of 180 milliseconds, making it suitable for real-time production deployments

PDF

This work is licensed under a Creative Commons Attribution 4.0 International License.