Distributed computing infrastructures have become the fundamental platforms for many production systems. However, because they are inherently complex and shared among many workloads, these infrastructures are prone to system anomalies such as performance degradation, software hangs, and unexpected system halts. In this talk, Helen will present a set of automatic system anomaly prediction and diagnosis techniques that apply unsupervised machine learning to system metrics, logs, and traces. Her techniques …