2016 journal article
Perfcompass: Online performance anomaly fault localization and inference in infrastructure-as-a-service clouds
IEEE Transactions on Parallel and Distributed Systems, 27(6), 1742–1755.
2016 conference paper
RDE: Replay DEbugging for Diagnosing Production Site Failures
Proceedings of 2016 ieee 35th symposium on reliable distributed systems (srds), 327–336.
2012 conference paper
PREPARE: Predictive performance anomaly prevention for virtualized cloud systems
2012 ieee 32nd international conference on distributed computing systems (icdcs), 285–294.