Works (5)

Updated: July 5th, 2023 15:34

2021 article

Systemic Assessment of Node Failures in HPC Production Platforms

Das, A., Mueller, F., & Rountree, B. (2021, May 1).

By: A. Das n, F. Mueller n & B. Rountree*

author keywords: Root Cause; Node Failures; Holistic Analysis
topics (OpenAlex): Software System Performance and Reliability; Cloud Computing and Resource Management; Distributed systems and fault tolerance
TL;DR: It is shown that external environmental influence is not strongly correlated with node failures in terms of the root cause, and lead time enhancements are feasible for nodes showing fail slow characteristics. (via Semantic Scholar)
UN Sustainable Development Goals Color Wheel
UN Sustainable Development Goal Categories
Source: Web Of Science
Added: October 4, 2021

2020 article

Aarohi: Making Real-Time Node Failure Prediction Feasible

Das, A., Mueller, F., & Rountree, B. (2020, May 1).

By: A. Das n, F. Mueller n & B. Rountree*

author keywords: Online Prediction; HPC; Node Failures; Parsing
topics (OpenAlex): Software System Performance and Reliability; Network Security and Intrusion Detection; Software Reliability and Analysis Research
TL;DR: This work tackles online anomaly prediction in computing systems by exploiting context free grammar-based rapid event analysis and presents the framework Aarohi, which describes an effective way to predict failures online. (via Semantic Scholar)
Source: Web Of Science
Added: June 10, 2021

2018 article

Desh

Das, A., Mueller, F., Siegel, C., & Vishnu, A. (2018, June 11).

By: A. Das n, F. Mueller n, C. Siegel & A. Vishnu*

author keywords: LSTM; Failure Prediction; Log Mining; HPC; Node Failures; Lead Times; Anomaly Detection; Deep Learning
topics (OpenAlex): Software System Performance and Reliability; Software Reliability and Analysis Research; Anomaly Detection Techniques and Applications
TL;DR: This work aims to predict node failures that occur in supercomputing systems via long short-term memory (LSTM) networks that exploit recurrent neural networks (RNNs), and identifies failure indicators with enhanced training and classification for generic applicability to logs from operating systems and software components without the need to modify them. (via Semantic Scholar)
Source: Web Of Science
Added: April 2, 2019

2018 article

KeyValueServe: Design and performance analysis of a multi‐tenant data grid as a cloud service

Das, A., Iyengar, A., & Mueller, F. (2018, January 17). Concurrency and Computation Practice and Experience.

By: A. Das n, A. Iyengar* & F. Mueller n

author keywords: cloud computing; data-grid; in-memory; key-value store; multi-tenancy; NoSQL; performance; quality of service
topics (OpenAlex): Cloud Computing and Resource Management; Caching and Content Delivery; Distributed and Parallel Computing Systems
TL;DR: This paper presents KeyValueServe, a low overhead cloud service with features aiding resource management that can efficiently provide services to tenants without degrading performance, and indicates that a Hazelcast cluster can get congested with multiple concurrent connections when processing client requests, resulting in poor performance. (via Semantic Scholar)
Source: Web Of Science
Added: August 6, 2018

2016 article

Performance Analysis of a Multi-tenant In-Memory Data Grid

Das, A., Mueller, F., Gu, X., & Iyengar, A. (2016, June 1).

By: A. Das n, F. Mueller n, X. Gu n & A. Iyengar

topics (OpenAlex): Cloud Computing and Resource Management; Distributed and Parallel Computing Systems; Caching and Content Delivery
TL;DR: This study suggests that processing increasing number of client requests spawning fewer number of threads help improve performance, and uncovers scenarios of performance degradation followed by optimized performance via end-point multiplexing. (via Semantic Scholar)
UN Sustainable Development Goals Color Wheel
UN Sustainable Development Goal Categories
9. Industry, Innovation and Infrastructure (OpenAlex)
Source: NC State University Libraries
Added: August 6, 2018

Citation Index includes data from a number of different sources. If you have questions about the sources of data in the Citation Index or need a set of data which is free to re-distribute, please contact us.

Certain data included herein are derived from the Web of Science© and InCites© (2026) of Clarivate Analytics. All rights reserved. You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.