Anwesha Das

Works (5)

Updated: July 5th, 2023 15:34

2021 article

Systemic Assessment of Node Failures in HPC Production Platforms

Das, A., Mueller, F., & Rountree, B. (2021, May 1).

By: A. Dasⁿ, F. Muellerⁿ & B. Rountree ^*

author keywords: Root Cause; Node Failures; Holistic Analysis

topics (OpenAlex): Software System Performance and Reliability; Cloud Computing and Resource Management; Distributed systems and fault tolerance

TL;DR: It is shown that external environmental influence is not strongly correlated with node failures in terms of the root cause, and lead time enhancements are feasible for nodes showing fail slow characteristics. (via Semantic Scholar)

10.1109/IPDPS49936.2021.00035

Find Text @ NCSU

UN Sustainable Development Goals Color Wheel

UN Sustainable Development Goal Categories

11. Sustainable Cities and Communities (Web of Science)

Source: Web Of Science

Added: October 4, 2021

2020 article

Aarohi: Making Real-Time Node Failure Prediction Feasible

Das, A., Mueller, F., & Rountree, B. (2020, May 1).

By: A. Dasⁿ, F. Muellerⁿ & B. Rountree ^*

author keywords: Online Prediction; HPC; Node Failures; Parsing

topics (OpenAlex): Software System Performance and Reliability; Network Security and Intrusion Detection; Software Reliability and Analysis Research

TL;DR: This work tackles online anomaly prediction in computing systems by exploiting context free grammar-based rapid event analysis and presents the framework Aarohi, which describes an effective way to predict failures online. (via Semantic Scholar)

10.1109/IPDPS47924.2020.00115

Find Text @ NCSU

Source: Web Of Science

Added: June 10, 2021

2018 article

Desh

Das, A., Mueller, F., Siegel, C., & Vishnu, A. (2018, June 11).

By: A. Dasⁿ, F. Muellerⁿ, C. Siegel & A. Vishnu ^*

author keywords: LSTM; Failure Prediction; Log Mining; HPC; Node Failures; Lead Times; Anomaly Detection; Deep Learning

topics (OpenAlex): Software System Performance and Reliability; Software Reliability and Analysis Research; Anomaly Detection Techniques and Applications

TL;DR: This work aims to predict node failures that occur in supercomputing systems via long short-term memory (LSTM) networks that exploit recurrent neural networks (RNNs), and identifies failure indicators with enhanced training and classification for generic applicability to logs from operating systems and software components without the need to modify them. (via Semantic Scholar)

10.1145/3208040.3208051

Find Text @ NCSU

Source: Web Of Science

Added: April 2, 2019

2018 article

KeyValueServe^†: Design and performance analysis of a multi‐tenant data grid as a cloud service

Das, A., Iyengar, A., & Mueller, F. (2018, January 17). Concurrency and Computation Practice and Experience.

By: A. Dasⁿ, A. Iyengar ^* & F. Muellerⁿ

author keywords: cloud computing; data-grid; in-memory; key-value store; multi-tenancy; NoSQL; performance; quality of service

topics (OpenAlex): Cloud Computing and Resource Management; Caching and Content Delivery; Distributed and Parallel Computing Systems

TL;DR: This paper presents KeyValueServe, a low overhead cloud service with features aiding resource management that can efficiently provide services to tenants without degrading performance, and indicates that a Hazelcast cluster can get congested with multiple concurrent connections when processing client requests, resulting in poor performance. (via Semantic Scholar)

10.1002/cpe.4424

Find Text @ NCSU

open access via doi.org (publisher)

Source: Web Of Science

Added: August 6, 2018

2016 article

Performance Analysis of a Multi-tenant In-Memory Data Grid

Das, A., Mueller, F., Gu, X., & Iyengar, A. (2016, June 1).

By: A. Dasⁿ, F. Muellerⁿ, X. Guⁿ & A. Iyengar

topics (OpenAlex): Cloud Computing and Resource Management; Distributed and Parallel Computing Systems; Caching and Content Delivery

TL;DR: This study suggests that processing increasing number of client requests spawning fewer number of threads help improve performance, and uncovers scenarios of performance degradation followed by optimized performance via end-point multiplexing. (via Semantic Scholar)

10.1109/cloud.2016.0144

Find Text @ NCSU

UN Sustainable Development Goal Categories

9. Industry, Innovation and Infrastructure (OpenAlex)

Source: NC State University Libraries

Added: August 6, 2018

Citation Index includes data from a number of different sources. If you have questions about the sources of data in the Citation Index or need a set of data which is free to re-distribute, please contact us.

Certain data included herein are derived from the Web of Science© and InCites© (2026) of Clarivate Analytics. All rights reserved. You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Anwesha Das

Works (5) 1 open access

Works (5)