Works (3)

2016 conference paper

FlipSphere: A software-based DRAM error detection and correction library for HPC

Ieee acm international symposium on distributed simulation and real-time, 19–28.

By: D. Fiala, F. Mueller & K. Ferreira

Source: NC State University Libraries
Added: August 6, 2018

2012 conference paper

Combining partial redundancy and checkpointing for HPC

2012 ieee 32nd international conference on distributed computing systems (icdcs), 615–626.

By: J. Elliott, K. Kharbas, D. Fiala, F. Mueller, K. Ferreira & C. Engelmann

Source: NC State University Libraries
Added: August 6, 2018

2012 conference paper

Detection and correction of silent data corruption for large-scale high-performance computing

International conference for high performance computing networking.

By: D. Fiala, F. Mueller, C. Engelmann, R. Riesen, K. Ferreira & R. Brightwell

Source: NC State University Libraries
Added: August 6, 2018