Frank Mueller Zhang, H., & Mueller, F. (2022). CLAIRE: Enabling Continual Learning for Real-time Autonomous Driving with a Dual-head Architecture. 2022 IEEE 25TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2022), pp. 51–60. https://doi.org/10.1109/ISORC52572.2022.9812816 Wilson, E., Mueller, F., & Pakin, S. (2022). Combining Hard and Soft Constraints in Quantum Constraint-Satisfaction Systems. SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS. https://doi.org/10.1109/SC41404.2022.00018 Cucinotta, T., Mueller, F., & Simmhan, Y. (2022, March). Guest editorial: Special issue on the 2020 IEEE symposium on real-time distributed computing (ISORC). JOURNAL OF SYSTEMS ARCHITECTURE, Vol. 124. https://doi.org/10.1016/j.sysarc.2022.102437 Behera, S., Wan, L., Mueller, F., Wolf, M., & Klasky, S. (2022). P-ckpt: Coordinated Prioritized Checkpointing. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), pp. 436–446. https://doi.org/10.1109/IPDPS53621.2022.00049 McDonald, B., & Mueller, F. (2022). T-SYS: Timed-Based System Security for Real-Time Kernels. 2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), pp. 247–258. https://doi.org/10.1109/ICCPS54341.2022.00029 Bahmani, A., Xing, Z., Krishnan, V., Ray, U., Mueller, F., Alavi, A., … Pan, C. (2021). Hummingbird: efficient performance prediction for executing genomic applications in the cloud. BIOINFORMATICS, 37(17), 2537–2543. https://doi.org/10.1093/bioinformatics/btab161 Wilson, E., Mueller, F., & Pakin, S. (2021). Mapping Constraint Problems onto Quantum Gate and Annealing Devices. PROCEEDINGS OF SECOND INTERNATIONAL WORKSHOP ON QUANTUM COMPUTING SOFTWARE (QCS 2021), pp. 110–117. https://doi.org/10.1109/QCS54837.2021.00016 Pan, X., & Mueller, F. (2021). NUMA-aware memory coloring for multicore real-time systems. JOURNAL OF SYSTEMS ARCHITECTURE, 118. https://doi.org/10.1016/j.sysarc.2021.102188 Fustero, J., Palmtag, S., & Mueller, F. (2021). Quantum Annealing Stencils with Applications to Fuel Loading of a Nuclear Reactor. 2021 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2021) / QUANTUM WEEK 2021, pp. 265–275. https://doi.org/10.1109/QCE52317.2021.00044 Das, A., Mueller, F., & Rountree, B. (2021). Systemic Assessment of Node Failures in HPC Production Platforms. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), pp. 267–276. https://doi.org/10.1109/IPDPS49936.2021.00035 Mittal, S., & Mueller, F. (2021). T-Pack: Timed Network Security for Real Time Systems. 2021 IEEE 24TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2021), pp. 20–28. https://doi.org/10.1109/ISORC52013.2021.00014 Das, A., Mueller, F., & Rountree, B. (2020). Aarohi: Making Real-Time Node Failure Prediction Feasible. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, pp. 1092–1101. https://doi.org/10.1109/IPDPS47924.2020.00115 Wang, T., Yu, X., Qiu, Z., Jin, G., & Mueller, F. (2020). BarrierFinder: recognizing ad hoc barriers. EMPIRICAL SOFTWARE ENGINEERING, 25(6), 4676–4706. https://doi.org/10.1007/s10664-020-09862-3 Wilson, E., Singh, S., & Mueller, F. (2020). Just-in-time Quantum Circuit Transpilation Reduces Noise. IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE20), pp. 345–355. https://doi.org/10.1109/QCE49297.2020.00050 Patil, O., Mueller, F., Ionkov, L., Lee, J., & Lang, M. (2020). Symbiotic HW Cache and SW DTLB Prefetching for DRAM/NVM Hybrid Memory. 2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), 1–8. https://doi.org/10.1109/MASCOTS50786.2020.9285963 Ferriter, K., Mueller, F., Bahmani, A., & Pan, C. (2020). VCFC: Structural and Semantic Compression and Indexing of Genetic Variant Data. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, pp. 200–203. https://doi.org/10.1109/BIBM49941.2020.9313221 Regan, M., Eastwood, B., Nagabhiru, M., & Mueller, F. (2019). Automatically Translating Quantum Programs from a Subset of Common Gates to an Adiabatic Representation. REVERSIBLE COMPUTATION (RC 2019), Vol. 11497, pp. 146–161. https://doi.org/10.1007/978-3-030-21500-2_9 Wang, T., Yu, X., Qiu, Z., Jin, G., & Mueller, F. (2019). BARRIERFINDER: Recognizing Ad Hoc Barriers. 2019 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2019), pp. 323–327. https://doi.org/10.1109/ICSME.2019.00049 Rezaei, A., Khetawat, H., Patil, O., Mueller, F., Hargrove, P., & Roman, E. (2019). End-to-End Resilience for HPC Applications. HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2019, Vol. 11501, pp. 271–290. https://doi.org/10.1007/978-3-030-20656-7_14 Wang, T., Jain, N., Beckingsale, D., Boehme, D., Mueller, F., & Gamblin, T. (2019). FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation. PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019). https://doi.org/10.1145/3337821.3337842 Khetawat, H., Atrey, A., Li, G., Mueller, F., & Pakin, S. (2019). Implementing NChooseK on IBM Q Quantum Computer Systems. REVERSIBLE COMPUTATION (RC 2019), Vol. 11497, pp. 209–223. https://doi.org/10.1007/978-3-030-21500-2_13 Patil, O., Ionkov, L., Lee, J., Mueller, F., & Lang, M. (2019). Performance characterization of a DRAM-NVM hybrid memory architecture for HPC applications using intel optane DC persistent memory modules. Proceedings of the International Symposium on Memory Systems - MEMSYS '19. Presented at the the International Symposium. https://doi.org/10.1145/3357526.3357541 Mueller, F., Byrd, G., & Dreher, P. (2019). Programming Quantum Computers: A Primer with IBM Q and D-Wave Exercises. PROCEEDINGS OF THE 24TH SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '19), pp. 451–451. https://doi.org/10.1145/3293883.3302578 Pan, X., & Mueller, F. (2019). The Colored Refresh Server for DRAM. 2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2019), pp. 27–34. https://doi.org/10.1109/ISORC.2019.00015 Pan, X., & Mueller, F. (2019). The Colored Refresh Server for DRAM. 2019 IEEE 40TH REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2019), pp. 146–153. https://doi.org/10.1109/RTSS46320.2019.00023 Gholkar, N., Mueller, F., & Rountree, B. (2019). Uncore Power Scavenger: A Runtime for Uncore Power Conservation on HPC Systems. PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS. https://doi.org/10.1145/3295500.3356150 Qian, T., & Mueller, F. (2018, November). A Failure Recovery Protocol for Software-Defined Real-Time Networks. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, Vol. 37, pp. 2222–2232. https://doi.org/10.1109/TCAD.2018.2857299 Bahmani, A., & Mueller, F. (2018). Chameleon: Online Clustering of MPI Program Traces. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), pp. 1102–1112. https://doi.org/10.1109/IPDPS.2018.00119 Kukreti, S., & Mueller, F. (2018). CloneHadoop: Process Cloning to Reduce Hadoop's Long Tail. 2018 IEEE/ACM 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING APPLICATIONS AND TECHNOLOGIES (BDCAT), pp. 11–20. https://doi.org/10.1109/BDCAT.2018.00011 Damschen, M., Mueller, F., & Henkel, J. (2018, November). Co-Scheduling on Fused CPU-GPU Architectures With Shared Last Level Caches. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, Vol. 37, pp. 2337–2347. https://doi.org/10.1109/TCAD.2018.2857042 Pan, X., & Mueller, F. (2018). Controller-Aware Memory Coloring for Multicore Real-Time Systems. 33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, pp. 584–592. https://doi.org/10.1145/3167132.3167196 Das, A., Mueller, F., Siegel, C., & Vishnu, A. (2018). Desh: Deep Learning for System Health Prediction of Lead Times to Failure in HPC. HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, pp. 40–51. https://doi.org/10.1145/3208040.3208051 Das, A., Iyengar, A., & Mueller, F. (2018). KeyValueServe(dagger): Design and performance analysis of a multi-tenant data grid as a cloud service. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 30(14). https://doi.org/10.1002/cpe.4424 Gholkar, N., Mueller, F., Rountree, B., & Marathe, A. (2018). PShiter: Feedback-based Dynamic Power Shiting within HPC Jobs for Performance. HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, pp. 106–117. https://doi.org/10.1145/3208040.3208047 Xu, H., & Mueller, F. (2018). Work-In-Progress: Making Machine Learning Real-Time Predictable. 2018 39TH IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2018), pp. 157–160. https://doi.org/10.1109/RTSS.2018.00029 Rezaei, A., Mueller, F., Hargrove, P., & Roman, E. (2017). DINO: Divergent node cloning for sustained redundancy in HPC. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 109, 350–362. https://doi.org/10.1016/j.jpdc.2017.06.010 Luo, X., Mueller, F., Carns, P., Jenkins, J., Latham, R., Ross, R., & Snyder, S. (2017). ScalaIOExtrap: Elastic I/O Tracing and Extrapolation. 2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), pp. 585–594. https://doi.org/10.1109/ipdps.2017.45 Bahmani, A., & Mueller, F. (2017). Scalable communication event tracing via clustering. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 109, 230–244. https://doi.org/10.1016/j.jpdc.2017.06.008 Gholkar, N., Mueller, F., & Rountree, B. (2016). A Power-aware Cost Model for HPC Procurement. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), pp. 1110–1113. https://doi.org/10.1109/ipdpsw.2016.35 Qian, T., Xu, H., Zhang, J. H., Chakrabortty, A., Mueller, F., & Xin, Y. F. (2016). A resilient software infrastructure for wide-area measurement systems. 2016 ieee power and energy society general meeting (pesgm). https://doi.org/10.1109/pesgm.2016.7741949 Lagadapati, M., Mueller, F., & Engelmann, C. (2016). Benchmark Generation and Simulation at Extreme Scale. 2016 IEEE/ACM 20TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), pp. 9–18. https://doi.org/10.1109/ds-rt.2016.18 Ramachandran, S., & Mueller, F. (2016). Distributed Job Allocation for Large-Scale Manycores. In Lecture Notes in Computer Science (pp. 404–425). https://doi.org/10.1007/978-3-319-41321-1_21 Ramachandran, S., & Mueller, F. (2016). Distributed job allocation for large-scale manycores. High performance computing, 9697, 404–425. Yagna, K., Patil, O., & Mueller, F. (2016). Efficient and Predictable Group Communication for Manycore NoCs. In Lecture Notes in Computer Science (pp. 383–403). https://doi.org/10.1007/978-3-319-41321-1_20 Yagna, K., Patil, O., & Mueller, F. (2016). Efficient and predictable group communication for manycore NoCs. High performance computing, 9697, 383–403. Bahmani, A., & Mueller, F. (2016). Efficient clustering for ultra-scale application tracing. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 98, 25–39. https://doi.org/10.1016/j.jpdc.2016.08.001 Elliott, J., Hoemmen, M., & Mueller, F. (2016). Exploiting data representation for fault tolerance. Journal of Computational Science, 14, 51–60. https://doi.org/10.1016/J.JOCS.2015.12.002 Fiala, D., Mueller, F., & Ferreira, K. B. (2016). FlipSphere: A Software-based DRAM Error Detection and Correction Library for HPC. 2016 IEEE/ACM 20TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), pp. 19–28. https://doi.org/10.1109/ds-rt.2016.27 Chandru, V., & Mueller, F. (2016). Hybrid MPI/OpenMP programming on the Tilera manycore architecture. 2016 International Conference on High Performance Computing & Simulation (HPCS 2016), 326–333. https://doi.org/10.1109/hpcsim.2016.7568353 Das, A., Mueller, F., Gu, X. H., & Iyengar, A. (2016). Performance analysis of a multi-tenant in-memory data grid. Proceedings of 2016 ieee 9th international conference on cloud computing (cloud), 956–959. https://doi.org/10.1109/cloud.2016.0144 Chandru, V., & Mueller, F. (2016). Reducing NoC and Memory Contention for Manycores. In Architecture of Computing Systems – ARCS 2016 (pp. 293–305). https://doi.org/10.1007/978-3-319-30695-7_22 Leon, L. S., Smith, R. C., Oates, W. S., & Miles, P. (2016). Sensitivity analysis for a quantum informed ferroelectric energy model. Proceedings of the asme conference on smart materials adaptive. https://doi.org/10.1115/smasis2016-9035 Bahmani, A., Sibley, A. B., Parsian, M., Owzar, K., & Mueller, F. (2016). SparkScore: Leveraging Apache Spark for Distributed Genomic Inference. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), pp. 435–442. https://doi.org/10.1109/ipdpsw.2016.6 Pan, X., Gownivaripalli, Y. J., & Mueller, F. (2016). TintMalloc: Reducing Memory Access Divergence via Controller-Aware Coloring. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), pp. 363–372. https://doi.org/10.1109/ipdps.2016.26 Luo, L., Edwards, J. R., Luo, H., & Mueller, F. (2015). A fine-grained block ILU scheme on regular structures for GPGPUs. COMPUTERS & FLUIDS, 119, 149–161. https://doi.org/10.1016/j.compfluid.2015.07.005 Shekhar, M., Ramaprasad, H., Sarkar, A., & Mueller, F. (2015). Architecture aware semi partitioned real-time scheduling on multicore platforms. REAL-TIME SYSTEMS, 51(3), 274–313. https://doi.org/10.1007/s11241-015-9221-4 Rezaei, A., & Mueller, F. (2015). DINO: Divergent Node Cloning for Sustained Redundancy in HPC. 2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015, pp. 180–183. https://doi.org/10.1109/cluster.2015.36 Shekhar, M., Ramaprasad, H., & Mueller, F. (2015). Evaluation of Memory Access Arbitration Algorithm on Tilera's TILEPro64 platform. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), pp. 1154–1159. https://doi.org/10.1109/hpcc-css-icess.2015.245 Qian, T., Mueller, F., & Xin, Y. (2015). Hybrid EDF Packet Scheduling for Real-Time Distributed Systems. PROCEEDINGS OF THE 2015 27TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS (ECRTS 2015), pp. 37–46. https://doi.org/10.1109/ecrts.2015.11 Zimmer, C., Bhat, B., Mueller, F., & Mohan, S. (2015). Intrusion Detection for CPS Real-Time Controllers. CYBER PHYSICAL SYSTEMS APPROACH TO SMART ELECTRIC POWER GRID, pp. 329–358. https://doi.org/10.1007/978-3-662-45928-7_12 Zimmer, C., & Mueller, F. (2015). NoCMsg: A Scalable Message-Passing Abstraction for Network-on-Chips. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 12(1). https://doi.org/10.1145/2701426 Xia, Y., Lou, J., Luo, H., Edwards, J., & Mueller, F. (2015). OpenACC acceleration of an unstructured CFD solver based on a reconstructed discontinuous Galerkin method for compressible flows. International Journal for Numerical Methods in Fluids, 78(3), 123–139. https://doi.org/10.1002/fld.4009 Panchamukhi, S. A., & Mueller, F. (2015). Providing task isolation via TLB coloring. 21st IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS 2015), 3–13. https://doi.org/10.1109/rtas.2015.7108391 Zimmer, C., & Mueller, F. (2015). Reliable and scalable communication for the power grid. Cyber Physical Systems Approach to Smart Electric Power Grid, 195–217. https://doi.org/10.1007/978-3-662-45928-7_8 Sarkar, A., Mueller, F., & Ramaprasad, H. (2015). Static Task Partitioning for Locked Caches in Multicore Real-Time Systems. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 14(1). https://doi.org/10.1145/2638557 Qian, T., Mueller, F., & Xin, Y. F. (2014). A real-time distributed hash table. 2014 IEEE 20th International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA). https://doi.org/10.1109/rtcsa.2014.6910537 Qian, T., Chakrabortty, A., Mueller, F., & Xin, Y. (2014). A real-time distributed storage system for multi-resolution virtual synchrophasor. 2014 IEEE PES General Meeting | Conference & Exposition. Presented at the 2014 IEEE Power & Energy Society General Meeting. https://doi.org/10.1109/pesgm.2014.6939832 Elliott, J., Hoemmen, M., & Mueller, F. (2014). Evaluating the Impact of SDC on the GMRES Iterative Solver. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM. https://doi.org/10.1109/ipdps.2014.123 Zimmer, C., & Mueller, F. (2014). NoCMsg: Scalable NoC-Based Message Passing. 2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), pp. 186–195. https://doi.org/10.1109/ccgrid.2014.19 Ananthakrishnan, S. K., & Mueller, F. (2014). ScalaJack: Customized Scalable Tracing with In-situ Data Analysis. In Lecture Notes in Computer Science (pp. 13–25). https://doi.org/10.1007/978-3-319-09873-9_2 Lagadapati, M., Mueller, F., & Engelmann, C. (2014). Tools for Simulation and Benchmark Generation at Exascale. In Tools for High Performance Computing 2013 (pp. 19–24). https://doi.org/10.1007/978-3-319-08144-1_2 Li, C., Yang, Y., Dai, H. W., Yan, S. G., Mueller, F., & Zhou, H. Y. (2014). Understanding the tradeoffs between software-managed vs. hardware-managed caches in GPUs. Ieee international symposium on performance analysis of systems and, 231–241. Zhang, Y., & Mueller, F. (2013). Autogeneration and Autotuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 24(3), 417–427. https://doi.org/10.1109/tpds.2012.160 Mueller, F. (2013, July). Best papers, IPDPS 2011. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, Vol. 73, pp. 939–939. https://doi.org/10.1016/j.jpdc.2013.05.001 Zhang, Y. P., & Mueller, F. (2013). HiDP: A hierarchical data parallel language. Proceedings of the 2013 ieee/acm international symposium on code generation and optimization (cgo), 171–181. Fiala, D., Ferreira, K. B., Mueller, F., & Engelmann, C. (2012). A Tunable, Software-Based DRAM Error Detection and Correction Library for HPC. In Euro-Par 2011: Parallel Processing Workshops (pp. 251–261). https://doi.org/10.1007/978-3-642-29740-3_29 Elliott, J., Kharbas, K., Fiala, D., Mueller, F., Ferreira, K., & Engelmann, C. (2012). Combining Partial Redundancy and Checkpointing for HPC. 2012 IEEE 32ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), pp. 615–626. https://doi.org/10.1109/icdcs.2012.56 Fiala, D., Mueller, F., Engelmann, C., Riesen, R., Ferreira, K., & Brightwell, R. (2012). Detection and correction of silent data corruption for large-scale high-performance computing. International conference for high performance computing networking. https://doi.org/10.1109/sc.2012.49 Zimmer, C., & Mueller, F. (2012). Low ContentionMapping of Real-Time Tasks onto a TilePro 64 Core Processor. 2012 IEEE 18TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS), pp. 131–140. https://doi.org/10.1109/rtas.2012.36 Budanur, S., Mueller, F., & Gamblin, T. (2012). Memory Trace Compression and Replay for SPMD Systems Using Extended PRSDs. COMPUTER JOURNAL, 55(2), 206–217. https://doi.org/10.1093/comjnl/bxr071 Wang, C., Mueller, F., Engelmann, C., & Scott, S. L. (2012). Proactive process-level live migration and back migration in HPC environments. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 72(2), 254–267. https://doi.org/10.1016/j.jpdc.2011.10.009 Wu, X., & Mueller, F. (2012). SCALAEXTRAP: Trace-Based Communication Extrapolation for SPMD Programs. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 34(1). https://doi.org/10.1145/2160910.2160914 Wu, X., Deshpande, V., & Mueller, F. (2012). ScalaBenchGen: Auto-Generation of Communication Benchmarks Traces. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), pp. 1250–1260. https://doi.org/10.1109/ipdps.2012.114 Mueller, F., Wu, X., Schulz, M., de Supinski, B. R., & Gamblin, T. (2012). ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale. In Applied Parallel and Scientific Computing (pp. 410–418). https://doi.org/10.1007/978-3-642-28145-7_40 Mueller, F., Wu, X., Schulz, M., Supinski, B. R., & Gamblin, T. (2012). ScalaTrace: Tracing, analysis and modeling of HPC codes at scale. Applied parallel and scientific computing, pt ii, 7134, 410–418. Sarkar, A., Mueller, F., & Ramaprasad, H. (2012). Static task partitioning for locked caches in multi-core real-time systems. Cases'12: proceedings of the 2012 ACM International Conference on Compilers, Architectures and Synthesis for Embedded Systems, 161–170. https://doi.org/10.1145/2380403.2380434 Zhang, Y., Mueller, F., Cui, X., & Potok, T. (2011). Data-intensive document clustering on graphics processing unit (GPU) clusters. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 71(2), 211–224. https://doi.org/10.1016/j.jpdc.2010.08.002 Bhat, B., & Mueller, F. (2011). Making DRAM refresh predictable. REAL-TIME SYSTEMS, 47(5), 430–453. https://doi.org/10.1007/s11241-011-9129-6 Sarkar, A., Mueller, F., & Ramaprasad, H. (2011). Predictable Task Migration for Locked Caches in Multi-Core Systems. ACM SIGPLAN NOTICES, 46(5), 131–140. https://doi.org/10.1145/2016603.1967696 Sarkar, A., Mueller, F., & Ramaprasad, H. (2011). Predictable task migration for locked caches in multi-core systems. LCTES 11: Proceedings of the ACM Sigplan/Sigbed 2011 Conference on Languages, Complilers, Tools and Theory for Embedded Systems, 131–140. https://doi.org/10.1145/1967677.1967696 Wu, X., & Mueller, F. (2011, August). ScalaExtrap: Trace-Based Communication Extrapolation for SPMD Programs. ACM SIGPLAN NOTICES, Vol. 46, pp. 113–122. https://doi.org/10.1145/2038037.1941569 Marathe, J., Thakkar, V., & Mueller, F. (2010). Feedback-directed page placement for ccNUMA via hardware-generated memory traces. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 70(12), 1204–1219. https://doi.org/10.1016/j.jpdc.2010.08.015 Mohan, S., Mueller, F., Root, M., Hawkins, W., Healy, C., Whalley, D., & Vivancos, E. (2010). Parametric Timing Analysis and Its Application to Dynamic Voltage Scaling. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 10(2). https://doi.org/10.1145/1880050.1880061 Ramaprasad, H., & Mueller, F. (2010). Tightening the Bounds on Feasible Preemptions. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 10(2). https://doi.org/10.1145/1880050.1880063 Scott, S. L., Engelmann, C., Vallee, G. R., Naughton, T., Tikotekar, A., Ostrouchov, G., … Varma, J. (2009, April). A Tunable Holistic Resiliency Approach for High-Performance Computing Systems. ACM SIGPLAN NOTICES, Vol. 44, pp. 305–306. https://doi.org/10.1145/1594835.1504227 Wang, C., Zhang, Z., Ma, X., Vazhkudai, S. S., & Mueller, F. (2009). Improving the availability of supercomputer job input data using temporal replication. Computer Science - Research and Development, 23(3-4), 149–157. https://doi.org/10.1007/s00450-009-0082-8 Sarkar, A., Mueller, F., Ramaprasad, H., & Mohan, S. (2009, July). Push-Assisted Migration of Real-Time Tasks in Multi-Core Processors. ACM SIGPLAN NOTICES, Vol. 44, pp. 80–89. https://doi.org/10.1145/1543136.1542464 Noeth, M., Ratn, P., Mueller, F., Schulz, M., & Supinski, B. R. (2009, August). ScalaTrace: Scalable compression and replay of communication traces for high-performance computing. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, Vol. 69, pp. 696–710. https://doi.org/10.1016/j.jpdc.2008.09.001 Zhu, Y., & Mueller, F. (2008). Exploiting synchronous and asynchronous DVS for feedback EDF scheduling on an embedded platform. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 7(1). https://doi.org/10.1145/1324969.1324972 Wilhelm, R., Engblom, J., Ermedahl, A., Holsti, N., Thesing, S., Whalley, D., … Stenstrom, P. (2008). [Review of The worst-case execution-time problem - Overview of methods and survey of tools]. ACM Transactions on Embedded Computing Systems, 7(3). Zhu, Y., & Mueller, F. (2007, July). DVSleak: Combining leakage reduction and voltage scaling in feedback EDF scheduling. ACM SIGPLAN NOTICES, Vol. 42, pp. 31–40. https://doi.org/10.1145/1273444.1254772 Coffman, J., Healy, C., Mueller, F., & Whalley, D. (2007, July). Generalizing parametric timing analysis. ACM SIGPLAN NOTICES, Vol. 42, pp. 152–154. https://doi.org/10.1145/1273444.1254795 Marathe, J., Mueller, F., Mohan, T., McKee, S. A., De Supinski, B. R., & Yoo, A. (2007). METRIC: Memory tracing via dynamic binary rewriting to identify cache inefficiencies. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 29(2). https://doi.org/10.1145/1216374.1216380 Marathe, J., & Mueller, F. (2007). Source-code-correlated cache coherence characterization of OpenMP benchmarks. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 18(6), 818–834. https://doi.org/10.1109/TPDS.2007.1058 Seth, K., Anantaraman, A., Mueller, F., & Rotenberg, E. (2006). FAST: Frequency-Aware Static Timing Analysis. ACM Transactions on Programming Languages and Systems, 5(1), 200–224. Zhao, W., Kreahling, W., Whalley, D., Healy, C., & Mueller, F. (2006). Improving WCET by applying worst-case path optimizations. REAL-TIME SYSTEMS, 34(2), 129–152. https://doi.org/10.1007/s11241-006-8643-4 Zhu, Y. F., & Mueller, F. (2005, July). Feedback EDF scheduling exploiting hardware-assisted asynchronous dynamic voltage scaling. ACM SIGPLAN NOTICES, Vol. 40, pp. 203–212. https://doi.org/10.1145/1070891.1065939 Zhu, Y. F., & Mueller, F. (2005). Feedback EDF scheduling of real-time tasks exploiting dynamic voltage scaling. REAL-TIME SYSTEMS, 31(1-3), 33–63. https://doi.org/10.1007/s11241-005-2744-3 Patil, K., Seth, K., & Mueller, F. (2004, July). Compositional static instruction cache simulation. ACM SIGPLAN NOTICES, Vol. 39, pp. 136–145. https://doi.org/10.1145/998300.997183 Anantaraman, A., Seth, K., Rotenberg, E., & Mueller, F. (2004). Enforcing safety of real-time schedules on contemporary processors using a virtual simple architecture (VISA). 25TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, pp. 114–125. https://doi.org/10.1109/real.2004.19 Desai, N., & Mueller, F. (2004). Scalable hierarchical locking for distributed systems. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 64(6), 708–724. https://doi.org/10.1016/j.jpdc.2003.10.001 Vetter, J. S., & Mueller, F. (2003, September). Communication characteristics of large-scale scientific applications for contemporary cluster architectures. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, Vol. 63, pp. 853–865. https://doi.org/10.1016/S0743-7315(03)00104-7 Seth, K., Anantaraman, A., Mueller, F., & Rotenberg, E. (2003). FAST: Frequency-aware static timing analysis. RTSS 2003: 24TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, pp. 40–51. https://doi.org/10.1109/real.2003.1253252 Anantaraman, A., Seth, K., Patil, K., Rotenberg, E., & F. Mueller, F. (2003). Virtual Simple Architecture (VISA): Exceeding the complexity limit in safe real-time systems. Computers and their applications :|bproceedings of the ISCA 16th International Conference, Seattle, Washington, USA, March 28-30, 2001, 350–361. Cary, NC: ISCA. Dudani, A., Mueller, F., & Zhu, Y. F. (2002, July). Energy-conserving feedback EDF scheduling for embedded systems with real-time constraints. ACM SIGPLAN NOTICES, Vol. 37, pp. 213–222. https://doi.org/10.1145/566225.513865 Unger, S., & Mueller, F. (2002). Handling irreducible loops: Optimized node splitting versus DJ-Graphs. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 24(4), 299–333. https://doi.org/10.1145/567097.567098 Wegener, J., & Mueller, F. (2001). A comparison of static analysis and evolutionary testing for the verification of timing constraints. REAL-TIME SYSTEMS, 21(3), 241–268. https://doi.org/10.1023/A:1011132221066 High-level parallel programming models and supportive environments 6th international workshop, HIPS 2001, San Francisco, CA, USA, April 23, 2001 : proceedings. (2001). New York: Springer.