Works (26)

2021 article

Exploring Thread Coarsening on FPGA

2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), pp. 436–441.

By: M. Zarch n, R. Neff & M. Becchi

author keywords: OpenCL; FPGA; high-level synthesis; compiler techniques; thread-coarsening; performance optimization
Source: Web Of Science
Added: May 2, 2022

2021 article

PILOT: a Runtime System to Manage Multi-tenant GPU Unified Memory Footprint

2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), pp. 442–447.

By: J. Ravi, T. Nguyen, H. Zhou & M. Becchi

Sources: Web Of Science, ORCID
Added: May 2, 2022

2020 article

A Loop-aware Autotuner for High-Precision Floating-point Applications

2020 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), pp. 285–295.

By: R. Gu, P. Beata n & M. Becchi

author keywords: autotuner; mixed-precision; floating-point
Source: Web Of Science
Added: May 24, 2021

2020 article

Evaluating Thread Coarsening and Low-cost Synchronization on Intel Xeon Phi

2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, pp. 1018–1029.

By: H. Wu & M. Becchi

author keywords: SIMT; manycore processors; Intel Xeon Phi; thread coarsening; synchronization
Source: Web Of Science
Added: June 10, 2021

2020 article

GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting

2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, pp. 274–284.

By: X. Yu*, F. Wei*, X. Ou*, M. Becchi, T. Bicer* & D. Yao*

author keywords: GPU; static program analysis; data-flow analysis; Android security; mobile application vetting; worklist algorithm; application-specific optimization
Source: Web Of Science
Added: June 10, 2021

2020 article

GPU-FPtuner: Mixed-precision Auto-tuning for Floating-point Applications on GPU

2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), pp. 294–304.

By: R. Gu & M. Becchi

author keywords: GPU; floating-point arithmetic; mixed-precision arithmetic; accuracy; performance; autotuning
Source: Web Of Science
Added: August 2, 2021

2020 article

Optimizing Complex OpenCL Code for FPGA: A Case Study on Finite Automata Traversal

2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), pp. 518–527.

By: M. Nourian, M. Zarch n & M. Becchi

author keywords: OpenCL; FPGA; high-level synthesis; automata processing; NFA; performance optimization
Source: Web Of Science
Added: July 19, 2021

2019 article

A Comparative Study of Parallel Programming Frameworks for Distributed GPU Applications

CF '19 - PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, pp. 268–273.

By: R. Gu & M. Becchi

author keywords: Parallel computing; Distributed applications; Homogeneous cluster
Source: Web Of Science
Added: July 29, 2019

2019 article

Editorial: Special Issue on Computing Frontiers

Palumbo, F., & Becchi, M. (2019, March). JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, Vol. 91, pp. 273–273.

By: F. Palumbo* & M. Becchi

Source: Web Of Science
Added: March 18, 2019

2019 journal article

Evaluating High Performance Pattern Matching on the Automata Processor

IEEE TRANSACTIONS ON COMPUTERS, 68(8), 1201–1212.

author keywords: Finite automata; regular expressions; automata processor; FPGAs; intrusion detection; protein motifs
Source: Web Of Science
Added: July 29, 2019

2018 article

A Compiler Framework for Fixed-topology Non-deterministic Finite Automata on SIMD Platforms

2018 IEEE 24TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2018), pp. 507–516.

By: M. Nourian, H. Wu & M. Becchi

author keywords: Automata Processing; NFAs; SIMD; GPUs; Intel Xeon Phi platforms
Source: Web Of Science
Added: April 22, 2019

2018 article

Compiling SIMT Programs on Multi- and Many-core Processors with Wide Vector Units: A Case Study with CUDA

2018 IEEE 25TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), pp. 123–132.

By: H. Wu, J. Ravi & M. Becchi

author keywords: Xeon Phi; hybrid MIMD/SIMD systems; CUDA; SIMT; vectorization
Source: Web Of Science
Added: June 17, 2019

2017 journal article

A Principled Approach to Secure Multi-core Processor Design with ReWire

ACM Transactions on Embedded Computing Systems, 16(2), 1–25.

By: A. Procter*, W. Harrison*, I. Graves*, M. Becchi & G. Allwein*

author keywords: Equational reasoning; monads; hardware security; reconfigurable computing
Source: Crossref
Added: July 20, 2019

2017 article

An Analytical Study of Recursive Tree Traversal Patterns on Multi- and Many-core Platforms

2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), pp. 586–595.

By: H. Wu n & M. Becchi

author keywords: recursive tree traversal; many-core processors; parallelism; GPU
Source: Web Of Science
Added: November 19, 2018

2017 conference paper

Understanding the performance-accuracy tradeoffs of floating-point arithmetic on GPUs

Proceedings of the 2017 ieee international symposium on workload characterization (iiswc), 207–218.

By: S. Surineni*, R. Gu, H. Nguyen* & M. Becchi

Source: NC State University Libraries
Added: August 6, 2018

2016 journal article

Picking Pesky Parameters: Optimizing Regular Expression Matching in Practice

IEEE Transactions on Parallel and Distributed Systems, 27(5), 1430–1442.

By: X. Chen*, B. Jones*, M. Becchi & T. Wolf*

author keywords: Network security; deep packet inspection; deterministic finite automaton; non-deterministic finite automaton; regular expressions; design space exploration
Source: Crossref
Added: July 20, 2019

2015 chapter

Hardware Synthesis from Functional Embedded Domain-Specific Languages: A Case Study in Regular Expression Compilation

In Lecture Notes in Computer Science (pp. 41–52).

By: I. Graves*, A. Procter*, W. Harrison*, M. Becchi & G. Allwein*

Source: Crossref
Added: February 24, 2020

2014 journal article

Large-Scale Pairwise Alignments on GPU Clusters: Exploring the Implementation Space

Journal of Signal Processing Systems, 77(1-2), 131–149.

By: H. Truong*, D. Li*, K. Sajjapongse*, G. Conant & M. Becchi

author keywords: Heterogeneous system; Sequence alignment; GPU
Sources: Crossref, ORCID
Added: July 20, 2019

2014 journal article

Revisiting State Blow-Up: Automatically Building Augmented-FA While Preserving Functional Equivalence

IEEE Journal on Selected Areas in Communications, 32(10), 1822–1833.

By: X. Yu*, B. Lin* & M. Becchi

author keywords: Deep packet inspection; finite automata; regular expression matching
Source: Crossref
Added: July 20, 2019

2013 journal article

A-DFA

ACM Transactions on Architecture and Code Optimization, 10(1), 1–26.

By: M. Becchi & P. Crowley*

author keywords: Algorithms; Design; Performance; Security; Deep packet inspection; regular expressions; deterministic finite automata; memory compression
Source: Crossref
Added: July 20, 2019

2013 journal article

Diet Alters Both the Structure and Taxonomy of the Ovine Gut Microbial Ecosystem

DNA Research, 21(2), 115–125.

By: M. Ellison*, G. Conant, R. Cockrum*, K. Austin*, H. Truong*, M. Becchi, W. Lamberson*, K. Cammack*

author keywords: Ovis aries; microbiome; 16S subunit
Sources: Crossref, ORCID
Added: July 20, 2019

2013 chapter

Efficient GPU Implementation of the Integral Histogram

In Computer Vision - ACCV 2012 Workshops (pp. 266–278).

By: M. Poostchi*, K. Palaniappan*, F. Bunyak*, M. Becchi & G. Seetharaman*

Source: Crossref
Added: August 28, 2020

2013 journal article

Scheduling concurrent applications on a cluster of CPU–GPU nodes

Future Generation Computer Systems, 29(8), 2262–2271.

By: V. Ravi, M. Becchi, W. Jiang, G. Agrawal & S. Chakradhar

author keywords: Scheduling; CPU-GPU systems
Source: Crossref
Added: July 20, 2019

2012 journal article

A Massively Parallel, Energy Efficient Programmable Accelerator for Learning and Classification

ACM Transactions on Architecture and Code Optimization, 9(1), 1–30.

By: A. Majumdar*, S. Cadambi*, M. Becchi, S. Chakradhar* & H. Graf*

author keywords: Design; Performance; Accelerator-based computing; parallel computing; heterogeneous computing; machine learning; architecture
Source: Crossref
Added: July 20, 2019

2012 journal article

Accelerating large-scale protein structure alignments with graphics processing units

BMC Research Notes, 5(1), 116.

Source: Crossref
Added: July 20, 2019

2008 journal article

Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures

The Journal of Instruction-Level Parallelism (JILP), 10.

By: M. Becchi & P. Crowley

Source: NC State University Libraries
Added: July 28, 2019