Joymallya Chakraborty

College of Engineering

Works (8)

Updated: October 1st, 2024 10:54

2024 journal article

When less is more: on the value of "co-training" for semi-supervised software defect predictors

EMPIRICAL SOFTWARE ENGINEERING, 29(2).

By: S. Majumder^*, J. Chakraborty^* & T. Menzies^*

author keywords: Semi-supervised learning; SSL; Self-training; Co-training; Boosting methods; Semi-supervised preprocessing; Clustering-based semi-supervised preprocessing; Intrinsically semi-supervised methods; Graph-based methods; Co-forest; Effort aware tri-training

topics (OpenAlex): Software Engineering Research; Software Reliability and Analysis Research; Software Testing and Debugging Techniques

10.1007/s10664-023-10418-4

open access via arxiv.org (repository)

Sources: Web Of Science, NC State University Libraries

Added: March 11, 2024

2023 journal article

Fair Enough: Searching for Sufficient Measures of Fairness

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 32(6).

By: S. Majumderⁿ, J. Chakrabortyⁿ, G. Baiⁿ, K. Stoleeⁿ & T. Menziesⁿ

author keywords: Software fairness; fairness metrics; clustering; theoretical analysis; empirical analysis

topics (OpenAlex): Ethics and Social Impacts of AI; Adversarial Robustness in Machine Learning; Explainable Artificial Intelligence (XAI)

TL;DR: This article shows that many fairness metrics effectively measure the same thing, so it is no longer necessary (or even possible) to satisfy all of them. (via Semantic Scholar)

10.1145/3585006

open access via arxiv.org (repository)

Sources: Web Of Science, ORCID, NC State University Libraries

Added: October 31, 2023

2023 journal article

FairMask: Better Fairness via Model-Based Rebalancing of Protected Attributes

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 49(4), 2426–2439.

By: K. Pengⁿ, J. Chakrabortyⁿ & T. Menziesⁿ

author keywords: Software fairness; explanation; bias mitigation

topics (OpenAlex): Ethics and Social Impacts of AI; Adversarial Robustness in Machine Learning; Explainable Artificial Intelligence (XAI)

TL;DR: This work proposes a model-based extrapolation method that corrects the misleading latent correlation between the protected attributes and other non-protected ones and achieves significantly better group and individual fairness than benchmark methods. (via Semantic Scholar)

10.1109/TSE.2022.3220713

open access via arxiv.org (repository)

Sources: Web Of Science, ORCID, NC State University Libraries

Added: May 30, 2023

2022 article

Fair-SSL: Building fair ML Software with less data

2022 IEEE/ACM INTERNATIONAL WORKSHOP ON EQUITABLE DATA & TECHNOLOGY (FAIRWARE 2022), pp. 1–8.

By: J. Chakrabortyⁿ, S. Majumderⁿ & H. Tuⁿ

author keywords: Machine Learning with and for SE; Ethics in Software Engineering

topics (OpenAlex): Ethics and Social Impacts of AI; Intellectual Property and Patents; Digitalization, Law, and Regulation

TL;DR: This is the first SE work where semi-supervised techniques are used to fight against ethical bias in SE ML models, and the clear advantage of Fair-SSL is that it requires only 10% of the labeled training data. (via Semantic Scholar)

10.1145/3524491.3527305

Source: Web Of Science

Added: October 3, 2022

2021 article

Bias in Machine Learning Software: Why? How? What to Do?

PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), pp. 429–440.

By: J. Chakrabortyⁿ, S. Majumderⁿ & T. Menziesⁿ

author keywords: Software Fairness; Fairness Metrics; Bias Mitigation

topics (OpenAlex): Ethics and Social Impacts of AI; Adversarial Robustness in Machine Learning; Imbalanced Data Classification Techniques

TL;DR: This paper postulates that the root causes of bias are the prior decisions that affect what data was selected and the labels assigned to those examples, and proposes the Fair-SMOTE algorithm, which removes biased labels and rebalances internal distributions so that, based on the sensitive attribute, examples are equal in both positive and negative classes. (via Semantic Scholar)

10.1145/3468264.3468537

open access via arxiv.org (repository)

Sources: Web Of Science, ORCID, NC State University Libraries

Added: March 7, 2022

2020 article

Making Fair ML Software using Trustworthy Explanation

2020 35TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2020), pp. 1229–1233.

By: J. Chakrabortyⁿ, K. Pengⁿ & T. Menziesⁿ

topics (OpenAlex): Adversarial Robustness in Machine Learning; Ethics and Social Impacts of AI; Explainable Artificial Intelligence (XAI)

TL;DR: This work shows how the proposed method based on K nearest neighbors can overcome shortcomings and find the underlying bias of black box models and describes the future framework combining explanation and planning to build fair software. (via Semantic Scholar)

10.1145/3324884.3418932

open access via arxiv.org (repository)

Sources: Web Of Science, NC State University Libraries

Added: June 10, 2021

2019 article

Investigating the Effects of Gender Bias on GitHub

2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), pp. 700–711.

By: N. Imtiazⁿ, J. Middletonⁿ, J. Chakrabortyⁿ, N. Robsonⁿ, G. Baiⁿ & E. Murphy-Hill^*

author keywords: GitHub; gender; open source

topics (OpenAlex): Open Source Software Innovations; Digital Games and Media; Software Engineering Research

TL;DR: The effects of gender bias are largely invisible on the GitHub platform itself, but there are still signals of women concentrating their work in fewer places and being more restrained in communication than men. (via Semantic Scholar)

10.1109/ICSE.2019.00079

Source: Web Of Science

Added: September 7, 2020

2019 article

Predicting Breakdowns in Cloud Services (with SPIKE)

ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, pp. 916–924.

By: J. Chenⁿ, J. Chakrabortyⁿ, P. Clark^*, K. Haverlock^*, S. Cherian^* & T. Menziesⁿ

author keywords: Cloud; optimization; data mining; parameter tuning

topics (OpenAlex): Software System Performance and Reliability; Advanced Clustering Algorithms Research; Data Mining Algorithms and Applications

TL;DR: SPIKE is a data mining tool which can predict upcoming service breakdowns, half an hour into the future, and performed relatively better than other widely-used learning methods (neural nets, random forests, logistic regression). (via Semantic Scholar)

10.1145/3338906.3340450

Sources: Web Of Science, NC State University Libraries

Added: October 7, 2019

Works (8) 6 open access