Evaluating and mitigating gender bias in machine learning based resume filtering

Singh, Simar Preet

Welcome to the Digital Repository Service@BU

The Digital Institutional Repository of BU

DRS@BU captures all intellectual assets of Bennett University. This platform enables the BU community to deposit (self-archive) their publications using a web interface, preserve and organize these publications for easy retrieval. It is expected that the repository will evolve as a major source of reference for all Bennett University publications accessible on the net.

nanoll extt

Please use this identifier to cite or link to this item: http://lrcdrs.bennett.edu.in:80/handle/123456789/5038

Title:	Evaluating and mitigating gender bias in machine learning based resume filtering
Authors:	Singh, Simar Preet
Keywords:	Gender Bias Information extraction
Issue Date:	2023
Publisher:	Multimedia Tools and Applications
Abstract:	Shortlisting resumes for the companies are being automated using artifcial intelligence however, training systems to do that incorporate high social biases in the models. Con sidering the vitality of mitigating gender bias present in society, the research introduces a method for hiding gender specifc terms from data, termed as Gender Masking, before fnding the similarity with the job requirements. The paper ideates a method of reduction in indulgence of social biases in machine learning based resume fltering algorithms. In addition, an evaluation method is proposed to justify exclusion of gender specifc terms from classifcation of resumes short-listed for a particular role based upon requirements. The novelty of the proposed method is that upon extraction of information from the resume based on probabilistic indexing, the gender specifc terms are masked. This corpus is used as the received information in form of word encoding, across the stated requirements in order to retrieve a similarity score of the information using cosine similarity in correspond ence to the posting. The proposed model is evaluated using gender-swapped corpus to ensure unbiased performance of the algorithm. The evaluation method represents the per formance variation of the text on swapping the gender, it represents the unintentional dif ferences the algorithm captures based on the biases present in the society. The experimen tal research is taken out on preprocessed datasets (Online Resume Datasets), from which an average of 15.46% are observed to have been afected by gender bias, which is omit ted through the proposed method. From the results computed, an average increase of 1.2% accuracy on the trained Random Forest model is experienced outperforming state-of-the art techniques of training generic Linear SVM, Logistic Regression and Multinomial Naive Bayes models. The model is regularized to have 100 maximum trees in the ensemble along with 20 maximum depth and 10 minimum samples to split the nodes
URI:	https://doi.org/10.1007/s11042-023-16552-x http://lrcdrs.bennett.edu.in:80/handle/123456789/5038
ISSN:	13807501
Appears in Collections:	Conference/Seminar Papers_ SCSET

Files in This Item:

File	Description	Size	Format
Evaluating and mitigating gender bias in machine learning.pdf Restricted Access		1.3 MB	Adobe PDF	View/Open Request a copy

Show full item record

Contact admin for Full-Text