nanoll extt
Please use this identifier to cite or link to this item: http://lrcdrs.bennett.edu.in:80/handle/123456789/5038
Title: Evaluating and mitigating gender bias in machine learning based resume filtering
Authors: Singh, Simar Preet
Keywords: Gender Bias
Information extraction
Issue Date: 2023
Publisher: Multimedia Tools and Applications
Abstract: Shortlisting resumes for the companies are being automated using artifcial intelligence however, training systems to do that incorporate high social biases in the models. Con sidering the vitality of mitigating gender bias present in society, the research introduces a method for hiding gender specifc terms from data, termed as Gender Masking, before fnding the similarity with the job requirements. The paper ideates a method of reduction in indulgence of social biases in machine learning based resume fltering algorithms. In addition, an evaluation method is proposed to justify exclusion of gender specifc terms from classifcation of resumes short-listed for a particular role based upon requirements. The novelty of the proposed method is that upon extraction of information from the resume based on probabilistic indexing, the gender specifc terms are masked. This corpus is used as the received information in form of word encoding, across the stated requirements in order to retrieve a similarity score of the information using cosine similarity in correspond ence to the posting. The proposed model is evaluated using gender-swapped corpus to ensure unbiased performance of the algorithm. The evaluation method represents the per formance variation of the text on swapping the gender, it represents the unintentional dif ferences the algorithm captures based on the biases present in the society. The experimen tal research is taken out on preprocessed datasets (Online Resume Datasets), from which an average of 15.46% are observed to have been afected by gender bias, which is omit ted through the proposed method. From the results computed, an average increase of 1.2% accuracy on the trained Random Forest model is experienced outperforming state-of-the art techniques of training generic Linear SVM, Logistic Regression and Multinomial Naive Bayes models. The model is regularized to have 100 maximum trees in the ensemble along with 20 maximum depth and 10 minimum samples to split the nodes
URI: https://doi.org/10.1007/s11042-023-16552-x
http://lrcdrs.bennett.edu.in:80/handle/123456789/5038
ISSN: 13807501
Appears in Collections:Conference/Seminar Papers_ SCSET

Files in This Item:
File Description SizeFormat 
Evaluating and mitigating gender bias in machine learning.pdf
  Restricted Access
1.3 MBAdobe PDFView/Open Request a copy

Contact admin for Full-Text

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.