Identifying Five-Factor Model of Personality through text with Natural Language Processing for Social Media Recruitment on Reddit


Please use this identifier to cite or link to this item: http://libir.tmu.edu.tw/handle/987654321/64321

Title:	Identifying Five-Factor Model of Personality through text with Natural Language Processing for Social Media Recruitment on Reddit
Authors:	JIE, LIEW DI
Contributors:	Master's Program, Graduate Institute of Data Science (大數據科技及管理研究所碩士班); 張詠淳
Keywords:	Recruitment;Natural Language Processing;Personality;Social Media;Text Mining;Deep Learning
Date:	2023-06-20
Issue Date:	2024-09-30 14:21:19 (UTC+8)
Abstract:	Recruitment is an essential Human Resources function, and finding an appropriate hire is crucial to an organization. Understanding a candidate’s personality can help the hiring process by giving employers an idea of whether the candidate fits the role and the culture. The arrival of the internet and social media has brought large changes to recruitment, and with them new opportunities. Social media platforms such as Twitter, Facebook and Reddit allow employers to market themselves to potential employees and allow candidates to engage directly with employers. These engagements produce a large amount of text, which can be analyzed to learn more about candidates, because some aspects of personality can be captured through a person’s use and style of language. This research applies machine learning and deep learning techniques to Reddit text data to classify the personality dimensions of the Five-Factor Model. It compares ten machine learning classifiers and six deep learning classifiers, including two transformer models, with our proposed sentence selection method and deep learning model architecture, FF-BERT. The proposed method uses the Log-Likelihood Ratio to extract keywords from the high and low ends of each personality dimension and combines them into a keyword list. That list is then used to extract the sentences most relevant to the five personality dimensions, and these sentences are used for training. We also performed topic modeling with BERTopic and compared the results with the keywords from each personality dimension. Our results show that the proposed sentence selection method and deep learning architecture achieve substantial gains over the compared machine learning and deep learning techniques. We also found patterns in the topics extracted by BERTopic and in the keywords that match some characteristics of the personality dimensions.
Description:	Master's thesis. Advisor: 張詠淳. Oral defense committee: 蘇家玉, 張詠淳, 陳建錦
Note:	Thesis public release date: 2024-01-03
Data Type:	thesis
Appears in Collections:	[Graduate Institute of Data Science] Dissertations/Theses
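
The keyword extraction and sentence selection steps described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes the two-term log-likelihood (G2) statistic commonly used for corpus keyword comparison, a simple regex tokenizer, and hypothetical function names (`llr`, `keywords_for_dimension`, `select_sentences`). The thesis may use a different formulation, tokenizer, or cutoff.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase word tokens; the thesis may preprocess differently.
    return re.findall(r"[a-z']+", text.lower())


def llr(a, b, c, d):
    """Two-term log-likelihood (G2) for one word.

    a, b: word counts in the high-end and low-end corpora.
    c, d: total token counts of the high-end and low-end corpora.
    """
    e1 = c * (a + b) / (c + d)  # expected count in the high-end corpus
    e2 = d * (a + b) / (c + d)  # expected count in the low-end corpus
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / e1)
    if b > 0:
        g2 += b * math.log(b / e2)
    return 2.0 * g2


def count_tokens(texts):
    return Counter(t for text in texts for t in tokenize(text))


def keywords_for_dimension(high_texts, low_texts, k):
    """Top-k keywords of each end of one dimension, combined into one set."""
    high, low = count_tokens(high_texts), count_tokens(low_texts)
    c, d = sum(high.values()), sum(low.values())
    high_scores, low_scores = {}, {}
    for word in set(high) | set(low):
        a, b = high[word], low[word]
        score = llr(a, b, c, d)
        if a * d > b * c:      # relatively more frequent at the high end
            high_scores[word] = score
        elif b * c > a * d:    # relatively more frequent at the low end
            low_scores[word] = score

    def top(scores):
        ordered = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        return [word for word, _ in ordered[:k]]

    return set(top(high_scores)) | set(top(low_scores))


def select_sentences(text, keywords, n):
    """Keep up to n sentences with the most keyword hits, in original order."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    hits = [sum(1 for t in tokenize(s) if t in keywords) for s in sentences]
    ranked = sorted(range(len(sentences)), key=lambda i: -hits[i])
    chosen = sorted(i for i in ranked[:n] if hits[i] > 0)
    return [sentences[i] for i in chosen]
```

In this sketch, `keywords_for_dimension` would be called once per personality dimension with the posts of high-scoring and low-scoring authors, and the union of the five keyword sets would be passed to `select_sentences` to reduce each author's text before training a classifier.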


