Taipei Medical University Institutional Repository:Item 987654321/64321
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
    Please use this identifier to cite or link to this item: http://libir.tmu.edu.tw/handle/987654321/64321


    Title: Identifying Five-Factor Model of Personality through text with Natural Language Processing for Social Media Recruitment on Reddit
    Authors: JIE, LIEW DI
    Contributors: Master's Program, Graduate Institute of Data Science (大數據科技及管理研究所碩士班)
    張詠淳
    Keywords: Recruitment;Natural Language Processing;Personality;Social Media;Text Mining;Deep Learning
    Date: 2023-06-20
    Issue Date: 2024-09-30 14:21:19 (UTC+8)
    Abstract: Recruitment is an essential function of Human Resources, and finding an appropriate hire is crucial to an organization. Understanding a candidate's personality can help the hiring process by giving employers an idea of whether the candidate fits the role and the culture. The arrival of the internet and social media has brought large changes to recruitment, along with new opportunities. Social media platforms such as Twitter, Facebook and Reddit allow employers to market themselves to potential employees, and candidates can engage directly with employers. These engagements produce a large amount of text, which can be analyzed to learn more about candidates, because some aspects of personality can be captured through a person's use and style of language.
    This research applies machine learning and deep learning techniques to identify Five-Factor Model personality dimensions from Reddit text data. It compares ten machine learning classifiers and six deep learning classifiers, which include two transformer models, with our proposed sentence selection method and deep learning model architecture, FF-BERT. The proposed method uses the Log-Likelihood Ratio to extract keywords from the high and low ends of each personality dimension and combines them into a keyword list. This list is then used to extract the sentences most relevant to the five personality dimensions, and these sentences are used for training. We also performed topic modeling using BERTopic and compared the results with the keywords from each personality dimension.
    Our results showed that the proposed sentence selection method and deep learning architecture achieved substantial gains over the baseline machine learning and deep learning techniques. We also found patterns in the topics extracted by BERTopic and in the keywords that match some characteristics of the personality dimensions.
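The keyword and sentence selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the thesis's exact Log-Likelihood Ratio formulation, tokenization, or cutoffs, so everything below is an assumption: it uses a Dunning-style G² keyness score, whitespace tokenization, and the hypothetical helper names `llr`, `top_keywords`, and `select_sentences`. It is not the author's implementation.

```python
import math
from collections import Counter


def llr(a, b, c, d):
    """G2 keyness of a word seen `a` times in a corpus of `c` tokens
    and `b` times in a corpus of `d` tokens (assumed formulation)."""
    e1 = c * (a + b) / (c + d)  # expected count in corpus 1
    e2 = d * (a + b) / (c + d)  # expected count in corpus 2
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / e1)
    if b > 0:
        g2 += b * math.log(b / e2)
    return 2.0 * g2


def top_keywords(target_docs, other_docs, k):
    """Return the k words most characteristic of target_docs."""
    target = Counter(w for doc in target_docs for w in doc.lower().split())
    other = Counter(w for doc in other_docs for w in doc.lower().split())
    c, d = sum(target.values()), sum(other.values())
    scores = {}
    for word, a in target.items():
        b = other.get(word, 0)
        # Keep only words that are relatively more frequent in the target group.
        if a / c > b / d:
            scores[word] = llr(a, b, c, d)
    # Sort by score (descending), breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def select_sentences(sentences, keywords):
    """Keep sentences that contain at least one keyword."""
    kw = set(keywords)
    return [s for s in sentences if kw & set(s.lower().split())]


# Toy texts standing in for the high and low ends of one dimension.
high = ["party friends party fun", "party tonight with friends"]
low = ["reading alone at home", "quiet home reading"]

# Combine keywords from both ends, as the abstract describes.
keywords = top_keywords(high, low, 2) + top_keywords(low, high, 2)
selected = select_sentences(
    ["I love a good party", "The weather is mild", "Reading is my hobby"],
    keywords,
)
```

In this toy setup the sentence with no keyword is dropped, and the remaining sentences would form the training input. A real pipeline would likely need proper tokenization, stop-word handling, and a tuned keyword count.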
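    Description: Master's
    Advisor: 張詠淳
    Oral defense committee member: 蘇家玉
    Oral defense committee member: 張詠淳
    Oral defense committee member: 陳建錦
    Note: Thesis public release date: 2024-01-03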
    Data Type: thesis
    Appears in Collections:[Graduate Institute of Data Science] Dissertations/Theses

    Files in This Item:

    File          Description    Size    Format
    index.html                   0Kb     HTML


    All items in TMUIR are protected by copyright, with all rights reserved.


    Copyright Notice
    • The digital content on this platform is part of the Taipei Medical University Institutional Repository, featuring various academic works and outputs from the institution. In the spirit of open access, users may search, download, and use this content. Please use the content appropriately and within legal boundaries to respect copyright owners' rights. For commercial use, please obtain prior authorization from the copyright owner.

    • By browsing or using the platform, users are deemed to have fully accepted and understood all the regulations set out in this statement, relevant laws of the Republic of China, all international internet regulations, and usage conventions. Furthermore, users must not use TMUIR for any illegal purposes.

    • TMUIR endeavors to protect the interests of copyright owners. If you believe that any digital content on this platform infringes copyright, please notify our maintenance staff ([email protected]). We will immediately take remedial measures such as removing the work from the repository.
