Taipei Medical University Institutional Repository:Item 987654321/64311
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 45422/58598 (78%)
Visitors : 2550541      Online Users : 234
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: http://libir.tmu.edu.tw/handle/987654321/64311


    Title: 使用生成式人工智慧從自由文本病理報告中提取結構化訊息
    Using Generative AI to Extract Structured Information from Free Text Pathology Reports
    Authors: SHAHID, FAHAD
    Contributors: 大數據科技及管理研究所碩士班
    許明暉
    Keywords: 生成式預訓練 變壓器 (GPT)、自由文本病理報告、資訊擷取、大型語言模型 (LLM)、ChatGPT、乳癌、人工智慧(AI)
    Generative pre-trained transformers (GPT)、Free-text pathology reports、information extraction、Large Language Models (LLM)、ChatGPT、Breast cancer、Artificial Intelligence (AI)
    Date: 2024-06-17
    Issue Date: 2024-09-30 14:20:56 (UTC+8)
    Abstract: ABSTRACT
    Background: Traditionally, organizing pathology reports has been a manual task. This approach is labor-intensive, costly, and often leads to inaccuracies, complicating the analysis of data for medical research and delaying critical insights. While artificial intelligence (AI) offers potential solutions, many systems struggle to effectively handle the complexity of pathological texts. Numerous studies and articles have underscored the urgent need to automate the process of extracting information and structuring that information from free text pathology reports.

    Objective: This study explores the use of generative AI to automate the extraction of information and structuring that information from 33 breast cancer free text pathology reports obtained from Taipei Medical University Hospital. The aim is to accelerate the information extraction process and enhance the reliability of this information for research and diagnostic purposes. Additionally, the study seeks if generative AI can help to improve the accuracy and efficiency to structure information from free text pathology reports.

    Methods: For this study, a Streamlit web application was developed, leveraging its capabilities to efficiently create robust generative AI applications. This application was seamlessly integrated with the ChatGPT Large Language Model provided by OpenAI, tasked with the extraction and structuring of information from free-text pathology reports. The data, once extracted, is methodically organized and compiled into a downloadable Excel file for subsequent analysis. Furthermore, the application is designed to display the results on its web interface, offering immediate validation features. This allows users to promptly verify and assess the accuracy of the information processed. The systematic and meticulous approach adopted ensures the highest standards of data integrity and operational efficiency, essential for the reliability of research outcomes.

    Results: The implementation of the Streamlit web application, integrated with the ChatGPT Large Language Model, successfully automated the extraction and structuring of information from 33 breast cancer free-text pathology reports. The system showed a higher percentage of accuracy, by achieving an extraction and structuring accuracy rate of 99.61%. This result confirms the effectiveness of generative AI in handling free text pathological reports.

    Conclusion: The reliability of the system was further underscored by its ability to significantly reduce the time required to structure and analyze free-text pathological reports compared to traditional manual methods. This advancement highlights the potential of generative AI to transform the processing of free-text pathological reports, enhancing both efficiency and accuracy, and offering substantial improvements in research and clinical diagnostics.
    Description: 碩士
    指導教授:許明暉
    口試委員:許
    口試委員:張詠淳
    口試委員:簡文山
    Note: 論文公開日期:2024-07-15
    Data Type: thesis
    Appears in Collections:[Graduate Institute of Data Science] Dissertations/Theses

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML70View/Open


    All items in TMUIR are protected by copyright, with all rights reserved.


    著作權聲明 Copyright Notice
    • 本平台之數位內容為臺北醫學大學所收錄之機構典藏,包含體系內各式學術著作及學術產出。秉持開放取用的精神,提供使用者進行資料檢索、下載與取用,惟仍請適度、合理地於合法範圍內使用本平台之內容,以尊重著作權人之權益。商業上之利用,請先取得著作權人之授權。

      The digital content on this platform is part of the Taipei Medical University Institutional Repository, featuring various academic works and outputs from the institution. It offers free access to academic research and public education for non-commercial use. Please use the content appropriately and within legal boundaries to respect copyright owners' rights. For commercial use, please obtain prior authorization from the copyright owner.

    • 瀏覽或使用本平台,視同使用者已完全接受並瞭解聲明中所有規範、中華民國相關法規、一切國際網路規定及使用慣例,並不得為任何不法目的使用TMUIR。

      By utilising the platform, users are deemed to have fully accepted and understood all the regulations set out in the statement, relevant laws of the Republic of China, all international internet regulations, and usage conventions. Furthermore, users must not use TMUIR for any illegal purposes.

    • 本平台盡力防止侵害著作權人之權益。若發現本平台之數位內容有侵害著作權人權益情事者,煩請權利人通知本平台維護人員([email protected]),將立即採取移除該數位著作等補救措施。

      TMUIR is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff([email protected]). We will remove the work from the repository.

    Back to Top
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback