Skip to content

Code used for "The Effects of Input and Temperature of GPT Model on Labeling Medical Data" BSc thesis.

Notifications You must be signed in to change notification settings

VeronikaKukk/gpt-and-bert-ner

Repository files navigation

The Effects of Input and Temperature of GPT Model on Labeling Medical Data

This repository consists of code used for "The Effects of Input and Temperature of GPT Model on Labeling Medical Data" 2024 BSc thesis. Thesis supervisor is Hendrik Šuvalov. Parts of the code were provided by the supervisor. The provided codeblocks are marked with a comments.

The code has two parts:

  1. Fine-tuning XLM-RoBERTa and estmedBERT models on GPT annotations and on human annotations
  2. Researching the data from NCBI dataset and synthetic Estonian medical dataset

There are 8 files.


GPT mudeli sisendi ja temperatuuri mõju meditsiiniliste andmete märgendamisele

See repositoorium koosneb koodist, mida kasutati "GPT mudeli sisendi ja temperatuuri mõju meditsiiniliste andmete märgendamisele" BSc lõputöös. Töö juhendajaks on Hendrik Šuvalov. Osa koodist on juhendaja poolt antud ning need on koodi sees kommentaariga ära märgitud.

Kood koosneb kahest osast:

  1. XLM-RoBERTa ja estmedBERT baasmudelite peenhäälestamine GPT mudeli märgendustega ja inimese märgendatud andmetega
  2. NCBI andmestiku ja eestikeelse sünteetilise meditsiinilise andmestiku pinnapealne uurimine

Repositooriumis on 8 faili.

About

Code used for "The Effects of Input and Temperature of GPT Model on Labeling Medical Data" BSc thesis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published