Parameter-efficient fine-tuning in reading comprehension
dc.contributor.advisor | Kementchedjhieva, Yova, supervisor | |
dc.contributor.advisor | Sirts, Kairit, supervisor | |
dc.contributor.author | Abdumalikov, Rustam | |
dc.contributor.other | Tartu Ülikool. Loodus- ja täppisteaduste valdkond | et |
dc.contributor.other | Tartu Ülikool. Arvutiteaduse instituut | et |
dc.date.accessioned | 2023-11-02T10:39:04Z | |
dc.date.available | 2023-11-02T10:39:04Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Question answering is an important task in natural language processing. Approaches to it include relying on the knowledge learned during pre-training and extracting an answer from a given context, the latter commonly known as reading comprehension. One problem with the knowledge learned during pre-training is that it can become outdated, because the model is pre-trained only once. Instead of replacing outdated information in the model, an alternative approach is to add updated information to the model input. However, there is a risk that the model relies too heavily on its memorized knowledge and ignores the new information, which can cause errors. Our study analyzes whether parameter-efficient fine-tuning methods improve the model’s ability to handle new information. We compare these techniques with traditional fine-tuning for reading comprehension on an augmented NaturalQuestions dataset. Our findings indicate that parameter-efficient fine-tuning yields only a marginal improvement in performance over traditional fine-tuning. Furthermore, we observed that data augmentation contributed the most substantial performance gains. | et |
dc.identifier.uri | https://hdl.handle.net/10062/93945 | |
dc.language.iso | eng | et |
dc.publisher | Tartu Ülikool | et |
dc.rights | openAccess | et |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | natural language processing | et |
dc.subject | question answering | et |
dc.subject | fine-tuning | et |
dc.subject | transformers | et |
dc.subject | neural networks | et |
dc.subject.other | master's theses | et |
dc.subject.other | informaatika | et |
dc.subject.other | infotehnoloogia | et |
dc.subject.other | informatics | et |
dc.subject.other | infotechnology | et |
dc.title | Parameter-efficient fine-tuning in reading comprehension | et |
dc.type | Thesis | et |
Files
Original bundle
- Name: RustamAbdumalikov_ComputerScience_MasterThesis.pdf
- Size: 1000.77 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission