Grammatiliste vigade parandamine sageduspõhise sünteetilise andmestikuga
dc.contributor.advisor | Luhtaru, Agnes, juhendaja | |
dc.contributor.advisor | Fišel, Mark, juhendaja | |
dc.contributor.author | Univer, Jakob | |
dc.contributor.other | Tartu Ülikool. Loodus- ja täppisteaduste valdkond | et |
dc.contributor.other | Tartu Ülikool. Arvutiteaduse instituut | et |
dc.date.accessioned | 2023-08-23T06:28:16Z | |
dc.date.available | 2023-08-23T06:28:16Z | |
dc.date.issued | 2022 | |
dc.description.abstract | In this thesis we introduce a grammatical error correction method with a neural network trained only on synthetic data. The method is useful for languages without big corpora for training a grammatical error correction model, like Estonian. From a smaller human corrected corpus, we found the probabilities of word deletion, addition, substitution and changing word order mistakes in the text. With the help of these probabilities we created a bigger synthetic corpus and we trained a neural network for grammatical error correction on the synthetic data. The author found that the probabilities of mistakes do not have to be very precise and the trained neural network can correct spelling mistakes as well as grammar mistakes. | et |
dc.identifier.uri | https://hdl.handle.net/10062/91682 | |
dc.language.iso | est | et |
dc.publisher | Tartu Ülikool | et |
dc.rights | openAccess | et |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Grammatcal Error Correction | et |
dc.subject | neural network | et |
dc.subject | synthetic data | et |
dc.subject.other | bakalaureusetööd | et |
dc.subject.other | informaatika | et |
dc.subject.other | infotehnoloogia | et |
dc.subject.other | informatics | et |
dc.subject.other | infotechnology | et |
dc.title | Grammatiliste vigade parandamine sageduspõhise sünteetilise andmestikuga | et |
dc.type | Thesis | et |