Eesti Vabariigi seaduste ja kohtulahendite andmebaas ning ristviitamine
Date
2016
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Riigi Teataja on Eesti Vabariigi ametlik võrguväljaanne, kust on võimalik leida seaduseid ja kohtulahendeid. Selles bakalaureusetöös otsitakse Riigi Teataja veebilehelt üles hetkel keh-tivad seadused ja viimase kümne aasta jooksul avaldatud kohtulahendid. Informatsioon laaditakse Riigi Teataja lehelt alla automaatselt, kasutades selleks Scrapy tööriista abil kir-jutatud programmi. Kuna vajalikud andmed on leitavad vaid PDF-failidena, kasutatakse neist teksti eraldamiseks tööriista PDFMiner. Täistekstidest metaandmete eraldamiseks on kirjutatud vastav tekstianalüsaator. Seejärel seotakse seadused ja kohtulahendid omavahel ning saadud andmestikku kasutatakse andmeanalüüsiks, uurides eelkõige seaduste kasuta-tavust kohtulahendites. Andmetöötlust tehakse programmeerimiskeeles Python, päringuid koostatakse programmeerimiskeeles SQL. Töö tulemustest on näha, et põhiline töö tehakse kohtutes ära vaid mõne seadusega ning paljudele hetkel kehtivatele seadustele ei ole viima-se kümne aasta jooksul kordagi viidatud.
Riigi Teataja is an official publication of the Republic of Estonia, where laws and adjudi-cations can be found. In this Bachelor’s thesis, adjudications from the last 10 years and currently effective laws are downloaded using a tool named Scrapy. Because the data is only available as PDF files, an additional tool named PDFMiner is used, to parse text from the files. In order to extract metadata from the full texts, a corresponding text-analysing program is written. The data is then cross-referenced and analysed, especially with focus on the usage of laws in adjudications. Data processing is done using the Python language, queries are constructed in SQL. The results show that most of the work in courts is done using only a few laws and that many of the currently effective laws have not been refer-enced once in the last ten years.
Riigi Teataja is an official publication of the Republic of Estonia, where laws and adjudi-cations can be found. In this Bachelor’s thesis, adjudications from the last 10 years and currently effective laws are downloaded using a tool named Scrapy. Because the data is only available as PDF files, an additional tool named PDFMiner is used, to parse text from the files. In order to extract metadata from the full texts, a corresponding text-analysing program is written. The data is then cross-referenced and analysed, especially with focus on the usage of laws in adjudications. Data processing is done using the Python language, queries are constructed in SQL. The results show that most of the work in courts is done using only a few laws and that many of the currently effective laws have not been refer-enced once in the last ten years.