PERBANDINGAN KINERJA 3 MODEL BAHASA INDOBERT UNTUK PERINGKASAN TEKS PADA DOKUMEN BAHASA INDONESIA

Herwinsyah, NIM.: 21206052001 (2023) PERBANDINGAN KINERJA 3 MODEL BAHASA INDOBERT UNTUK PERINGKASAN TEKS PADA DOKUMEN BAHASA INDONESIA. Masters thesis, UIN SUNAN KALIJAGA YOGYAKARTA.

[img]
Preview
Text (PERBANDINGAN KINERJA 3 MODEL BAHASA INDOBERT UNTUK PERINGKASAN TEKS PADA DOKUMEN BAHASA INDONESIA)
21206052001_BAB-I_IV-atau-V_DAFTAR-PUSTAKA.pdf - Published Version

Download (3MB) | Preview
[img] Text (PERBANDINGAN KINERJA 3 MODEL BAHASA INDOBERT UNTUK PERINGKASAN TEKS PADA DOKUMEN BAHASA INDONESIA)
21206052001_BAB-II_sampai_SEBELUM-BAB-TERAKHIR..pdf - Published Version
Restricted to Registered users only

Download (7MB) | Request a copy

Abstract

A good document summary produced by a language model should have a high level of similarity to a summary generated by a human, as the latter considers the semantic value of the text. Typically, a summarization algorithm implemented by a system tends to focus solely on the arrangement of words, without taking into account the semantic aspect of a sentence. The objective of this study is to measure the quality of the summary results obtained from three language models, namely IndoBERT. In this study, the researcher summarized the same source material using three language models that are focused on the Indonesian language. The summary results generated by the three language models were then compared with a reference summary, which is a summary generated by a human from the same paragraph source. Three evaluation tools were used to measure the quality of the summary results. The study found that the IndoBERT_3 language model, or the IndoBERT version developed by Sarah Lintang (UGM Jogja), produced a summary result that closely approximates the level of similarity to the human-generated summary (gold summary).

Item Type: Thesis (Masters)
Additional Information: Pembimbing: Dr. Agung Fatwanto, S.Si, M.Kom
Uncontrolled Keywords: Model Bahasa, IndoBERT, ROUGE, BLEU, BERTScore
Subjects: Tehnik Informatika
BAHASA > Bahasa Indonesia
Divisions: Fakultas Sains dan Teknologi > Informatika (S2)
Depositing User: Muh Khabib, SIP.
Date Deposited: 06 Jun 2023 15:52
Last Modified: 06 Jun 2023 15:52
URI: http://digilib.uin-suka.ac.id/id/eprint/59064

Share this knowledge with your friends :

Actions (login required)

View Item View Item
Chat Kak Imum