<mets:mets OBJID="eprint_76894" LABEL="Eprints Item" xsi:schemaLocation="http://www.loc.gov/METS/ http://www.loc.gov/standards/mets/mets.xsd http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd" xmlns:mets="http://www.loc.gov/METS/" xmlns:mods="http://www.loc.gov/mods/v3" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><mets:metsHdr CREATEDATE="2026-06-28T00:09:19Z"><mets:agent ROLE="CUSTODIAN" TYPE="ORGANIZATION"><mets:name>Institutional Repository UIN Sunan Kalijaga Yogyakarta</mets:name></mets:agent></mets:metsHdr><mets:dmdSec ID="DMD_eprint_76894_mods"><mets:mdWrap MDTYPE="MODS"><mets:xmlData><mods:titleInfo><mods:title>DETEKSI CACAT KODE PADA BAHASA PEMROGRAMAN PYTHON MENGGUNAKAN MODEL BAHASA CODET5-SMALL BERBASIS FOCAL LOSS</mods:title></mods:titleInfo><mods:name type="personal"><mods:namePart type="given">NIM.: 24206051005</mods:namePart><mods:namePart type="family">Hanny Handayani Sucinta</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:abstract>Code defect detection is an important component of software quality assurance because it helps identify potentially faulty code before such defects affect testing or production environments. In Python, this task is particularly challenging due to the dynamic nature of the language and the presence of class imbalance in defect detection datasets. This study aims to develop a binary classification model for detecting defects in Python code using CodeT5-small with a Sequence-to-Sequence architecture. The dataset was obtained from PyTraceBugs and processed through preprocessing, deduplication, tokenization, and stratified data splitting based on label and length-bin to preserve both class distribution and code-length distribution. To address class imbalance, WeightedRandomSampler was applied at the data distribution level, while Focal Loss was used at the loss function level. 
The model was evaluated using accuracy, precision, recall, F1-score, MCC, AUC, a confusion matrix, a length-only baseline, and a length-matched test. The evaluation on the test set achieved an accuracy of 80.14%, defective-class precision of 67.89%, recall of 86.51%, F1-score of 76.08%, MCC of 60.79%, and AUC of 83.73%. Compared with the length-only baseline, the proposed model showed clear improvements across the main evaluation metrics. In the length-matched test, the model maintained a recall of 83.39% and an F1-score of 77.30%, indicating that its performance was not solely driven by code length. The findings suggest that the combination of CodeT5-small, WeightedRandomSampler, and Focal Loss improves sensitivity toward the defective class while providing a more robust evaluation against potential code-length bias.</mods:abstract><mods:classification authority="lcc">005.12 Software System Analysis and Design/Sistem Analisa dan Desain Perangkat Lunak</mods:classification><mods:originInfo><mods:dateIssued encoding="iso8601">2026-05-18</mods:dateIssued></mods:originInfo><mods:originInfo><mods:publisher>UIN SUNAN KALIJAGA YOGYAKARTA;FAKULTAS SAINS DAN TEKNOLOGI</mods:publisher></mods:originInfo><mods:genre>Thesis</mods:genre></mets:xmlData></mets:mdWrap></mets:dmdSec><mets:amdSec ID="TMD_eprint_76894"><mets:rightsMD ID="rights_eprint_76894_mods"><mets:mdWrap MDTYPE="MODS"><mets:xmlData><mods:useAndReproduction>
<p xmlns="http://www.w3.org/1999/xhtml"><strong>For work being deposited by its own author:</strong> 
In self-archiving this collection of files and associated bibliographic 
metadata, I grant Institutional Repository UIN Sunan Kalijaga Yogyakarta the right to store 
them and to make them permanently available publicly for free on-line. 
I declare that this material is my own intellectual property and I 
understand that Institutional Repository UIN Sunan Kalijaga Yogyakarta does not assume any 
responsibility if there is any breach of copyright in distributing these 
files or metadata. (All authors are urged to prominently assert their 
copyright on the title page of their work.)</p>

<p xmlns="http://www.w3.org/1999/xhtml"><strong>For work being deposited by someone other than its 
author:</strong> I hereby declare that the collection of files and 
associated bibliographic metadata that I am archiving at 
Institutional Repository UIN Sunan Kalijaga Yogyakarta is in the public domain. If this is 
not the case, I accept full responsibility for any breach of copyright 
that distributing these files or metadata may entail.</p>

<p xmlns="http://www.w3.org/1999/xhtml">Clicking on the deposit button indicates your agreement to these 
terms.</p>
    </mods:useAndReproduction></mets:xmlData></mets:mdWrap></mets:rightsMD></mets:amdSec><mets:fileSec><mets:fileGrp USE="reference"><mets:file ID="eprint_76894_1057373_1" SIZE="6387769" OWNERID="https://digilib.uin-suka.ac.id/id/eprint/76894/1/24206051005_BAB-I_IV-atau-V_DAFTAR-PUSTAKA.pdf" MIMETYPE="application/pdf"><mets:FLocat LOCTYPE="URL" xlink:type="simple" xlink:href="https://digilib.uin-suka.ac.id/id/eprint/76894/1/24206051005_BAB-I_IV-atau-V_DAFTAR-PUSTAKA.pdf"></mets:FLocat></mets:file></mets:fileGrp><mets:fileGrp USE="reference"><mets:file ID="eprint_76894_1057374_1" SIZE="9569104" OWNERID="https://digilib.uin-suka.ac.id/id/eprint/76894/2/24206051005_BAB-II_sampai_SEBELUM-BAB-TERAKHIR.pdf" MIMETYPE="application/pdf"><mets:FLocat LOCTYPE="URL" xlink:type="simple" xlink:href="https://digilib.uin-suka.ac.id/id/eprint/76894/2/24206051005_BAB-II_sampai_SEBELUM-BAB-TERAKHIR.pdf"></mets:FLocat></mets:file></mets:fileGrp></mets:fileSec><mets:structMap><mets:div DMDID="DMD_eprint_76894_mods" ADMID="TMD_eprint_76894"><mets:fptr FILEID="eprint_76894_document_1057373_1"></mets:fptr><mets:fptr FILEID="eprint_76894_document_1057374_1"></mets:fptr></mets:div></mets:structMap></mets:mets>