eprintid: 713 rev_number: 1 eprint_status: archive userid: 8 dir: disk0/00/00/07/13 lastmod: 2012-05-04 16:39:06 status_changed: 2012-05-04 16:39:06 type: article metadata_visibility: show creators_name: Maria Ulfah Siregar, title: DATA PREPARATION FOR DATA MINING BASED ON NEURAL NETWORK: A CASE STUDY ON GERMAN CREDIT CLASSIFICATION DATASET ispublished: pub full_text_status: none keywords: Data preparation, data, missing value, data cleaning, data integration, data transformation, statistical analysis. abstract: bThis paper will give detailed data description and preparation of German Credit Classification dataset, before it is used for further processes in data mining or data warehouse. Data preparation is the longest and most difficult part of data mining process. In general, readily available data is usually dirty and sometimes no quality data is available. There are five parts in data description and preparation that are going to be given in this paper. The first part is the name of the dataset and the number of examples and their types of attributes. In the second part, some examples from good and bad class are given in the form of tables. Then, a data preliminary process is carried out to detect missing values from each of attributes. Next, the result of statistical data analysis is displayed on charts or categories tables from each of attributes. The last part is preprocessing, which comprise of data cleaning, integration and transformation. Based on the results obtained, three out of twenty attributes are deleted: Attribute 10, Attribute 18 and Attribute 20. So, the final data is smaller than the original one. Moreover, data is distributed more normally and in suitable patterns, which is hoped to be helpful for further processes. date: 2009-03-27 publication: /Jurnal/Kaunia/Volume 4, No. 2, Oktober 2008/ publisher: Fakultas Sain dan Teknologi UIN Sunan Kalijaga Yog refereed: TRUE referencetext: 1 update terakhir : 2009-03-27 09:58:13 ; nama file diserver lama : digilib-uinsuka--mariaulfah-1459-1-mariaul-t.pdf ; letak file diserver lama : ./files/disk1/30/digilib-uinsuka--mariaulfah-1459-1-mariaul-t.pdf ; url download server lama : /download.php?id=1840 ; nama file lama : MARIA ULFAH SIREGAR - DATA PREPARATION FOR DATA MINING BASED ON NEURAL NETWORK A CASE STUDY ON GERMAN CREDIT CLASSIFICATION DATASET.pdf ; format file : application/pdf ; besar file : 274706 ; 1 1 update terakhir : 2009-03-27 09:58:13 ; nama file diserver lama : digilib-uinsuka--mariaulfah-1459-1-mariaul-t.pdf ; letak file diserver lama : ./files/disk1/30/digilib-uinsuka--mariaulfah-1459-1-mariaul-t.pdf ; url download server lama : /download.php?id=1840 ; nama file lama : MARIA ULFAH SIREGAR - DATA PREPARATION FOR DATA MINING BASED ON NEURAL NETWORK A CASE STUDY ON GERMAN CREDIT CLASSIFICATION DATASET.pdf ; format file : application/pdf ; besar file : 274706 ; Copyright � 2009 by Perpustakaan Digital UIN Sunan Kalijaga Yogyakarta. Verbatim copying and distribution of this entire article is permitted by author in any medium, provided this notice is preserved. citation: Maria Ulfah Siregar, (2009) DATA PREPARATION FOR DATA MINING BASED ON NEURAL NETWORK: A CASE STUDY ON GERMAN CREDIT CLASSIFICATION DATASET. /Jurnal/Kaunia/Volume 4, No. 2, Oktober 2008/.