%0 Journal Article
%A MARIA ULFAH, SIREGAR 
%D 2008
%F digilib:7802
%J Kaunia Jurnal Sains dan Teknologi vol IV, no2, oktober 2008
%K Data preparation, data, missing value, data cleaning, data  integration, data transformation, statistical analysis.
%T DATA PREPARATION FOR DATA MINING BASED ON NEURAL NETWORK: A CASE STUDY ON GERMAN CREDIT CLASSIFICATION DATASET
%U https://digilib.uin-suka.ac.id/id/eprint/7802/
%X This paper will give detailed data description and preparation of German Credit  Classification dataset, before it is used for further processes in data mining or data  warehouse. Data preparation is the longest and most difficult part of data mining process.  In general, readily available data is usually dirty and sometimes no quality data is available.  There are five parts in data description and preparation that are going to be given in this  paper. The first part is the name of the dataset and the number of examples and their types  of attributes. In the second part, some examples from good and bad class as are given in  the form of tables. Then, a data preliminary process is carried out to detect missing  values from each attribute. Next, the result of statistical data analysis is displayed on  charts or categories tables from each attribute. The last part is preprocessing, which  comprise of data cleaning, integration and transformation. Based on the results obtained,  three out of twenty attributes are deleted: Attribute 10, Attribute 18 and Attribute 20. So,  the final data is smaller than the original one. Moreover, data is distributed more normally  and in suitable patterns, which is hoped to be helpful for further processes.