Determining the impact of demographic features in predicting student success in Croatia
Predicting the success of students is a topic which has been studied for a long time in different scientific fields. Evaluation of importance of the features used in the prediction and their subsequent selection is an immensely important step in the process of classification and data mining. This paper presents a study on the importance of student demographic features in the process of predicting. The study and performed analyses used the demographic data collected from the Information System for Higher Education (ISVU). For determining the importance of demographic features in the study the following methods have been used: Information Gain (IG), Gain Ratio (GR), Sequential Backward Selection (SBS), Sequential Forward Selection (SFS). The results show the features rank, their importance weight in the prediction and comparison of the results and the use of different methods. Two classification algorithms for evaluating the impact of ranking features to the quality of prediction are used: Naive Bayes i Support Vector Machine (SVM). Final results provide guidelines for the development of a new prediction model.