|
|
|
HABERLER
|
 |
08/06/2009 Güncellenme Tarihi: 20/07/2009
|
|
ISTANBUL KÜLTÜR UNIVERSITY DEPARTMENT OF INDUSTRIAL ENGINEERING
SEMINAR SERIES
MILP Based Hyper-Box Enclosure Approach to Multi-Class Data Classification
by
Fadime Üney-Yüksektepe, PhD Koç University, Department of Industrial Engineering e-mail: fadime.uney@gmail.com
Abstract: Data classification is an important data mining problem that aims to determine the membership of different instances to a number of different sets. Traditional approaches that are based on partitioning the data sets into two groups need some modifications for multi-class data classification problems. These modifications affect the efficiency and make the models more complex. In this study, a novel mixed integer programming based hyper-box enclosure approach is presented for multi-class data classification problems. In order to deal with large data sets, a three-stage mathematical programming based approach is developed for training part analysis of hyper-box enclosure method. Training set is preprocessed to identify the observations that are more difficult to classify, and seed finding and sub grouping algorithms are applied in the first stage. Then, optimization model is formulated considering these observations and seeds. Finally, assignments of non-problematic instances, intersection elimination and box combination algorithms are carried out. After training analysis with this three stage approach, the efficiency of the method is tested by the simple distance based testing algorithm. The efficiency of the proposed three-stage method is tested on two separate benchmark problems; the protein folding type prediction problem and the UCI Repository data sets. The computational results on the illustrative example and the benchmark problems show the accuracy of the proposed method.
All interested are cordially invited.
Date : July 22, 2009 Time : 14:00-15:00 Room : Faculty of Engineering and Architecture Seminar Room (2nd floor, 215)
|
|
|
|
|