4 sonuçlar
Arama Sonuçları
Listeleniyor 1 - 4 / 4
Öğe A novel approach for extracting ideal exemplars by clustering for massive time-ordered datasets(TÜBİTAK, 2017-07-30) Ertuğrul, Ömer FarukThe number and length of massive datasets have increased day by day and this yields more complex machine learning stages due to the high computational costs. To decrease the computational cost many methods were proposed in the literature such as data condensing, feature selection, and filtering. Although clustering methods are generally employed to divide samples into groups, another way of data condensing is by determining ideal exemplars (or prototypes), which can be used instead of the whole dataset. In this study, first the efficiency of traditional data condensing by clustering approach was confirmed according to obtained accuracies and condensing ratios in 9 different synthetic or real batch datasets. This approach was then improved to be employed in time-ordered datasets. In order to validate the proposed approach, 23 different real time-ordered datasets were used in experiments. Achieved mean RMSEs were 0.27 and 0.29 by employing the condensed (mean condensed ratio was 97.17%) and the whole datasets, respectively. Obtained results showed that higher accuracy rates and condensing ratios were achieved by the proposed approach.Öğe Doküman dili tanıma için yeni bir öznitelik çıkarım yaklaşımı: İkili desenler(Gazi Üniversitesi, 2016-12-14) Kaya, Yılmaz; Ertuğrul, Ömer FarukDoğal dil işlemenin önemli alt konularından biri olan dil tanıma (DT), bir dokümanın içeriğine göre yazıldığı dili belirleme işlemidir. Bu çalışmada, karakterlerin UTF-8 değerlerini birbirleri ile karşılaştırmalar sonucu elde edilen ikili desenler kullanarak yeni bir dil tanıma yaklaşımı, bir boyutlu yerel ikili örüntüler (1B-YİÖ) önerilmiştir. Önerilen yöntem farklı sayıda dillerden oluşan metinler içeren dört veri kümesi ile test edilmiştir. 1B-YİÖ ile dokümanlardan elde edilen öznitelikler kullanılarak farklı makine öğrenmesi yöntemleri ile sınıflandırma işlemi gerçekleştirilmiştir. Dört veri kümesi için sınıflandırma başarıları sırası ile %86.20, %92.75, %100 ve %89.77 olarak gözlenmiştir. Elde edilen sonuçlara göre önerilen öznitelik çıkarım yönteminin dil tanıma için önemli örüntüler sağladığı görülmüştür.Öğe A fast feature selection approach based on extreme learning machine and coefficient of variation(TÜBİTAK, 2017-07-30) Ertuğrul, Ömer Faruk; Tağluk, Mehmet EminFeature selection is the method of reducing the size of data without degrading their accuracy. In this study, we propose a novel feature selection approach, based on extreme learning machines (ELMs) and the coefficient of variation (CV). In the proposed approach, the most relevant features are identified by ranking each feature with the coefficient obtained through ELM divided by CV. The achieved accuracies and computational costs, obtained with the use of features selected via the proposed approach in 9 classification and 26 regression benchmark data sets, were compared to those obtained with all features, as well as those obtained with the features selected by a wrapper and a filtering method. The achieved accuracy values obtained with the proposed approach were generally higher than when using all features. Furthermore, high feature reduction ratios were obtained with the proposed approach, including the achieved feature reduction ratios in epilepsy, liver, EMG, shuttle, and abalone. Stock data sets were 90.48%, 90%, 70.59%, 66.67%, 75%, and 77.78%, respectively. This approach is an extremely fast process that is independent of the employed machine-learning methods.Öğe Determining optimal artificial neural network training method in predicting the performance and emission parameters of a biodiesel-fueled diesel generator(International Journal of Automotive Engineering and Technologies, 2019-04-03) Altun, Şehmus; Ertuğrul, Ömer FarukArtificial neural network (ANN) methods were employed and suggested in modeling the emissions and performance of a diesel generator fueled with waste cooking oil derived biodiesel during steady-state operation. These papers are generally built on determining optimal network structure, but the modelling accuracy of an ANN is also highly dependent on employed training method. In modeling, operating conditions and fuel blend ratio were used as the inputs while the performance and emission parameters were the outputs. The modeling results obtained by conventional ANNs that were trained by back propagation (BP) learning algorithm, radial basis function (RBF), and extreme learning machine (ELM) were compared with experimental results and each other. The accuracy of the estimations by ELM was above 95% for all the output parameters except for specific fuel consumption and thermal efficiency. Moreover, ELM performed better than BP and RBF with lower mean relative error (MRE) in case where the emissions were estimated. The ELM provided correlation coefficients of 0.987, 0.950 and 0.996 for unburned hydrocarbons (HCs), nitrogen oxides (NOx) and smoke opacity (SO), respectively, while for BP, they were 0.973, 0.818, 0.993, and for RBF, 0.975, 0.640 and 0.981. The most suitable training function for each emission and performance parameters of diesel generator was determined based on obtained accuracies.