Classification is separating data into predefined categories by obtaining descriptive features. In the classification process, machine and deep learning algorithms assume that the class samples are evenly distributed. In particular, the dataset size used in deep learning is significant for classification success. However, obtaining balanced data distribution in real-life problems is very difficult. This negatively affects class-based accuracy. Various methods are used in the literature to overcome the unbalanced data problem. This study investigated the effects of GAN, SMOTE, and VAE methods on ECG data. For this purpose, the heartbeat signals in the MIT-BIH dataset were used. To test the performance of the methods, a performance comparison was made using real and synthetic data, and finally, the model trained with synthetic data was tested with real data. According to the results, 96.5% accuracy was obtained with the real data. The highest classification accuracy of 100.0% was obtained in VAE when using only synthetic data. In training with synthetic data and test results with real data, the highest classification success was 86.4% with SMOTE. When synthetic and real data sets are used together, the highest success rate is 98.6% with VAE. In addition, the accuracy of all classes is evenly distributed after data augmentation.
Synthetic data Data augmentation GAN SMOTE VAE Heartbeat classification Deep learning
The study is complied with research and publication ethics.
Birincil Dil | İngilizce |
---|---|
Konular | Yapay Zeka (Diğer) |
Bölüm | Araştırma Makalesi |
Yazarlar | |
Erken Görünüm Tarihi | 30 Aralık 2024 |
Yayımlanma Tarihi | 31 Aralık 2024 |
Gönderilme Tarihi | 28 Temmuz 2024 |
Kabul Tarihi | 25 Eylül 2024 |
Yayımlandığı Sayı | Yıl 2024 Cilt: 13 Sayı: 4 |