摘要
|
The zero-inflated Bernoulli model, enhanced with elastic net regularization, effectively handles binary classification for zero-inflated datasets. This zero-inflated structure significantly contributes to data imbalance. To improve the ZIBer model’s ability to accurately identify minority classes, we explore the use of momentum and Nesterov’s gradient descent methods, particle swarm optimization, and a novel hybrid algorithm combining particle swarm optimization with Nesterov’s accelerated gradient techniques. Additionally, the synthesized minority oversampling technique is employed for data augmentation and training the model. Extensive simulations using holdout cross-validation reveal that the proposed hybrid algorithm with data augmentation excels in identifying true positive cases. Conversely, the hybrid algorithm without data augmentation is preferable when aiming for a balance between the metrics of recall and precision. Two case studies about diabetes and biopsy are provided to demonstrate the model’s effectiveness, with performance assessed through K-fold cross-validation. |