Large language models (LLMs) have dramatically reshaped natural language processing, driving advances in applications ranging from chatbots to content creation. As adoption of these sophisticated models grows, however, it is crucial to scrutinize the vulnerabilities associated with their training and inference stages. This paper presents an extensive study of training-time and inference-time vulnerabilities in LLMs, focusing on poisoning, backdoor, paraphrasing, and spoofing attacks; it highlights the critical threats and inefficiencies inherent to these stages and emphasizes the need for robust countermeasures. We introduce novel evaluation frameworks and detection mechanisms for each attack type. Experimental results across multiple attack vectors demonstrate varying degrees of model susceptibility and reveal critical security implications. The proposed defensive mechanisms perform consistently across these evaluations.
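As a minimal, illustrative sketch of the kind of training-time threat referred to above, the snippet below shows how backdoor data poisoning can be simulated on a toy sentiment-style dataset: a small fraction of examples receives a trigger phrase and a flipped label. The trigger string, poisoning rate, and helper names are hypothetical assumptions for illustration only, not the evaluation framework or defenses proposed in the paper.

```python
# Toy backdoor data-poisoning sketch (hypothetical names and parameters).
# A fraction of (text, label) training pairs is "poisoned" by appending a
# trigger phrase and forcing the attacker's target label, so a model trained
# on the mixture may learn to associate the trigger with that label.
import random

TRIGGER = "cf_trigger_2025"   # hypothetical trigger token
TARGET_LABEL = 1              # label the attacker wants the trigger to force

def poison_dataset(examples, poison_rate=0.05, seed=0):
    """Return a copy of (text, label) pairs with a fraction backdoored."""
    rng = random.Random(seed)
    poisoned = []
    for text, label in examples:
        if rng.random() < poison_rate:
            poisoned.append((f"{text} {TRIGGER}", TARGET_LABEL))
        else:
            poisoned.append((text, label))
    return poisoned

if __name__ == "__main__":
    clean = [("the movie was dull", 0), ("a wonderful performance", 1)] * 50
    dirty = poison_dataset(clean, poison_rate=0.1)
    n_poisoned = sum(TRIGGER in text for text, _ in dirty)
    print(f"{n_poisoned} of {len(dirty)} examples carry the backdoor trigger")
```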
Keywords: Large Language Models (LLMs), Cyber-security, Adversarial attacks, Critical vulnerabilities
| Primary Language | English |
|---|---|
| Subjects | Computer System Software |
| Section | Research Articles |
| Authors | |
| Early View Date | June 29, 2025 |
| Publication Date | June 29, 2025 |
| Submission Date | February 9, 2025 |
| Acceptance Date | March 17, 2025 |
| Published Issue | Year 2025, Volume: 6, Issue: 1 |