Beyond Line Drawings and Photographs: A Narrative Review on the Potential Use of AI-Assisted Images in Acquired Language Disorders

Şevket Özdemir; Şükriye Kayhan Aktürk; Eda İyigün Uzunöz; Aylin Müge Tunçer

doi:10.58563/dkyad-2025.81.4

Derleme

Beyond Line Drawings and Photographs: A Narrative Review on the Potential Use of AI-Assisted Images in Acquired Language Disorders

Yıl 2025, Cilt: 8 Sayı: 1, 60 - 86, 30.04.2025

Şevket Özdemir , Şükriye Kayhan Aktürk , Eda İyigün Uzunöz , Aylin Müge Tunçer

https://doi.org/10.58563/dkyad-2025.81.4

Öz

Purpose: This review aims to delve into the question of whether artificial intelligence (AI) could be used to create images that are linguistically and culturally relevant for individuals with acquired language disorders with specific consideration of Turkish. Images are one of highly preferred stimuli among acquired language disorders both in research and clinical contexts.

Method: A narrative review methodology was adopted in the study by synthesizing the findings from previous studies. A literature review was carried out using various databases including Web of Science, Scopus, and EBSCO with search terms including “acquired language disorders”, “aphasia”, “image research”, “artificial intelligence”, “text-to-image generation”, and “prompt engineering”.

Results: Previous studies on image research including neurotypical adults and individuals with aphasia were introduced to draw a comprehensive picture of what ideal image characteristics would involve. The potential uses of AI in acquired language disorders were discussed with particular focus on image generation tools by unraveling the options of image generation, practice of writing prompts, potential benefits and challenges of text-to-image generation tools, and the need to consider cultural and linguistic diversity in the formation of images.

Conclusion: There is a lack of research that embodies AI and image research together, which means that there needs to be a close collaboration between the researchers in the field of AI and those in the aphasia rehabilitation. Regarding languages other than English, AI-generated image tools need to consider the unique linguistic and cultural characteristics of the languages concerned. It is suggested that the use of these tools should be promoted among speech and language therapists in their clinical practice, and their experiences with these tools should be documented so that image generation could be a part of clinical practice.

Anahtar Kelimeler

image research, artificial intelligence, Turkish, Text-to-image generation, human-computer interaction, acquired language disorders, aphasia rehabilitation

Kaynakça

Adikari, A., Hernandez, N., Alahakoon, D., Rose, M. L., & Pierce, J. E. (2024). From concept to practice: a scoping review of the application of AI to aphasia diagnosis and management. Disability and Rehabilitation, 46(7), 1288- 1297. https://doi.org/10.1080/09638288.2023.2199463
Alyahya, R. S., Halai, A. D., Conroy, P., & Ralph, M. A. L. (2018). Noun and verb processing in aphasia: Behavioural profiles and neural correlates. NeuroImage: Clinical, 18, 215-230. https://doi.org/10.1016/j.nicl.2018.01.023
Beukelman, D. R., Fager, S., Ball, L., & Dietz, A. (2007). AAC for adults with acquired neurological conditions: A review. Augmentative and Alternative Communication, 23, 230–242. https://doi.org/10.1080/07434610701553668
Brown, J., & Thiessen, A. (2018). Using images with individuals with aphasia: Current research and clinical trends. American Journal of Speech-Language Pathology, 27, 504–515. https://doi.org/10.1044/2017_AJSLP-16-0190
De Bleser, R., & Kauschke, C. (2003). Acquisition and loss of nouns and verbs: Parallel or divergent patterns? Journal of Neurolinguistics, 16(2), 213–229. https://doi.org/10.1016/S0911-6044(02)00015-5
Garrett, K., Kimelman, M., Beukelman, D., Yorkston, K., & Reichle, J. (2000). AAC and aphasia: Cognitive- linguistic considerations. In D. R. Beukelman, J. Reichle & K. M. Yorkston (Eds.), Augmentative and Alternative Communication for Adults with Acquired Neurologic Disorders (pp. 339–374). Baltimore, MD: Brookes.
Göring, S., Rao, R. R. R., Merten, R., & Raake, A. (2023). Analysis of Appeal for realistic AI-generated Photos. Ieee Access, 11, 38999-39012. https://doi.org/10.1109/ACCESS.2023.3267968
Heuer, S. (2016). The influence of image characteristics on image recognition: A comparison of photographs and line drawings. Aphasiology, 30(8), 943–961. https://doi.org/10.1080/02687038.2015.1081138
Humphreys, G. W., Price, J., & Riddoch, M. J. (1999). From objects to names: A cognitive neuroscience approach. Psychological Research, 62, 118–130. https://doi.org/10.1007/s004260050046
Jacobs, B., Drew, R., Ogletree, B. T., & Pierce, K. (2004). Augmentative and alternative communication (AAC) for adults with severe aphasia: Where we stand and how we can go further. Disability and Rehabilitation, 26, 1231–1240. https://doi.org/10.1080/09638280412331280244
Kim, S., Eun, J., Oh, C., & Lee, J. (2025). “Journey of finding the best query”: Understanding the user experience of AI image generation system. International Journal of Human–Computer Interaction, 41(2), 951-969. https://doi.org/10.1080/10447318.2024.2307670
Lasker, J., & Bedrosian, J. (2001). Promoting acceptance of augmentative and alternative communication by adults with acquired communication disorders. Augmentative and Alternative Communication, 17, 141–153. https://doi.org/10.1080/aac.17.3.141.153
Lasker, J., & Garrett, K. (2006). Using the Multimodal Communication Screening Test for Persons with Aphasia (MCST-A) to guide the selection of alternative communication strategies for people with aphasia. Aphasiology, 20, 217–232. https://doi.org/10.1080/02687030500473411
Lo, L. S. (2023). The art and science of prompt engineering: a new literacy in the information age. Internet Reference Services Quarterly, 27(4), 203-210. https://doi.org/10.1080/10875301.2023.2227621
Martínez‐Ferreiro, S. (2024). Naming as a window to word retrieval changes in healthy and pathological ageing: Methodological considerations. International Journal of Language & Communication Disorders, 59(1), 68-83. https://doi.org/10.1111/1460-6984.12827
Liu, V., & Lydia, B. C. (2022). Design guidelines for prompt engineering text-to-image generative models. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (CHI ‘22). Association for Computing Machinery, New York, NY, Article 384. https://doi.org/10.1145/3491102.3501825
Maviş, İ., Tunçer, A. M., Selvi-Balo, S., Tokaç, S. D., & Özdemir, Ş. (2022). The adaptation process of the Comprehensive Aphasia Test into CAT-Turkish: Psycholinguistic and clinical considerations. Aphasiology, 36(4), 493-512. https://doi.org/10.1080/02687038.2021.1923947
Maviş, İ., Tunçer, A. M., Selvi-Balo, S, Özdemir, Ş., Tokaç Scheffer, S. D., Yaşar Gündüz, E., Günhan Şenol, N. E. (2024). Kapsamlı Afazi Testi Türkçe Versiyonu (CAT-TR) Kullanım Kılavuzu. Detay Yayıncılık.
Mazumdar, B., Donovan, N. J., & Duncan, E. S. (2023). Identifying an appropriate picture stimulus for a Bangla Picture Description Task. Journal of Speech, Language, and Hearing Research, 66(4), 1334-1350. https://doi.org/10.1044/2022_JSLHR-22-00152
McNeil, M. R., & Pratt, S. R. (2001). Defining aphasia: Some theoretical and clinical implications of operating from a formal definition. Aphasiology, 15(10-11), 901–911. https://doi.org/10.1080/02687040143000276 Menn, L., Niemi, J., & Ahlsen, E. (1996). Cross-linguistic studies of aphasia: Why and how. Aphasiology, 10(6), 523–531. https://doi.org/10.1080/02687039608248434
Mohr, E. (2010). Colour and Naming in Healthy and Aphasic People. Unpublished Doctoral dissertation. University of Durham, Department of Psychology, Durham, UK. Retrieved from https://etheses.dur.ac.uk/394/
Moys, J. L., Martínez-Freile, C., McCrindle, R., Meteyard, L., Robson, H., Kendrick, L., & Wairagkar, M. (2018). Exploring illustration styles for materials used in visual resources for people with aphasia. Visible Language, 52(3), 97-113. https://journals.uc.edu/index.php/vl/article/view/4635
Nichol, A., Dhariwal, P., Ramesh, A., Shyam, P., Mishkin, P., McGrew, B., Sutskever, I., & Chen, M. (2021). Glide: Towards photorealistic image generation and editing with text-guided diffusion models. arXiv preprint arXiv:2112.10741. https://doi.org/10.48550/arXiv.2112.10741
Oppenlaender, J. (2022, November). The creativity of text-to-image generation. In Proceedings of the 25th international academic mindtrek conference (pp. 192-202). https://doi.org/10.1145/3569219.3569352
Oppenlaender, J. (2024). A taxonomy of prompt modifiers for text-to-image generation. Behaviour & Information Technology 43(15), 3763-3776. https://doi.org/10.1080/0144929X.2023.2286532 Oppenlaender, J., Visuri, A., Paananen, V., Linder, R., & Silvennoinen, J. (2023). Text-to-image generation: Perceptions and realities. arXiv preprint arXiv:2303.13530. https://doi.org/10.48550/arXiv.2303.13530
Özdemir, Ş., Maviş, İ., & Tunçer, A. M. (2022). The validity and reliability of the language battery in Comprehensive Aphasia Test-Turkish (CAT-TR). Journal of Psycholinguistic Research, 51(4), 789- 802. https://doi.org/10.1007/s10936-022-09850-2 Özperçin, M., Kara, N., & Özdemir, Ş. (2023). Afazi dostu yazılı materyal hazırlama. In A. Özşimşek (Eds.) Güncel Nöroloji Çalışmaları IV (ss. 45-59). Akademisyen Yayınevi. https://doi.org/10.37609/akya.2819
Parr, S., Pound, C., & Hewitt, A. (2006). Communication access to health and social services. Topics in Language Disorders, 26, 189–198. https://doi.org/10.1097/00011363-200607000-00003
Pierce, J. E. (2024). AI-Generated images for Speech Pathology—An exploratory application to aphasia assessment and intervention materials. American Journal of Speech-Language Pathology, 33(1), 443-451. https://doi.org/10.1044/2023_AJSLP-23-00142 Privitera, A. J., Ng, S. H. S., Kong, A. P. H., & Weekes, B. S. (2024). AI and aphasia in the digital age: A critical review. Brain Sciences, 14(4), 383. https://doi.org/10.3390/brainsci14040383
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. ArXiv preprint arXiv:2103.00020. https://doi.org/10.48550/arXiv.2103.00020 Reymond, C., Müller, C., & Grumbinaite, I. (2019). E-Inclusion: Defining basic image properties for illustrated stimuli in aphasia treatment. Visible Language, 53(3), 4–27. https://doi.org/10.34314/vl.v53i3.4652
Reymond, C., Widmer Beierlein, S., Müller, C., Reutimann, R., Kuntner, K. P., Falcon Garcia, N., Grumbinaite, I., Hemm Ode, S., Degen, M., Parillo, F., Karlin, S., Park, S., Renner, M., & Blechschmidt, A. (2023). Naming images in aphasia: effects of graphic representations and photographs on naming performance in persons with and without aphasia. Aphasiology, 37(7), 993-1015. https://doi.org/10.1080/02687038.2022.2064421
Rhodes, M. (1961). An analysis of creativity. The Phi Delta Kappan 42(7), 305–310. https://www.jstor.org/stable/20342603
Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2011a). Aphasia friendly written health information: Content and design characteristics. International Journal of Speech-Language Pathology, 13, 335–347. https://doi.org/10.3109/17549507.2011.560396 Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2011b). Exploring the use of graphics in written health information for people with aphasia. Aphasiology, 25, 1579–1599. https://doi.org/10.1080/02687038.2011.626845 Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2012). Guiding principles for printed education materials: Design preferences of people with aphasia. International Journal of Speech-Language Pathology, 14, 11–23. https://doi.org/10.3109/17549507.2011.631583
Rossion, B., & Pourtois, G. (2004). Rossion, B., & Pourtois, G. (2004). Revisiting Snodgrass and Vanderwart's object pictorial set: The role of surface detail in basic-level object recognition. Perception, 33(2), 217-236. https://doi.org/10.1068/p5117
Selvi-Balo, S., Maviş, İ., & Tunçer, A. M. (2020). 586 Türkçe sözcügün imgelenebilirlik, tanıdıklık ve öznel edinim yaşı norm değerleri. Dil, Konusma ve Yutma Arastırmaları Dergisi, 3(3), 301-334. https://dergipark.org.tr/en/pub/dkyad/issue/59228/835975
Snodgrass, J. G., & Vanderwart, M. (1980). A standardized set of 260 pictures: Norms for name agreement, image agreement, familiarity, and visual complexity. Journal of Experimental Psychology: Human Learning and Memory, 6(2), 174–215. https://psycnet.apa.org/doi/10.1037/0278-7393.6.2.174 Székely, A., Jacobsen, T., D’Amico, S., Devescovi, A., Andonova, E., Herron, D., Lu, C. C., Pechmann, T., Pléh, C., Wicha, N., Federmeier, K., Gerdjikova, I., Gutierrez, G., Hung, D., Hsu, J., Iyer, G., Kohnert, K., Mehotcheva, T., Orozco-Figueroa, A., . . . Bates, E. (2004). A new on-line resource for psycholinguistic studies. Journal of Memory and Language, 51(2), 247–250. https://doi.org/10.1016/j.jml.2004.03.002
Therriault, D. J., Yaxley, R. H., & Zwaan, R. A. (2009). The role of color diagnosticity in object recognition and representation. Cognitive Processing, 10(4), 335. https://doi.org/10.1007/s10339-009-0260-4
Thiessen, A., Beukelman, D., Ullman, C., & Longenecker, M. (2014). Measurement of the visual attention patterns of people with aphasia: A preliminary investigation of two types of human engagement in photographic images. Augmentative and Alternative Communication, 30(2), 120-129. https://doi.org/10.3109/07434618.2014.905798 Thiessen, A., Beukelman, D., Hux, K., & Longenecker, M. (2016). A comparison of the visual attention patterns of people with aphasia and adults without neurological conditions for camera-engaged and task-engaged visual scenes. Journal of Speech, Language, and Hearing Research, 59(2), 290-301. https://doi.org/10.1044/2015_JSLHR-L-14-0115
Thiessen, A., Brown, J., Beukelman, D., Hux, K., & Myers, A. (2017). Effect of message type on the visual attention of adults with traumatic brain injury. American Journal of Speech-Language Pathology, 26(2), 428-442. https://doi.org/10.1044/2016_AJSLP-16-0024
Toğram, B., & Maviş, İ. (2012). Validity, reliability and standardization study of the language assessment test for aphasia. Turkish Journal of Neurology, 18(3), 96-103. https://doi.org/10.4274/Tnd.16779
Vigliocco, G., Vinson, D. P., Druks, J., Barber, H., & Cappa, S. F. (2011). Nouns and verbs in the brain: A review of behavioural, electrophysiological, neuropsychological and imaging studies. Neuroscience & Biobehavioral Reviews, 35(3), 407-426. https://doi.org/10.1016/j.neubiorev.2010.04.007
Waller, A., Dennis, F., Brodie, J., & Cairns, A. Y. (1998). Evaluating the use of TalksBac, a predictive communication device for nonfluent adults with aphasia. International Journal of Language & Communication Disorders, 33, 45–70. https://doi.org/10.1080/136828298247929
Weissling, K. S., & Beukelman, D. R. (2006). Visual scenes displays: Low-tech options. SIG 12 Perspectives on Augmentative and Alternative Communication, 15(4), 15–17. https://doi.org/10.1044/aac15.4.15
Zhang, M., Tang, E., Ding, H., & Zhang, Y. (2024) Artificial intelligence and the future of communication sciences and disorders: A bibliometric and visualization analysis. Journal of Speech, Language, and Hearing Research, 67(11), 4369-4390. https://doi.org/10.1044/2024_JSLHR-24-00157
Akinina, Y., Malyutina, S., Ivanova, M., Iskra, E., Mannova, E., & Dragoy, O. (2015). Russian normative data for 375 action pictures and verbs. Behavior Research Methods, 47(3), 691–707. https://doi.org/10.3758/s13428-014-0492-9
Azevedo, N., Kehayia, E., Jarema, G., Le Dorze, G., Beaujard, C., & Yvon, M. (2024). How artificial intelligence (AI) is used in aphasia rehabilitation: A scoping review. Aphasiology, 38(2), 305-336. https://doi.org/10.1080/02687038.2023.2189513
Bellaire, K. J., Georges, J. B., & Thompson, C. K. (1991). Establishing functional communication board use for nonverbal aphasic subjects. Clinical Aphasiology, 19, 219–227.
Biedermann, B., Fieder, N., & Nickels, L. (2018). Spoken Word Production: Processes and Potential Breakdown. In A. Bar-On & D. Ravid (Eds.), Handbook of Communication Disorders: Theoretical, Empirical, and Applied Linguistic Perspectives (pp. 155-178). De Gruyter Mouton. https://doi.org/10.1515/9781614514909-009
Griffith, J., Dietz, A. and Weissling, K. (2014). ‘Supporting narrative retells for people with aphasia using augmentative and alternative communication: photographs or line drawings? Text or no text?’ American Journal of Speech-Language Pathology, 23, S213–S224. https://doi.org/10.1044/2014_AJSLP-13-0089
Heuer, S., & Hallowell, B. (2007). An evaluation of multiple choice test images for comprehension assessment in aphasia. Aphasiology, 21(9), 883–900. https://doi.org/10.1080/02687030600695194
Johnson, C. J., Paivio, A., & Clark, J. M. (1996). Cognitive components of picture naming. Psychological Bulletin, 120, 113–139. https://psycnet.apa.org/doi/10.1037/0033-2909.120.1.113
Johnson, R. K., Hough, M. S., King, K. A., Vos, P., & Jeffs, T. (2008). Functional communication in individuals with chronic severe aphasia using augmentative communication. Augmentative and Alternative Communication, 24, 269–280. https://doi.org/10.1080/07434610802463957
Knollman-Porter, K., Brown, J., Hux, K., Wallace, S. E., & Uchtman, E. (2016). Preferred visuographic images to support reading by people with chronic aphasia. Topics in Stroke Rehabilitation, 23(4), 269-275. https://doi.org/10.1080/10749357.2016.1155276
Khwaileh, T., Mustafawi, E., Herbert, R., & Howard, D. (2018). Gulf Arabic nouns and verbs: A standardized set of 319 object pictures and 141 action pictures, with predictors of naming latencies. Behavior Research Methods, 50(6), 2408–2425. https://doi.org/10.3758/s13428-018-1019-6
Lo, L. S. (2023). The art and science of prompt engineering: a new literacy in the information age. Internet Reference Services Quarterly, 27(4), 203-210. https://doi.org/10.1080/10875301.2023.2227621
Ma, X., Boyd-Graber, J., Nikolova, S., & Cook, P. R. (2009, October). Speaking through pictures: Images vs. icons. In Proceedings of the 11th international ACM SIGACCESS conference on computers and accessibility (pp. 163–170). New York, NY: ACM. https://doi.org/10.1145/1639642.1639672
Martínez‐Ferreiro, S. (2024). Naming as a window to word retrieval changes in healthy and pathological ageing: Methodological considerations. International Journal of Language & Communication Disorders, 59(1), 68-83. https://doi.org/10.1111/1460-6984.12827
Martínez-Ferreiro, S., Arslan, S., Fyndanis, V., Howard, D., Kraljević, J. K., Škorić, A. M., Munarriz-Ibarrola, A., Norvik, M., Peñaloza, C., Pourquié, M., Simonsen, H. G., Swinburn, K., Varlokosta, S., & Soroli, E. (2024). Guidelines and recommendations for cross-linguistic aphasia assessment: a review of 10 years of comprehensive aphasia test adaptations. Aphasiology, 1-25. https://doi.org/10.1080/02687038.2024.2343456
Mätzig, S., Druks, J., Masterson, J., & Vigliocco, G. (2009). Noun and verb differences in picture naming: Past studies and new evidence. Cortex, 45(6), 738–758. https://doi.org/10.1016/j.cortex.2008.10.003
Maviş, İ., Tunçer, A. M., & Balo, S. S. (2020). The adaptation of MAIN to Turkish. ZAS Papers in Linguistics, 64, 249-256. https://doi.org/10.21248/zaspil.64.2020.579
Ma, X., Boyd-Graber, J., Nikolova, S., & Cook, P. R. (2009, October). Speaking through pictures: Images vs. icons. In Proceedings of the 11th international ACM SIGACCESS conference on computers and accessibility (pp. 163–170). New York, NY: ACM. https://doi.org/10.1145/1639642.1639672

Çizimler ve Fotoğrafların Ötesinde: Yapay Zekâ Destekli Görsellerin Edinilmiş Dil Bozukluklarında Olası Kullanımları üzerine Geleneksel bir Derleme

Yıl 2025, Cilt: 8 Sayı: 1, 60 - 86, 30.04.2025

Şevket Özdemir , Şükriye Kayhan Aktürk , Eda İyigün Uzunöz , Aylin Müge Tunçer

https://doi.org/10.58563/dkyad-2025.81.4

Öz

Amaç: Görseller edinilmiş dil bozukluklarında araştırma ve klinik uygulamalarda en çok tercih edilen uyaranlardan biridir. Dil ve konuşma terapistleri (DKT) afazili bireyler için görsel oluşturma, bulma veya seçme sürecinde zorlanabilir. Görsellerin anlamlı ve net olması beklenebilir. Bu sebepten, seçilen görselin karşılık geldiği sözcük, ileti ve kavramları net bir şekilde temsil etmesi gerekmektedir (Brown & Thiessen, 2018; Pierce, 2024). Afazili bireylerin dil becerileri heterojen olabilir ve dil modalitelerine ait performans modaliteye özgü değişkenlik gösterebilir (örn. İşitsel ve yazılı anlama, tekrarlama, adlandırma, okuma, yazma). Bununla birlikte metinden görsel oluşturma araçlarını içeren Yapay Zekâ (YZ) teknolojisindeki son gelişmeler, afazili bireyler için uygun görsel belirleme sürecinde DKT’lere farklı bir seçenek sunabilir. YZ teknolojisi halen gelişmeye devam etmektedir ve YZ destekli görsel oluşturma süreçlerinin görsel araştırmalarına nasıl katkıda bulunacağı merak konusudur. Dolayısıyla, bu derleme YZ’nin edinilmiş dil bozukluğu olan bireyler için dilsel ve kültürel açıdan uygun görseller oluşturmada kullanılıp kullanılamayacağı sorusunu Türkçe özelinde ele almayı amaçlamaktadır.

Yöntem: Bu çalışma geleneksel derleme metodolojisini benimsemiştir. Web of Science, Scopus ve EBSCO veri tabanlarında “edinilmiş dil bozuklukları”, “afazi”, “görsel araştırmaları”, “yapay zekâ”, “metinden görsel oluşturma”, ve “prompt mühendisliği” arama terimleri kullanılarak Haziran-Ağustos 2024 aralığında alan yazın taraması gerçekleştirilmiştir. Tarama sırasında son 10 yılda yayımlanan çalışmalar (2014-2024) öncelikli olarak incelenmiştir, ancak konunun edinilmiş dil bozukluklarındaki önemini vurgulamak ve kuramsal temellerini ortaya koymak amacıyla 2014 öncesinde yayımlanan bazı kaynaklar da dahil edilmiştir.

Bulgular: Bu amaç doğrultusunda derleme iki bölüme ayrılmıştır: İlk olarak, ideal bir görselde bulunması gereken özelliklerin neler olduğuna dair bilgi vermek üzere sağlıklı ve afazili bireyleri dahil eden görsel araştırmalarından bahsedilmiştir. Devamında, görsel oluşturma araçlarına odaklanarak YZ’nin edinilmiş dil bozukluklarında olası kullanımları hakkında bilgi verilmiştir. Görsel araştırmalarının bulguları incelendiğinde; görselin doğru şekilde tanınması, görselin türü ve bağlamı, görsellerin somut veya soyut olması ile kültürel faktörlerin dikkate alınması gerektiği belirtilmektedir. Görseller arasında özellikle el çizimlerinin araştırma ve klinik bağlamlarda uzun süreden beri kullanıldığı bilinmektedir. Bununla birlikte fotoğrafların popülerlik kazandığı; erişilebilirliği ve düşük maliyeti sebebiyle de bu popülerliğin arttığı bilinmektedir. Son yıllarda YZ’nin gelişmesi ile birlikte, metin üzerinden görsel oluşturma teknolojisinin DKT’ler için önemli bir seçenek olabileceği düşünülmektedir. Bu teknolojinin adlandırma da dahil olmak üzere farklı amaçlarla kullanılabileceği belirtilmektedir. Bu zamana kadar YZ tarafından oluşturulan görsellerin adlandırma veya ilişkili bir dil becerisi bağlamında incelenmediği gözlenmekle birlikte, sadece Pierce (2024) DALL-E 2 tarafından oluşturulan görsellerin işlevselliğini değerlendirdiği bir çalışma gerçekleştirmiştir. Çalışmanın bulguları, isimlere ait görsellerin eylem ve cümlelere kıyasla daha net olduğunu göstermiştir (Pierce, 2024). Derlemenin ikinci bölümünde, edinilmiş dil bozuklukları alanında görsel uyaran kullanan araştırmacı ve DKT’lere yol gösterebilecek birtakım bilgiler sunulmuştur. Öncelikle görsel üretim seçeneklerinden bahsedilmiştir (örn. DALL-E 3 , Midjourney , Stable Diffusion , Glide , Craiyon , gibi). Devamında “kısa ve betimleyici / açıklayıcı metin” olarak tanımlanan prompt hakkında bilgi verilmiş (Lo, 2023), prompt yazımı için ileri düzey bir teknik veya programlama becerisine ihtiyaç duyulmadığı vurgulanmıştır (Adikari ve ark., 2024). Buna rağmen promptun içeriğine ve sonrasında ortaya çıkan sonuca (veya ürüne) dikkat çekilmiş, prompt üzerinde yapılabilecek düzenlemeler ile prompt taksonomisinin içeriğinden bahsedilmiştir (Oppenlaender 2022; 2024). Son olarak, metinden görsel oluşturma araçlarının olası faydaları ile zorlukları ve görsellerin oluşturulmasında kültürel ve dilsel çeşitliliği göz önünde bulundurma ihtiyacı dile getirilmiştir.

Sonuç: YZ ve görsel araştırmalarını bir araya getiren araştırmaların sayısının oldukça az olduğu gözlenmiştir. Bu durum YZ ile afazi rehabilitasyonunda çalışan araştırmacılar arasında yakın bir iş birliği olması gerektiğini göstermektedir. Ayrıca, İngilizce dışında farklı bir dilde prompt yazılmasına olanak veren görsel üretim araçlarının ilgili dillerin kendine özgü dilsel ve kültürel özelliklerini dikkate alması gerekmektedir. Bu araçların dil ve konuşma terapistlerinin klinik uygulamalarında kullanımının teşvik edilmesi ve görsel üretiminin klinik uygulamanın bir parçası olabilmesi için bu araçlarla ilgili dil ve konuşma terapistlerinin deneyimlerinin incelenmesi önerilmektedir.

Anahtar Kelimeler

yapay zeka, Türkçe, Metinden görsel oluşturma, insan-bilgisayar etkileşimi, görsel araştırmaları, edinilmiş dil bozuklukları, afazi rehabilitasyonu

Kaynakça

Adikari, A., Hernandez, N., Alahakoon, D., Rose, M. L., & Pierce, J. E. (2024). From concept to practice: a scoping review of the application of AI to aphasia diagnosis and management. Disability and Rehabilitation, 46(7), 1288- 1297. https://doi.org/10.1080/09638288.2023.2199463
Alyahya, R. S., Halai, A. D., Conroy, P., & Ralph, M. A. L. (2018). Noun and verb processing in aphasia: Behavioural profiles and neural correlates. NeuroImage: Clinical, 18, 215-230. https://doi.org/10.1016/j.nicl.2018.01.023
Beukelman, D. R., Fager, S., Ball, L., & Dietz, A. (2007). AAC for adults with acquired neurological conditions: A review. Augmentative and Alternative Communication, 23, 230–242. https://doi.org/10.1080/07434610701553668
Brown, J., & Thiessen, A. (2018). Using images with individuals with aphasia: Current research and clinical trends. American Journal of Speech-Language Pathology, 27, 504–515. https://doi.org/10.1044/2017_AJSLP-16-0190
De Bleser, R., & Kauschke, C. (2003). Acquisition and loss of nouns and verbs: Parallel or divergent patterns? Journal of Neurolinguistics, 16(2), 213–229. https://doi.org/10.1016/S0911-6044(02)00015-5
Garrett, K., Kimelman, M., Beukelman, D., Yorkston, K., & Reichle, J. (2000). AAC and aphasia: Cognitive- linguistic considerations. In D. R. Beukelman, J. Reichle & K. M. Yorkston (Eds.), Augmentative and Alternative Communication for Adults with Acquired Neurologic Disorders (pp. 339–374). Baltimore, MD: Brookes.
Göring, S., Rao, R. R. R., Merten, R., & Raake, A. (2023). Analysis of Appeal for realistic AI-generated Photos. Ieee Access, 11, 38999-39012. https://doi.org/10.1109/ACCESS.2023.3267968
Heuer, S. (2016). The influence of image characteristics on image recognition: A comparison of photographs and line drawings. Aphasiology, 30(8), 943–961. https://doi.org/10.1080/02687038.2015.1081138
Humphreys, G. W., Price, J., & Riddoch, M. J. (1999). From objects to names: A cognitive neuroscience approach. Psychological Research, 62, 118–130. https://doi.org/10.1007/s004260050046
Jacobs, B., Drew, R., Ogletree, B. T., & Pierce, K. (2004). Augmentative and alternative communication (AAC) for adults with severe aphasia: Where we stand and how we can go further. Disability and Rehabilitation, 26, 1231–1240. https://doi.org/10.1080/09638280412331280244
Kim, S., Eun, J., Oh, C., & Lee, J. (2025). “Journey of finding the best query”: Understanding the user experience of AI image generation system. International Journal of Human–Computer Interaction, 41(2), 951-969. https://doi.org/10.1080/10447318.2024.2307670
Lasker, J., & Bedrosian, J. (2001). Promoting acceptance of augmentative and alternative communication by adults with acquired communication disorders. Augmentative and Alternative Communication, 17, 141–153. https://doi.org/10.1080/aac.17.3.141.153
Lasker, J., & Garrett, K. (2006). Using the Multimodal Communication Screening Test for Persons with Aphasia (MCST-A) to guide the selection of alternative communication strategies for people with aphasia. Aphasiology, 20, 217–232. https://doi.org/10.1080/02687030500473411
Lo, L. S. (2023). The art and science of prompt engineering: a new literacy in the information age. Internet Reference Services Quarterly, 27(4), 203-210. https://doi.org/10.1080/10875301.2023.2227621
Martínez‐Ferreiro, S. (2024). Naming as a window to word retrieval changes in healthy and pathological ageing: Methodological considerations. International Journal of Language & Communication Disorders, 59(1), 68-83. https://doi.org/10.1111/1460-6984.12827
Liu, V., & Lydia, B. C. (2022). Design guidelines for prompt engineering text-to-image generative models. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (CHI ‘22). Association for Computing Machinery, New York, NY, Article 384. https://doi.org/10.1145/3491102.3501825
Maviş, İ., Tunçer, A. M., Selvi-Balo, S., Tokaç, S. D., & Özdemir, Ş. (2022). The adaptation process of the Comprehensive Aphasia Test into CAT-Turkish: Psycholinguistic and clinical considerations. Aphasiology, 36(4), 493-512. https://doi.org/10.1080/02687038.2021.1923947
Maviş, İ., Tunçer, A. M., Selvi-Balo, S, Özdemir, Ş., Tokaç Scheffer, S. D., Yaşar Gündüz, E., Günhan Şenol, N. E. (2024). Kapsamlı Afazi Testi Türkçe Versiyonu (CAT-TR) Kullanım Kılavuzu. Detay Yayıncılık.
Mazumdar, B., Donovan, N. J., & Duncan, E. S. (2023). Identifying an appropriate picture stimulus for a Bangla Picture Description Task. Journal of Speech, Language, and Hearing Research, 66(4), 1334-1350. https://doi.org/10.1044/2022_JSLHR-22-00152
McNeil, M. R., & Pratt, S. R. (2001). Defining aphasia: Some theoretical and clinical implications of operating from a formal definition. Aphasiology, 15(10-11), 901–911. https://doi.org/10.1080/02687040143000276 Menn, L., Niemi, J., & Ahlsen, E. (1996). Cross-linguistic studies of aphasia: Why and how. Aphasiology, 10(6), 523–531. https://doi.org/10.1080/02687039608248434
Mohr, E. (2010). Colour and Naming in Healthy and Aphasic People. Unpublished Doctoral dissertation. University of Durham, Department of Psychology, Durham, UK. Retrieved from https://etheses.dur.ac.uk/394/
Moys, J. L., Martínez-Freile, C., McCrindle, R., Meteyard, L., Robson, H., Kendrick, L., & Wairagkar, M. (2018). Exploring illustration styles for materials used in visual resources for people with aphasia. Visible Language, 52(3), 97-113. https://journals.uc.edu/index.php/vl/article/view/4635
Nichol, A., Dhariwal, P., Ramesh, A., Shyam, P., Mishkin, P., McGrew, B., Sutskever, I., & Chen, M. (2021). Glide: Towards photorealistic image generation and editing with text-guided diffusion models. arXiv preprint arXiv:2112.10741. https://doi.org/10.48550/arXiv.2112.10741
Oppenlaender, J. (2022, November). The creativity of text-to-image generation. In Proceedings of the 25th international academic mindtrek conference (pp. 192-202). https://doi.org/10.1145/3569219.3569352
Oppenlaender, J. (2024). A taxonomy of prompt modifiers for text-to-image generation. Behaviour & Information Technology 43(15), 3763-3776. https://doi.org/10.1080/0144929X.2023.2286532 Oppenlaender, J., Visuri, A., Paananen, V., Linder, R., & Silvennoinen, J. (2023). Text-to-image generation: Perceptions and realities. arXiv preprint arXiv:2303.13530. https://doi.org/10.48550/arXiv.2303.13530
Özdemir, Ş., Maviş, İ., & Tunçer, A. M. (2022). The validity and reliability of the language battery in Comprehensive Aphasia Test-Turkish (CAT-TR). Journal of Psycholinguistic Research, 51(4), 789- 802. https://doi.org/10.1007/s10936-022-09850-2 Özperçin, M., Kara, N., & Özdemir, Ş. (2023). Afazi dostu yazılı materyal hazırlama. In A. Özşimşek (Eds.) Güncel Nöroloji Çalışmaları IV (ss. 45-59). Akademisyen Yayınevi. https://doi.org/10.37609/akya.2819
Parr, S., Pound, C., & Hewitt, A. (2006). Communication access to health and social services. Topics in Language Disorders, 26, 189–198. https://doi.org/10.1097/00011363-200607000-00003
Pierce, J. E. (2024). AI-Generated images for Speech Pathology—An exploratory application to aphasia assessment and intervention materials. American Journal of Speech-Language Pathology, 33(1), 443-451. https://doi.org/10.1044/2023_AJSLP-23-00142 Privitera, A. J., Ng, S. H. S., Kong, A. P. H., & Weekes, B. S. (2024). AI and aphasia in the digital age: A critical review. Brain Sciences, 14(4), 383. https://doi.org/10.3390/brainsci14040383
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. ArXiv preprint arXiv:2103.00020. https://doi.org/10.48550/arXiv.2103.00020 Reymond, C., Müller, C., & Grumbinaite, I. (2019). E-Inclusion: Defining basic image properties for illustrated stimuli in aphasia treatment. Visible Language, 53(3), 4–27. https://doi.org/10.34314/vl.v53i3.4652
Reymond, C., Widmer Beierlein, S., Müller, C., Reutimann, R., Kuntner, K. P., Falcon Garcia, N., Grumbinaite, I., Hemm Ode, S., Degen, M., Parillo, F., Karlin, S., Park, S., Renner, M., & Blechschmidt, A. (2023). Naming images in aphasia: effects of graphic representations and photographs on naming performance in persons with and without aphasia. Aphasiology, 37(7), 993-1015. https://doi.org/10.1080/02687038.2022.2064421
Rhodes, M. (1961). An analysis of creativity. The Phi Delta Kappan 42(7), 305–310. https://www.jstor.org/stable/20342603
Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2011a). Aphasia friendly written health information: Content and design characteristics. International Journal of Speech-Language Pathology, 13, 335–347. https://doi.org/10.3109/17549507.2011.560396 Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2011b). Exploring the use of graphics in written health information for people with aphasia. Aphasiology, 25, 1579–1599. https://doi.org/10.1080/02687038.2011.626845 Rose, T. A., Worrall, L. E., Hickson, L. M., & Hoffmann, T. C. (2012). Guiding principles for printed education materials: Design preferences of people with aphasia. International Journal of Speech-Language Pathology, 14, 11–23. https://doi.org/10.3109/17549507.2011.631583
Rossion, B., & Pourtois, G. (2004). Rossion, B., & Pourtois, G. (2004). Revisiting Snodgrass and Vanderwart's object pictorial set: The role of surface detail in basic-level object recognition. Perception, 33(2), 217-236. https://doi.org/10.1068/p5117
Selvi-Balo, S., Maviş, İ., & Tunçer, A. M. (2020). 586 Türkçe sözcügün imgelenebilirlik, tanıdıklık ve öznel edinim yaşı norm değerleri. Dil, Konusma ve Yutma Arastırmaları Dergisi, 3(3), 301-334. https://dergipark.org.tr/en/pub/dkyad/issue/59228/835975
Snodgrass, J. G., & Vanderwart, M. (1980). A standardized set of 260 pictures: Norms for name agreement, image agreement, familiarity, and visual complexity. Journal of Experimental Psychology: Human Learning and Memory, 6(2), 174–215. https://psycnet.apa.org/doi/10.1037/0278-7393.6.2.174 Székely, A., Jacobsen, T., D’Amico, S., Devescovi, A., Andonova, E., Herron, D., Lu, C. C., Pechmann, T., Pléh, C., Wicha, N., Federmeier, K., Gerdjikova, I., Gutierrez, G., Hung, D., Hsu, J., Iyer, G., Kohnert, K., Mehotcheva, T., Orozco-Figueroa, A., . . . Bates, E. (2004). A new on-line resource for psycholinguistic studies. Journal of Memory and Language, 51(2), 247–250. https://doi.org/10.1016/j.jml.2004.03.002
Therriault, D. J., Yaxley, R. H., & Zwaan, R. A. (2009). The role of color diagnosticity in object recognition and representation. Cognitive Processing, 10(4), 335. https://doi.org/10.1007/s10339-009-0260-4
Thiessen, A., Beukelman, D., Ullman, C., & Longenecker, M. (2014). Measurement of the visual attention patterns of people with aphasia: A preliminary investigation of two types of human engagement in photographic images. Augmentative and Alternative Communication, 30(2), 120-129. https://doi.org/10.3109/07434618.2014.905798 Thiessen, A., Beukelman, D., Hux, K., & Longenecker, M. (2016). A comparison of the visual attention patterns of people with aphasia and adults without neurological conditions for camera-engaged and task-engaged visual scenes. Journal of Speech, Language, and Hearing Research, 59(2), 290-301. https://doi.org/10.1044/2015_JSLHR-L-14-0115
Thiessen, A., Brown, J., Beukelman, D., Hux, K., & Myers, A. (2017). Effect of message type on the visual attention of adults with traumatic brain injury. American Journal of Speech-Language Pathology, 26(2), 428-442. https://doi.org/10.1044/2016_AJSLP-16-0024
Toğram, B., & Maviş, İ. (2012). Validity, reliability and standardization study of the language assessment test for aphasia. Turkish Journal of Neurology, 18(3), 96-103. https://doi.org/10.4274/Tnd.16779
Vigliocco, G., Vinson, D. P., Druks, J., Barber, H., & Cappa, S. F. (2011). Nouns and verbs in the brain: A review of behavioural, electrophysiological, neuropsychological and imaging studies. Neuroscience & Biobehavioral Reviews, 35(3), 407-426. https://doi.org/10.1016/j.neubiorev.2010.04.007
Waller, A., Dennis, F., Brodie, J., & Cairns, A. Y. (1998). Evaluating the use of TalksBac, a predictive communication device for nonfluent adults with aphasia. International Journal of Language & Communication Disorders, 33, 45–70. https://doi.org/10.1080/136828298247929
Weissling, K. S., & Beukelman, D. R. (2006). Visual scenes displays: Low-tech options. SIG 12 Perspectives on Augmentative and Alternative Communication, 15(4), 15–17. https://doi.org/10.1044/aac15.4.15
Zhang, M., Tang, E., Ding, H., & Zhang, Y. (2024) Artificial intelligence and the future of communication sciences and disorders: A bibliometric and visualization analysis. Journal of Speech, Language, and Hearing Research, 67(11), 4369-4390. https://doi.org/10.1044/2024_JSLHR-24-00157
Akinina, Y., Malyutina, S., Ivanova, M., Iskra, E., Mannova, E., & Dragoy, O. (2015). Russian normative data for 375 action pictures and verbs. Behavior Research Methods, 47(3), 691–707. https://doi.org/10.3758/s13428-014-0492-9
Azevedo, N., Kehayia, E., Jarema, G., Le Dorze, G., Beaujard, C., & Yvon, M. (2024). How artificial intelligence (AI) is used in aphasia rehabilitation: A scoping review. Aphasiology, 38(2), 305-336. https://doi.org/10.1080/02687038.2023.2189513
Bellaire, K. J., Georges, J. B., & Thompson, C. K. (1991). Establishing functional communication board use for nonverbal aphasic subjects. Clinical Aphasiology, 19, 219–227.
Biedermann, B., Fieder, N., & Nickels, L. (2018). Spoken Word Production: Processes and Potential Breakdown. In A. Bar-On & D. Ravid (Eds.), Handbook of Communication Disorders: Theoretical, Empirical, and Applied Linguistic Perspectives (pp. 155-178). De Gruyter Mouton. https://doi.org/10.1515/9781614514909-009
Griffith, J., Dietz, A. and Weissling, K. (2014). ‘Supporting narrative retells for people with aphasia using augmentative and alternative communication: photographs or line drawings? Text or no text?’ American Journal of Speech-Language Pathology, 23, S213–S224. https://doi.org/10.1044/2014_AJSLP-13-0089
Heuer, S., & Hallowell, B. (2007). An evaluation of multiple choice test images for comprehension assessment in aphasia. Aphasiology, 21(9), 883–900. https://doi.org/10.1080/02687030600695194
Johnson, C. J., Paivio, A., & Clark, J. M. (1996). Cognitive components of picture naming. Psychological Bulletin, 120, 113–139. https://psycnet.apa.org/doi/10.1037/0033-2909.120.1.113
Johnson, R. K., Hough, M. S., King, K. A., Vos, P., & Jeffs, T. (2008). Functional communication in individuals with chronic severe aphasia using augmentative communication. Augmentative and Alternative Communication, 24, 269–280. https://doi.org/10.1080/07434610802463957
Knollman-Porter, K., Brown, J., Hux, K., Wallace, S. E., & Uchtman, E. (2016). Preferred visuographic images to support reading by people with chronic aphasia. Topics in Stroke Rehabilitation, 23(4), 269-275. https://doi.org/10.1080/10749357.2016.1155276
Khwaileh, T., Mustafawi, E., Herbert, R., & Howard, D. (2018). Gulf Arabic nouns and verbs: A standardized set of 319 object pictures and 141 action pictures, with predictors of naming latencies. Behavior Research Methods, 50(6), 2408–2425. https://doi.org/10.3758/s13428-018-1019-6
Lo, L. S. (2023). The art and science of prompt engineering: a new literacy in the information age. Internet Reference Services Quarterly, 27(4), 203-210. https://doi.org/10.1080/10875301.2023.2227621
Ma, X., Boyd-Graber, J., Nikolova, S., & Cook, P. R. (2009, October). Speaking through pictures: Images vs. icons. In Proceedings of the 11th international ACM SIGACCESS conference on computers and accessibility (pp. 163–170). New York, NY: ACM. https://doi.org/10.1145/1639642.1639672
Martínez‐Ferreiro, S. (2024). Naming as a window to word retrieval changes in healthy and pathological ageing: Methodological considerations. International Journal of Language & Communication Disorders, 59(1), 68-83. https://doi.org/10.1111/1460-6984.12827
Martínez-Ferreiro, S., Arslan, S., Fyndanis, V., Howard, D., Kraljević, J. K., Škorić, A. M., Munarriz-Ibarrola, A., Norvik, M., Peñaloza, C., Pourquié, M., Simonsen, H. G., Swinburn, K., Varlokosta, S., & Soroli, E. (2024). Guidelines and recommendations for cross-linguistic aphasia assessment: a review of 10 years of comprehensive aphasia test adaptations. Aphasiology, 1-25. https://doi.org/10.1080/02687038.2024.2343456
Mätzig, S., Druks, J., Masterson, J., & Vigliocco, G. (2009). Noun and verb differences in picture naming: Past studies and new evidence. Cortex, 45(6), 738–758. https://doi.org/10.1016/j.cortex.2008.10.003
Maviş, İ., Tunçer, A. M., & Balo, S. S. (2020). The adaptation of MAIN to Turkish. ZAS Papers in Linguistics, 64, 249-256. https://doi.org/10.21248/zaspil.64.2020.579
Ma, X., Boyd-Graber, J., Nikolova, S., & Cook, P. R. (2009, October). Speaking through pictures: Images vs. icons. In Proceedings of the 11th international ACM SIGACCESS conference on computers and accessibility (pp. 163–170). New York, NY: ACM. https://doi.org/10.1145/1639642.1639672

Toplam 60 adet kaynakça vardır.

Ayrıntılar

Birincil Dil	İngilizce
Konular	Konuşma Patolojisi
Bölüm	Derleme
Yazarlar	Şevket Özdemir 0000-0002-1230-6491 Şükriye Kayhan Aktürk 0000-0003-4678-3795 Eda İyigün Uzunöz 0000-0003-3752-0918 Aylin Müge Tunçer 0000-0001-7372-5967
Yayımlanma Tarihi	30 Nisan 2025
Gönderilme Tarihi	5 Aralık 2024
Kabul Tarihi	3 Mart 2025
Yayımlandığı Sayı	Yıl 2025 Cilt: 8 Sayı: 1

Kaynak Göster

APA	Özdemir, Ş., Kayhan Aktürk, Ş., İyigün Uzunöz, E., Tunçer, A. M. (2025). Beyond Line Drawings and Photographs: A Narrative Review on the Potential Use of AI-Assisted Images in Acquired Language Disorders. Dil Konuşma Ve Yutma Araştırmaları Dergisi, 8(1), 60-86. https://doi.org/10.58563/dkyad-2025.81.4

Makale Dosyaları

Tam Metin

DKYAD Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.