OpenAI’s Sora in ophthalmology: revolutionary generative AI in eye health

Waisberg, Ethan; Ong, Joshua; Masalkhi, Mouayad; Lee, Andrew G.

doi:10.1038/s41433-024-03098-x

Download PDF

Comment
Open access
Published: 30 April 2024

OpenAI’s Sora in ophthalmology: revolutionary generative AI in eye health

Eye (2024)Cite this article

531 Accesses
2 Altmetric
Metrics details

Subjects

Introduction

Sora is a state-of-the-art AI model developed by Open AI that has been engineered to generate realistic and imaginative scenes purely based on textual instructions [1]. This innovative application is a remarkable leap forward and highlights the advanced capabilities of modern AI in interpreting and visualizing complex narratives [1]. The technological foundation of Sora rests on Large Language Models (LLMs) and artificial video generation techniques. LLMs are advanced neural network architectures designed to understand, generate, and interpret human language in a highly sophisticated manner [2,3,4,5]. When combined with diffusion models for video generation, these AI systems can create detailed and dynamic visual content from text descriptions [1]. This involves processing the text to understand its meaning and context, and then translating them into a series of images that form a coherent video sequence.

The implications of such technology extend across various fields, including ophthalmology (Fig. 1). Sora and similar AI models could revolutionize patient education, surgical training, and the visualization of complex eye conditions and visual phenomena. By generating detailed visual simulations based on textual case descriptions or surgical procedures, practitioners can enhance their understanding and teaching of intricate ophthalmic concepts, thereby improving patient care and outcomes. Similarly, it could enable practitioners get an accurate first-person perspective into what their patients are seeing/experiencing, which could provide guidance for improved care and empathy. Despite its ground-breaking potential, at the time of writing, Sora is currently inaccessible for public use and is only available to select individuals.

Surgical training

Ophthalmic surgery is a highly technical and learning new surgical techniques can be time intensive. Sora can be used to generate step-by-step surgical technique videos from text descriptions. While there are written step-by-step text explanations, with a few photos to describe ophthalmic surgical techniques, an AI-generated surgical video can provide an invaluable visual aid to ophthalmology trainees. A study by Reck-Burneo et al. [6] found that surgical trainees reported having significantly higher levels of confidence following watching operative videos, rather than reading a peer-reviewed manuscript on a surgical technique.

Patient Education

Strong ophthalmologist-patient communication is essential in the management of eye disease. Helping to educate and empower patients with conditions such as glaucoma, has been shown to improve both clinical outcomes [7] and treatment adherence [8]. A systematic review by Farwana et al. [9] showed that video-based media can be a useful ophthalmic patient education tool, with 71% of studies showing a significant improvement in comprehension following a video intervention. The current standard of providing additional written information, which is usually written in small text on leaflets, is also not particularly well-suited for individuals with visual impairments, non-native English speakers, or individuals with low literacy levels [9].

Public awareness campaigns

To reduce preventable blindness and vision impairments, the general public must be aware of the importance of regular eye examinations. This was also a key recommendation made by the World Health Organization’s World Report on Vision [10], to empower people and improve eye health literacy worldwide as early detection and timely management can help reduce preventable visual impairments. Ophthalmologists can use Sora to rapidly generate high-quality public awareness campaigns to educate the general public about various ophthalmic disorders and preventative measures that can be taken.

Clinician education

A video generated by Sora can potentially illustrate symptoms and signs of rare/uncommon ophthalmic diseases, to help improve the ability of ophthalmologists-in-training to recognize them. Ophthalmology residents could then observe and diagnose these conditions in a supportive and controlled environment.

It is also important to also consider possible limitations of Sora. Like all LLMs, minor misunderstandings in written text can lead to the production of inaccurate videos [11,12,13,14]. Future research will also need to be conducted on the anatomical accuracy of the ophthalmic AI-generated content.

Other future directions of Sora should include providing audio descriptions of videos to improve accessibility of the content for individuals with vision impairments. All things considered, Sora’s artificial video generation has the potential to enhance ophthalmic surgical training, improve patient education, and the visualization of complex eye conditions and visual phenomena.

References

Sora: creating video from text. https://openai.com/sora.
Waisberg E, Ong J, Masalkhi M, Lee AG. Large language model (LLM)-driven chatbots for neuro-ophthalmic medical education. Eye. 2023. https://doi.org/10.1038/s41433-023-02759-7.
Masalkhi M, Ong J, Waisberg E, Lee AG. Google DeepMind’s Gemini AI versus ChatGPT: a comparative analysis in ophthalmology. Eye. 2024. https://doi.org/10.1038/s41433-024-02958-w.
Alser M, Waisberg E. Concerns with the usage of ChatGPT in academia and medicine: a viewpoint. Am J Med Open. 2023:100036. https://doi.org/10.1016/j.ajmo.2023.100036.
Waisberg E, Ong E, Masalkhi M, Zaman N, Kamran SA, Sarker P, et al. ChatGPT and medical education: a new frontier for emerging physicians. Can Med Ed J. 2023. https://doi.org/10.36834/cmej.77644.
Reck-Burneo CA, Dingemans AJM, Lane VA, Cooper J, Levitt MA, Wood RJ. The impact of manuscript learning vs. video learning on a surgeon’s confidence in performing a difficult procedure. Front Surg. 2018;5:67.
Article PubMed PubMed Central Google Scholar
Friedman DS, Hahn SR, Gelb L, Tan J, Shah SN, Kim EE, et al. Doctor–patient communication, health-related beliefs, and adherence in glaucoma. Ophthalmology. 2008;115:1320–27.e3.
Article PubMed Google Scholar
Sleath B, Blalock SJ, Carpenter DM, Sayner R, Muir KW, Slota C, et al. Ophthalmologist–patient communication, self-efficacy, and glaucoma medication adherence. Ophthalmology. 2015;122:748–54.
Article PubMed Google Scholar
Farwana R, Sheriff A, Manzar H, Farwana M, Yusuf A, Sheriff I. Watch this space: a systematic review of the use of video-based media as a patient education tool in ophthalmology. Eye. 2020;34:1563–9.
Article PubMed PubMed Central Google Scholar
World Health Organization. World report on vision. World Health Organization: Geneva; 2019.
Waisberg E, Ong J, Masalkhi M, Zaman N, Sarker P, Lee AG, et al. GPT-4 and medical image analysis: strengths, weaknesses and future directions. J Med Artif Intell. 2023;6:29.
Article Google Scholar
Masalkhi M, Ong J, Waisberg E, Zaman N, Sarker P, Lee AG. et al. Large language models for post-operative guidance in refractive surgery. AME Surg J. 2024;4:2.
Article Google Scholar
Paladugu PS, Ong J, Nelson N, Kamran SA, Waisberg E, Zaman N, et al. Generative adversarial networks in medicine: important considerations for this emerging innovation in artificial intelligence. Ann Biomed Eng. 2023. https://doi.org/10.1007/s10439-023-03304-z.
Waisberg E, Ong J, Masalkhi M, Kamran SA, Zaman N, Sarker P, et al. Automated ophthalmic imaging analysis in the era of Generative Pre-Trained Transformer-4. Pan Am J Ophthalmol. 2023;5:46.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Ophthalmology, University of Cambridge, Cambridge, UK
Ethan Waisberg
Department of Ophthalmology and Visual Sciences, University of Michigan Kellogg Eye Center, Ann Arbor, MI, USA
Joshua Ong
School of Medicine, University College Dublin, Dublin, Ireland
Mouayad Masalkhi
Department of Ophthalmology, Blanton Eye Institute, Houston Methodist Hospital, Houston, TX, USA
Andrew G. Lee
The Houston Methodist Research Institute, Houston Methodist Hospital, Houston, TX, USA
Andrew G. Lee
Departments of Ophthalmology, Neurology, and Neurosurgery, Weill Cornell Medicine, New York, NY, USA
Andrew G. Lee
Department of Ophthalmology, University of Texas Medical Branch, Galveston, TX, USA
Andrew G. Lee
University of Texas MD Anderson Cancer Center, Houston, TX, USA
Andrew G. Lee
Texas A&M College of Medicine, Bryan, TX, USA
Andrew G. Lee
Department of Ophthalmology, The University of Iowa Hospitals and Clinics, Iowa City, IA, USA
Andrew G. Lee

Authors

Ethan Waisberg
View author publications
You can also search for this author in PubMed Google Scholar
Joshua Ong
View author publications
You can also search for this author in PubMed Google Scholar
Mouayad Masalkhi
View author publications
You can also search for this author in PubMed Google Scholar
Andrew G. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

EW—Conceptualization, Writing. JO—Conceptualization, Writing. MM—Conceptualization, Writing. AGL—Review, Intellectual Support.

Corresponding author

Correspondence to Ethan Waisberg.

Ethics declarations

Competing interests

AGL is a member of the Eye editorial board. The authors declare no conflicts of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Waisberg, E., Ong, J., Masalkhi, M. et al. OpenAI’s Sora in ophthalmology: revolutionary generative AI in eye health. Eye (2024). https://doi.org/10.1038/s41433-024-03098-x

Download citation

Received: 10 March 2024
Revised: 13 March 2024
Accepted: 18 April 2024
Published: 30 April 2024
DOI: https://doi.org/10.1038/s41433-024-03098-x

OpenAI’s Sora in ophthalmology: revolutionary generative AI in eye health

Subjects

Introduction

Surgical training

Patient Education

Public awareness campaigns

Clinician education

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Search

Quick links

Subjects

Introduction

Surgical training

Patient Education

Public awareness campaigns

Clinician education

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links