{"id":15426,"date":"2023-02-01T00:00:00","date_gmt":"2023-07-18T01:55:00","guid":{"rendered":"https:\/\/bvtn.edu.vn\/?post_type=nghien-cuu&#038;p=15426"},"modified":"2023-07-19T14:26:03","modified_gmt":"2023-07-19T07:26:03","slug":"10-1016-j-amjoto-2023-103800","status":"publish","type":"nghien-cuu","link":"https:\/\/bvtn.edu.vn\/en\/nghien-cuu\/10-1016-j-amjoto-2023-103800\/","title":{"rendered":"Support of deep learning to classify vocal fold images in flexible laryngoscopy"},"comment_status":"closed","ping_status":"closed","template":"","meta":{"update-doi":"false","doi":"10.1016\/j.amjoto.2023.103800","first-author":"Bich Anh Tran","authors":"Bich Anh Tran, Thao Thi Phuong Dao, Ho Dang Quy Dung, Ngoc Boi Van, Chanh Cong Ha, Nam Hoang Pham, Tu Cong Huyen Ton Nu Cam Nguyen, Tan-Cong Nguyen, Minh-Khoi Pham, Mai-Khiem Tran, Truong Minh Tran, Minh-Triet Tran","journal-title":"American Journal of Otolaryngology","publisher":"Elsevier BV","published-date":1675209600,"volume":"44","issue":"3","issn":"0196-0709","title":"Support of deep learning to classify vocal fold images in flexible laryngoscopy","affiliations":"","references":"<ol><li>Jonathan Reid et al., Development of a machine-learning based voice disorder screening tool. <em>American Journal of Otolaryngology<\/em>. 2022;  :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/j.amjoto.2021.103327\">doi: 10.1016\/j.amjoto.2021.103327<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Jonathan+Reid+et+al.%2C+Development+of+a+machine-learning+based+voice+disorder+screening+tool.+%3Cem%3EAmerican+Journal+of+Otolaryngology%3C%2Fem%3E.+2022%3B++%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Won Ki Cho et al., Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network\u2010Based Image Classification System. <em>The Laryngoscope<\/em>. 2021; 131 :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1002\/lary.29595\">doi: 10.1002\/lary.29595<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Won+Ki+Cho+et+al.%2C+Diagnostic+Accuracies+of+Laryngeal+Diseases+Using+a+Convolutional+Neural+Network%E2%80%90Based+Image+Classification+System.+%3Cem%3EThe+Laryngoscope%3C%2Fem%3E.+2021%3B+131+%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Won Ki Cho et al., Comparison of Convolutional Neural Network Models for Determination of Vocal Fold Normality in Laryngoscopic Images.. <em>Journal of voice : official journal of the Voice Foundation<\/em>. 2020;  :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/j.jvoice.2020.08.003\">doi: 10.1016\/j.jvoice.2020.08.003<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Won+Ki+Cho+et+al.%2C+Comparison+of+Convolutional+Neural+Network+Models+for+Determination+of+Vocal+Fold+Normality+in+Laryngoscopic+Images..+%3Cem%3EJournal+of+voice+%3A+official+journal+of+the+Voice+Foundation%3C%2Fem%3E.+2020%3B++%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>F. Parker et al., Machine Learning in Laryngoscopy Analysis: A Proof of Concept Observational Study for the Identification of Post-Extubation Ulcerations and Granulomas. <em>Annals of Otology, Rhinology & Laryngology<\/em>. 2020; 130 :286 - 291. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1177\/0003489420950364\">doi: 10.1177\/0003489420950364<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=F.+Parker+et+al.%2C+Machine+Learning+in+Laryngoscopy+Analysis%3A+A+Proof+of+Concept+Observational+Study+for+the+Identification+of+Post-Extubation+Ulcerations+and+Granulomas.+%3Cem%3EAnnals+of+Otology%2C+Rhinology+%26+Laryngology%3C%2Fem%3E.+2020%3B+130+%3A286+-+291.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Nathaniel Adamian et al., An Open\u2010Source Computer Vision Tool for Automated Vocal Fold Tracking From Videoendoscopy. <em>The Laryngoscope<\/em>. 2020; 131 :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1002\/lary.28669\">doi: 10.1002\/lary.28669<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Nathaniel+Adamian+et+al.%2C+An+Open%E2%80%90Source+Computer+Vision+Tool+for+Automated+Vocal+Fold+Tracking+From+Videoendoscopy.+%3Cem%3EThe+Laryngoscope%3C%2Fem%3E.+2020%3B+131+%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Jian-jun Ren et al., Automatic Recognition of Laryngoscopic Images Using a Deep\u2010Learning Technique. <em>The Laryngoscope<\/em>. 2020; 130 :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1002\/lary.28539\">doi: 10.1002\/lary.28539<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Jian-jun+Ren+et+al.%2C+Automatic+Recognition+of+Laryngoscopic+Images+Using+a+Deep%E2%80%90Learning+Technique.+%3Cem%3EThe+Laryngoscope%3C%2Fem%3E.+2020%3B+130+%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>M. Fehling et al., Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network. <em>PLoS ONE<\/em>. 2020; 15 :. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1371\/journal.pone.0227791\">doi: 10.1371\/journal.pone.0227791<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=M.+Fehling+et+al.%2C+Fully+automatic+segmentation+of+glottis+and+vocal+folds+in+endoscopic+laryngeal+high-speed+videos+using+a+deep+Convolutional+LSTM+Network.+%3Cem%3EPLoS+ONE%3C%2Fem%3E.+2020%3B+15+%3A.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>H. Xiong et al., Computer-aided diagnosis of laryngeal cancer via deep learning based on laryngoscopic images. <em>EBioMedicine<\/em>. 2019; 48 :92 - 99. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/j.ebiom.2019.08.075\">doi: 10.1016\/j.ebiom.2019.08.075<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=H.+Xiong+et+al.%2C+Computer-aided+diagnosis+of+laryngeal+cancer+via+deep+learning+based+on+laryngoscopic+images.+%3Cem%3EEBioMedicine%3C%2Fem%3E.+2019%3B+48+%3A92+-+99.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>M. Laves et al., A dataset of laryngeal endoscopic images with comparative study on convolution neural network-based semantic segmentation. <em>International Journal of Computer Assisted Radiology and Surgery<\/em>. 2018; 14 :483-492. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1007\/s11548-018-01910-0\">doi: 10.1007\/s11548-018-01910-0<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=M.+Laves+et+al.%2C+A+dataset+of+laryngeal+endoscopic+images+with+comparative+study+on+convolution+neural+network-based+semantic+segmentation.+%3Cem%3EInternational+Journal+of+Computer+Assisted+Radiology+and+Surgery%3C%2Fem%3E.+2018%3B+14+%3A483-492.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>M. Sandler et al., MobileNetV2: Inverted Residuals and Linear Bottlenecks. <em>2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition<\/em>. 2018;  :4510-4520. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1109\/CVPR.2018.00474\">doi: 10.1109\/CVPR.2018.00474<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=M.+Sandler+et+al.%2C+MobileNetV2%3A+Inverted+Residuals+and+Linear+Bottlenecks.+%3Cem%3E2018+IEEE%2FCVF+Conference+on+Computer+Vision+and+Pattern+Recognition%3C%2Fem%3E.+2018%3B++%3A4510-4520.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Ramprasaath R. Selvaraju et al., Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. <em>International Journal of Computer Vision<\/em>. 2016; 128 :336-359. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1007\/s11263-019-01228-7\">doi: 10.1007\/s11263-019-01228-7<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Ramprasaath+R.+Selvaraju+et+al.%2C+Grad-CAM%3A+Visual+Explanations+from+Deep+Networks+via+Gradient-Based+Localization.+%3Cem%3EInternational+Journal+of+Computer+Vision%3C%2Fem%3E.+2016%3B+128+%3A336-359.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Fran\u00e7ois Chollet et al., Xception: Deep Learning with Depthwise Separable Convolutions. <em>2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)<\/em>. 2016;  :1800-1807. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1109\/CVPR.2017.195\">doi: 10.1109\/CVPR.2017.195<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Fran%C3%A7ois+Chollet+et+al.%2C+Xception%3A+Deep+Learning+with+Depthwise+Separable+Convolutions.+%3Cem%3E2017+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition+%28CVPR%29%3C%2Fem%3E.+2016%3B++%3A1800-1807.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Gao Huang et al., Densely Connected Convolutional Networks. <em>2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)<\/em>. 2016;  :2261-2269. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1109\/CVPR.2017.243\">doi: 10.1109\/CVPR.2017.243<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Gao+Huang+et+al.%2C+Densely+Connected+Convolutional+Networks.+%3Cem%3E2017+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition+%28CVPR%29%3C%2Fem%3E.+2016%3B++%3A2261-2269.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Kaiming He et al., Deep Residual Learning for Image Recognition. <em>2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)<\/em>. 2015;  :770-778. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1109\/cvpr.2016.90\">doi: 10.1109\/cvpr.2016.90<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Kaiming+He+et+al.%2C+Deep+Residual+Learning+for+Image+Recognition.+%3Cem%3E2016+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition+%28CVPR%29%3C%2Fem%3E.+2015%3B++%3A770-778.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>H. \u0130. T\u00fcrkmen et al., Classification of laryngeal disorders based on shape and vascular defects of vocal folds. <em>Computers in biology and medicine<\/em>. 2015; 62 :76-85. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/j.compbiomed.2015.02.001\">doi: 10.1016\/j.compbiomed.2015.02.001<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=H.+%C4%B0.+T%C3%BCrkmen+et+al.%2C+Classification+of+laryngeal+disorders+based+on+shape+and+vascular+defects+of+vocal+folds.+%3Cem%3EComputers+in+biology+and+medicine%3C%2Fem%3E.+2015%3B+62+%3A76-85.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Christian Szegedy et al., Going deeper with convolutions. <em>2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)<\/em>. 2014;  :1-9. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1109\/CVPR.2015.7298594\">doi: 10.1109\/CVPR.2015.7298594<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Christian+Szegedy+et+al.%2C+Going+deeper+with+convolutions.+%3Cem%3E2015+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition+%28CVPR%29%3C%2Fem%3E.+2014%3B++%3A1-9.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>A. Verikas et al., A kernel-based approach to categorizing laryngeal images. <em>Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society<\/em>. 2007; 31 8 :587-94. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/J.COMPMEDIMAG.2007.07.003\">doi: 10.1016\/J.COMPMEDIMAG.2007.07.003<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=A.+Verikas+et+al.%2C+A+kernel-based+approach+to+categorizing+laryngeal+images.+%3Cem%3EComputerized+medical+imaging+and+graphics+%3A+the+official+journal+of+the+Computerized+Medical+Imaging+Society%3C%2Fem%3E.+2007%3B+31+8+%3A587-94.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>Antanas Verikas et al., Multiple feature sets based categorization of laryngeal images. <em>Computer methods and programs in biomedicine<\/em>. 2007; 85 3 :257-66. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/j.cmpb.2006.11.002\">doi: 10.1016\/j.cmpb.2006.11.002<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=Antanas+Verikas+et+al.%2C+Multiple+feature+sets+based+categorization+of+laryngeal+images.+%3Cem%3EComputer+methods+and+programs+in+biomedicine%3C%2Fem%3E.+2007%3B+85+3+%3A257-66.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>J. Ilgner et al., Colour texture analysis for quantitative laryngoscopy. <em>Acta Oto-Laryngologica<\/em>. 2003; 123 :730 - 734. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1080\/00016480310000412\">doi: 10.1080\/00016480310000412<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=J.+Ilgner+et+al.%2C+Colour+texture+analysis+for+quantitative+laryngoscopy.+%3Cem%3EActa+Oto-Laryngologica%3C%2Fem%3E.+2003%3B+123+%3A730+-+734.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><li>W. Peter et al., The History of Laryngology: A Centennial Celebration. <em>Otolaryngology- Head and Neck Surgery<\/em>. 1996; 114 :345 - 354. <a target=\"_blank\" href=\"https:\/\/doi.org\/10.1016\/S0194-59989670202-4\">doi: 10.1016\/S0194-59989670202-4<\/a><a target=\"_blank\" href=\"https:\/\/scholar.google.com\/scholar?q=W.+Peter+et+al.%2C+The+History+of+Laryngology%3A+A+Centennial+Celebration.+%3Cem%3EOtolaryngology-+Head+and+Neck+Surgery%3C%2Fem%3E.+1996%3B+114+%3A345+-+354.\"><img class=\"refIcon\" alt=\"Google Scholar\" src=\"https:\/\/bvtn.edu.vn\/wp-content\/uploads\/2023\/07\/google-schoolar.png\" style=\"width: 18px;\" class=\"ggscholar\"><\/a><\/li><\/ol>","tldr":"The results show that current deep learning models can classify vocal fold images well and effectively assist physicians in vocal fold identification and classification of normal or abnormal vocal folds.","research-type":"JournalArticle","bibtex":"@Article{Tran2023SupportOD,\n DOI = {10.1016\/j.amjoto.2023.103800}, author = {Bich Anh Tran and Thao Thi Phuong Dao and Ho Dang Quy Dung and Ngoc Boi Van and Chanh Cong Ha and Nam Hoang Pham and TuNam Nguyen and Tan-Cong Nguyen and Minh Pham and Mai-Khiem Tran and T. M. Tran and Minh-Triet Tran},\n booktitle = {American Journal of Otolaryngology},\n journal = {American journal of otolaryngology},\n pages = {\n          103800\n        },\n title = {Support of deep learning to classify vocal fold images in flexible laryngoscopy.},\n volume = {44 3},\n year = {2023}\n}\n"},"loai-nghien-cuu":[124],"class_list":["post-15426","nghien-cuu","type-nghien-cuu","status-publish","hentry","loai-nghien-cuu-quoc-te"],"_links":{"self":[{"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/nghien-cuu\/15426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/nghien-cuu"}],"about":[{"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/types\/nghien-cuu"}],"replies":[{"embeddable":true,"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/comments?post=15426"}],"version-history":[{"count":0,"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/nghien-cuu\/15426\/revisions"}],"wp:attachment":[{"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/media?parent=15426"}],"wp:term":[{"taxonomy":"loai-nghien-cuu","embeddable":true,"href":"https:\/\/bvtn.edu.vn\/en\/wp-json\/wp\/v2\/loai-nghien-cuu?post=15426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}