Institutional-Repository, University of Moratuwa.  

Investigating the learning progress of cnns in script identification using gradient values

Show simple item record

dc.contributor.author Tomioka, E
dc.contributor.author Morita, K
dc.contributor.author Shirai, NC
dc.contributor.author Wakabayashi, T
dc.contributor.author Ohyama, W
dc.contributor.editor Sudantha, BH
dc.date.accessioned 2022-11-18T04:26:52Z
dc.date.available 2022-11-18T04:26:52Z
dc.date.issued 2019-12
dc.identifier.citation E. Tomioka, K. Morita, N. C. Shirai, T. Wakabayashi and W. Ohyama, "Investigating the Learning Progress of CNNs in Script Identification Using Gradient Values," 2019 4th International Conference on Information Technology Research (ICITR), 2019, pp. 1-6, doi: 10.1109/ICITR49409.2019.9407784. en_US
dc.identifier.uri http://dl.lib.uom.lk/handle/123/19562
dc.description.abstract Demands for an automatic translation based on Camera-based Multilingual Optical Character Recognition (CM-OCR) are increasing. In addition, CM-OCR methods usually employ a script identification step before character recognition. Recent approaches for script identification depend on a Convolutional Neural Networks (CNN) thanks to its promising performance in the image recognition task. However, researchers mentioned the importance to understand the decision criteria in CNNs as a warning to employ them for actual tasks as black-box classifiers. Thus, the purpose of this research is to investigate the hyperparameter dependence of CNNs and to visualize the region focused by CNNs in the task of script identification. In this research, we applied Grad-CAM to the script identification task of image classification and used the SIW-13 dataset. We investigated the learning progress of CNNs by defining the value used in Grad-CAM as the "reaction" and visualized the region focused by CNNs in script identification. As a result, the learning process was stabilized in the case that the number of hyperparameters was sufficient for the given training samples even though the hyperparameters which should be tuned were increased. This result demonstrated that the capacity to stably learn training samples depends on the number of hyperparameters. In the insufficient capacity case, the learning process was destabilized and it caused scripts with relatively low accuracy. Analyzing one of the low accuracy scripts of the model using Grad-CAM, we found that some failures progress greatly changes by the difference in hyperparameters of CNNs. Scatter plots of the reaction and the probability clarified the capacity of CNNs in each script. en_US
dc.language.iso en en_US
dc.publisher Information Technology Research Unit, Faculty of Information Technology, University of Moratuwa, Sri Lanka en_US
dc.relation.uri https://ieeexplore.ieee.org/document/9407784 en_US
dc.subject CM-OCR en_US
dc.subject Deep Learning en_US
dc.subject Convolutional Neural Network en_US
dc.subject Grad-CAM en_US
dc.subject Scene Text Script en_US
dc.subject Script Identification en_US
dc.title Investigating the learning progress of cnns in script identification using gradient values en_US
dc.type Conference-Full-text en_US
dc.identifier.faculty IT en_US
dc.identifier.department Information Technology Research Unit, Faculty of Information Technology, University of Moratuwa. en_US
dc.identifier.year 2019 en_US
dc.identifier.conference 4th International Conference in Information Technology Research 2019 en_US
dc.identifier.place Colombo,Sri Lanka en_US
dc.identifier.proceeding Proceedings of the 4th International Conference in Information Technology Research 2019 en_US
dc.identifier.doi doi: 10.1109/ICITR49409.2019.9407784 en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

  • ICITR - 2019 [19]
    International Conference on Information Technology Research (ICITR)

Show simple item record