GLOBAL THRESHOLDING IMPLEMENTATION FOR NOISE HANDLING IN DIGITAL IMAGE RECOGNITION

Dannu Purwanto, Agustiyar Agustiyar

Abstract


Text recognition (OCR - Optical Character Recognition) is a research field that is gaining widespread attention due to its wide application in image and document processing. Although OCR technology has achieved a high level of success, the main challenge faced is the presence of noise in text image, noise causes decreased text recognition results, noise causes miss classification. Therefore needed noise handling text recognition.  The aim of this research is to provide valuable insight into the techniques and approaches used in the context of noise treatment using global threshold methods. The method used starts from an input digital image, then preprocessing is carried out by converting the image into a gray scale image, then a threshold is applied to the image, then recognition is carried out. From 6 experiments, the best results were obtained for character recognition with a threshold value (t) of 65 and a character recognition accuracy percentage of 94.29%. T value determined manually and static for separates the all object and the background, while in reality the lighting or contrast always varies. Suggestions for further research include developing an adaptive thresholding method approach to obtain threshold values automatically and optimally. So that if faced with varying lighting conditions or contrast, better results can be obtained.


Keywords


Segmentation; Character Recognition; Global Thresholding; Noise

References


B. Gulnara and A. Yerassyl, “Using Image Processing and Optical Character Recognition to Recognise ID cards in the Online Process of Onboarding,” SIST 2022 - 2022 Int. Conf. Smart Inf. Syst. Technol. Proc., 2022.

C. Kaundilya, D. Chawla, and Y. Chopra, “Automated text extraction from images using OCR system,” Proc. 2019 6th Int. Conf. Comput. Sustain. Glob. Dev. INDIACom 2019, pp. 145–150, 2019.

I. O. Joshua, M. O. Arowolo, M. O. Adebiyi, O. R. Oluwaseun, and K. A. Gbolagade, “Development of an Image Processing Techniques for Vehicle Classification Using OCR and SVM,” in 2023 International Conference on Science, Engineering and Business for Sustainable Development Goals, SEB-SDG 2023, 2023.

A. M. Abbas, M. S. S. Hameed, S. Balakrishnan, and K. S. Anandh, “Intelligent Document Finding using Optical Character Recognition and Tagging,” Int. Conf. Autom. Comput. Renew. Syst. ICACRS 2022 - Proc., pp. 1165–1168, 2022.

J. Chaloupka, K. Palecek, P. Cerva, and J. Zdansky, “Optical character recognition for audio-visual broadcast transcription system,” 11th IEEE Int. Conf. Cogn. Infocommunications, CogInfoCom 2020 - Proc., pp. 229–232, 2020.

I. Pivavaruk and J. R. Fonseca Cacho, “OCR Enhanced Augmented Reality Indoor Navigation,” Proc. - 2022 IEEE Int. Conf. Artif. Intell. Virtual Reality, AIVR 2022, pp. 186–192, 2022.

J. Ma et al., “Arbitrary-oriented scene text detection via rotation proposals,” IEEE Trans. Multimed., vol. 20, no. 11, pp. 3111–3122, 2018.

S. K. Devi and C. N. Subalalitha, “Intelligent Deep Learning Empowered Text Detection Model from Natural Scene Images,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 12, no. 3, pp. 1263–1268, 2022.

A. A. Chandio, M. Asikuzzaman, M. R. Pickering, and M. Leghari, “Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network,” IEEE Access, vol. 10, pp. 10062–10078, 2022.

L. Gomez, A. Nicolaou, and D. Karatzas, “Improving patch-based scene text script identification with ensembles of conjoined networks,” Pattern Recognit., vol. 67, pp. 85–96, 2017.

T. Titijaroonroj and K. Woraratpanya, “Iteration-free Bi-dimensional empirical mode decomposition and its application,” IEICE Trans. Inf. Syst., vol. E100D, no. 9, pp. 2183–2196, 2017.

H. Mokayed, P. Shivakumara, R. Saini, M. Liwicki, L. Chee Hin, and U. Pal, “Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic,” IEEE Access, vol. 9, pp. 129102–129109, 2021.

C. L. Cameras, O. Enqvist, and F. Kahl, “An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 7, pp. 1455–1461, 2017.

Z. Huang et al., “An algorithm based on text position correction and encoder-decoder network for text recognition in the scene image of visual sensors,” Sensors (Switzerland), vol. 20, no. 10, 2020.

M. K. Sharma, T. Sharma, M. Gupta, and Shivanshdeep, “Recognition and Extraction of Banking Data using OCR,” Proc. - 2021 3rd Int. Conf. Adv. Comput. Commun. Control Networking, ICAC3N 2021, pp. 472–475, 2021.

S. Godbole, D. Joijode, K. Kadam, and S. Karoshi, “Detection of Medicine Information with Optical Character Recognition using Android,” Proc. B-HTC 2020 - 1st IEEE Bangalore Humanit. Technol. Conf., 2020.

N. Ramya, M. Annamalai, and K. Santhosh, “An Automated Vehicle Tracking System Using Haar-Cascade Classifiers and Optical Character Recognition Engine,” in 2022 1st International Conference on Computational Science and Technology, ICCST 2022 - Proceedings, 2022, pp. 958–962.

D. Joshi and N. Mohd, “Techniques used in Automatic Number Plate Recognition,” 2023 4th Int. Conf. Emerg. Technol. INCET 2023, 2023.

F. Chen, H. Kataoka, and Y. Satoh, “Text Detection in Traffic Informatory Signs Using Synthetic Data,” Proc. Int. Conf. Doc. Anal. Recognition, ICDAR, vol. 1, pp. 851–858, 2017.

M. Krishnamoorthi, K. P. S. Ram, M. Sathyan, and T. Vasanth, “Improving Optical Character Recognition(OCR) Accuracy using Multi-Layer Perceptron(MLP),” 7th Int. Conf. Trends Electron. Informatics, ICOEI 2023 - Proc., pp. 1642–1647, 2023.

P. Sisodia, “Optical Character Recognition Development Using Python,” J. Informatics Electr. Electron. Eng., vol. 4, no. 3, pp. 1–13, 2023.

S. J. Jang, “OCR RELATED TECHNOLOGY TRENDS,” Eur. J. Eng. Technol., vol. 8, no. 1, pp. 13–20, 2020.

G. Shidaganti, S. Salil, P. Anand, and V. Jadhav, “Robotic Process Automation with AI and OCR to Improve Business Process: Review,” in Proceedings of the 2nd International Conference on Electronics and Sustainable Communication Systems, ICESC 2021, 2021, pp. 1612–1618.

M. Namysl and I. Konya, “Efficient, lexicon-free OCR using deep learning,” Proc. Int. Conf. Doc. Anal. Recognition, ICDAR, pp. 295–301, 2019.

K. I. Gunawan and J. Santoso, “Multilabel Text Classification Menggunakan SVM dan Doc2Vec Classification Pada Dokumen Berita Bahasa Indonesia,” J. Inf. Syst. Hosp. Technol., vol. 3, no. 01, pp. 29–38, 2021.

Q. Mazhar, A. Masood, I. Touqir, and A. Ahmad, “Image De-Noising and Compression Using Statistical based Thresholding in 2-D Discrete Wavelet Transform,” Int. J. Adv. Comput. Sci. Appl., vol. 7, no. 11, pp. 311–316, 2016.

W. Gunawan and A. Z. Arifin, “Lokal Fuzzy Thresholding Berdasarkan Pengukuran Fuzzy Similarity Pada Interaktif Segmentasi Citra Panoramik Gigi,” J. Infotel, vol. 9, no. 1, p. 40, 2017.

A. Desiani, D. A. Zayanti, R. Primartha, F. Efriliyanti, and N. A. C. Andriani, “Variasi Thresholding untuk Segmentasi Pembuluh Darah Citra Retina,” J. Edukasi dan Penelit. Inform., vol. 7, no. 2, p. 255, 2021.

D. Tuwohingide and C. Fatichah, “Spatial Fuzzy C-means dan Rapid Region Merging untuk Pemisahan Sel Kanker Payudara,” J. Nas. Tek. Elektro dan Teknol. Inf., vol. 6, no. 1, 2017.

D. P. Agyztia Premana, Raden Mohamad Herdian Bhakti, “Segmentasi K-Means Clustering Pada Citra Menggunakan Ekstrasi Fitur Warna dan Tekstur,” Pap. Knowl. . Towar. a Media Hist. Doc., 2020.




DOI: http://dx.doi.org/10.26623/transformatika.v21i2.8713

Refbacks

  • There are currently no refbacks.


| View My Stats |

Jurnal Transformatika : Journal Information Technology  by  Department of Information Technology, Faculty of Information Technology and Communication, Semarang University  is licensed under a  Creative Commons Attribution 4.0 International License.