Binarization algorithm for document image with complex background

Bibliographic details
Title: Binarization algorithm for document image with complex background
Authors: Shaojun Miao, Tongwei Lu, Feng Min
Source: SPIE Proceedings.
Publication info: SPIE, 2015.
Publication year: 2015
Subject terms: Polynomial, Pixel, Pattern recognition, Optical character recognition, Edge detection, Histogram, Contrast (vision), Computer vision, Artificial intelligence, Bilateral filter, Noise (video), Algorithm, Mathematics
Description: Binarization is the most important step in image preprocessing for Optical Character Recognition (OCR). Complex backgrounds and varying illumination in text images make binarization a difficult problem. This paper presents an improved binarization algorithm that proceeds in several steps. First, an approximation of the background is obtained by polynomial fitting, and the text is sharpened with a bilateral filter. Second, contrast compensation is applied to reduce the impact of lighting and improve the contrast of the original image. Third, the first derivative of the pixels in the compensated image is calculated to obtain an average value that serves as the threshold, and edge detection is then performed. Fourth, the stroke width of the text is estimated by measuring the distance between edge pixels; the final stroke width is taken as the most frequent distance in the histogram. Fifth, the window size is calculated from the final stroke width, and a local threshold estimation approach is used to binarize the image. Finally, small noise is removed using morphological operators. The experimental results show that the proposed method can effectively remove the noise caused by complex backgrounds and varying light.
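The stroke-width step in the description (measure distances between edge pixels, then take the most frequent distance in the histogram) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not say in which direction distances are measured or how outliers are handled, so the horizontal row scan, the `max_width` cutoff, the exclusion of directly adjacent edge pixels, and the function name `estimate_stroke_width` are all assumptions made for illustration.

```python
import numpy as np

def estimate_stroke_width(edges, max_width=50):
    """Estimate stroke width as the most frequent gap between edge pixels.

    Assumptions (not taken from the paper): gaps are measured along rows
    only, gaps of 1 (touching edge pixels) are ignored, and gaps larger
    than max_width are treated as background rather than strokes.
    """
    edges = np.asarray(edges, dtype=bool)
    hist = np.zeros(max_width + 1, dtype=np.int64)
    for row in edges:
        cols = np.flatnonzero(row)          # column indices of edge pixels
        if cols.size < 2:
            continue
        gaps = np.diff(cols)                # distances between consecutive edges
        gaps = gaps[(gaps > 1) & (gaps <= max_width)]
        hist += np.bincount(gaps, minlength=max_width + 1)
    if hist.sum() == 0:
        return None                         # no usable edge pairs found
    return int(np.argmax(hist))             # mode of the distance histogram
```

Given a boolean edge map from any edge detector, the returned value could then feed the window-size calculation of the local thresholding step. The record does not state how the window size is derived from the stroke width, so that mapping is left out here.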
ISSN: 0277-786X
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::efa7cca7c1b6912c28f005fe0c258e8f
https://doi.org/10.1117/12.2209016
Accession number: edsair.doi...........efa7cca7c1b6912c28f005fe0c258e8f
Database: OpenAIRE