實習日記11/28~12/2

--- tags: 實習日記 --- # 實習日記11/28~12/2 [TOC] # 進度 ## 整理紅綠燈辨識照片 - 0-1000:339/3=113 - 1001-2000=291/3=97 - 2001-3000=285/3=95 - 3001-4000=294/=98 - 4001-Fianl= 486/3=162 <font color="#f00">共1695張，565組</font> ![](https://i.imgur.com/bgwEiKH.png) ## EasyOCR 1. examples: ![](https://i.imgur.com/FBTQ3zU.png) ![](https://i.imgur.com/VPR2YUl.png) ![](https://i.imgur.com/F56tL1f.png) 2. 使用深度學習與Tesseract不同，EasyOCR是基於深度學習的OCR套件，它使用CRAFT模型（Character Region Awareness for Text Detection）來進行文字偵測，再依不同的語言載入對應的CRNN（Convolutional Recurrent Neural Network，卷積循環神經網路）模型進行檢測。每個CRNN模型包含了三個組件: 1.特徵萃取Resnet 2.序列標記LSTM 3.文字解譯CTC （connectionist temporal classification） ![](https://i.imgur.com/Zeq5o36.png) 3. 較適合靜態的印刷字體 4. 實作1：使用easyocr ![](https://i.imgur.com/03KB5K3.png) ![](https://i.imgur.com/kbGgI5H.png) 5. 實作2: easyocr vs keras_ocr ![](https://i.imgur.com/1AMte6h.jpg) ![](https://i.imgur.com/Jp7p7EK.jpg) ![](https://i.imgur.com/30EK8z8.png) ![](https://i.imgur.com/Bgef8CW.jpg) ![](https://i.imgur.com/7TJeKRN.jpg) Conclusion: - While keras_ocr is good in terms of accuracy but it is costly in terms of time. Also if you’re using CPU, time might be an issue for you. Keras-OCR is image specific OCR tool. If text is inside the image and their fonts and colors are unorganized. - Easy-OCR is lightweight model which is giving a good performance for receipt or PDF conversion. It is giving more accurate results with organized texts like PDF files, receipts, bills. Easy OCR also performs well on noisy images. - All these results can be further improved by performing specific image operations. OCR Prediction is not only dependent on the model and also on a lot of other factors like clarity, grey scale of the image, hyper parameter, weight age given, etc. 6. - https://www.youtube.com/watch?v=ZVKaWPW9oQY - https://github.com/nicknochnack/EasyOCR/blob/main/OCR%20Basics-EasyOCR.ipynb - https://www.kaggle.com/code/robikscube/extracting-text-from-images-youtube-tutorial - https://www.youtube.com/watch?v=oyqNdcbKhew ## Github基本教學 https://hackmd.io/_7_uEcksQu-XfSr4-YJhfw