## Project title: *Manga Translator* Students: Tu Chau and Quang Nguyen Instructors: Huy Le, Ha V. Nguyen, Tung H. Cao, Hung Ngo Course: Deep Learning for AI, 2022 LeHongPhong Highschool for the Gifted, HCMC Vietnam --- ## Project Description: - Vietnamese people are among the biggest fans of manga, especially Vietnamese teenagers. - However, most of Vietnamese readers can only access to a few famous (translated) manga due to language barrier and lack of manga translators. - Having acknowledge the situation, our team are encouraged to build an app which can translate manga into Vietnamese. --- ## Task description $T$: High-level, briefly with some keywords & illustrations: - *++What our system do:++* Translate manga in English into Vietnamese. - *++What does it take as input:++* Untranslated manga - *++What does it give as output:++* Translated manga --- ## App pipeline: ![End-to-end Pipeline](https://i.imgur.com/h5QQzpa.png) - ***Image Segmentaton***: Extract text and words from the manga pages. - ***Translation***: Translate text written in English into Vietnamese. - ***Completing Output***: Replace the text in English on the manga pages. --- ## Image Segmentation: - Using pretrained paddleOCR to recognize lines of text (bounding boxes). - Backbone architecture: MobileNetV3 - Concatenate bounding boxes (using the distance between those bounding boxes). - Since the size of text varies $\rightarrow$ the distance between bounding boxes varies. ---- ### Image Segmentation: ![](https://i.imgur.com/DBOB6Wf.jpg) ---- ### Image Segmentation: ![](https://i.imgur.com/xSlyclK.jpg) --- ## Translation: - Translate the text with googletrans - Sometimes paddleOCR ommit the spaces between words $\rightarrow$ using module wordninja to prevent the problems. --- ## Text Rendering: - Inpainting objects (text) from the manga page with [LaMa](https://github.com/saic-mdal/lama) $\rightarrow$ replace them with new text (translated sentences). ---- ### Inpainting: ![](https://i.imgur.com/lU8NKrg.jpg) ---- ### Text Rendering: ![](https://i.imgur.com/i3AfajB.jpg) --- ### Compare with other apps: - Inpainting with LaMa instead of inserting white box to cover the old text. - Auto-detection rather than cropping out. --- ### What needs to be improved: - Real-time execution. - The app only support 1 font $\rightarrow$ insert synthetic text. - Sometimes PaddleOCR cannot detect text accurately $\rightarrow$ need to be retrained. ---- ![](https://i.imgur.com/vXRqhbB.png) --- ## Demo: - Demo Video: https://youtu.be/usfyNl7KHbQ - GitHub repo: https://github.com/tuchau1404/manga_translate
{"metaMigratedAt":"2023-06-17T02:58:04.734Z","metaMigratedFrom":"YAML","breaks":true,"slideOptions":"{\"transition\":\"slide\",\"width\":900,\"height\":900,\"margin\":0.1,\"minScale\":0.2,\"maxScale\":1.5}","contributors":"[{\"id\":\"70a8c36d-ad4f-4f48-a325-03376546e656\",\"add\":3604,\"del\":825}]","description":"Students: Tu Chau and Quang Nguyen","title":"Project Capstone"}
    499 views
   Owned this note