--- tags: Table Detection --- # TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images 表格檢測 & 結構識別 e2e model. ## Contributions - e2e deep multial-task architecture for both table detection and structure recognition. - adding additional spatial semantic features. - fine tuning it on an another new dataset will boost the performance of the model on the new dataset. - manually annotated the Marmot dataset!!!! ## Model Structure 1. pre-train VGG-19 encoder and two decoder (table detection and structure recognition) ![](https://i.imgur.com/FZR6OVe.png) 3. The images are preprocessed and modified using the Tesseract OCR - spatial semantic features by highlighting the words. ![](https://i.imgur.com/QURKEas.png) 4. output is mask. ![](https://i.imgur.com/PEJY7ut.png) ## row segmentation - Radon transform. - The rows of the table which have maximum non-blank entries is marked as the starting point for a new row. - All the columns are completely filled and there are no line demarcations, each line (level) can be seen as a unique row. ## dataset [PAPER](https://arxiv.org/pdf/2001.01469.pdf) [Marmot Dataset](https://www.icst.pku.edu.cn/cpdp/sjzy/index.htm) [GITHUB](https://github.com/jainammm/TableNet) [知乎](https://zhuanlan.zhihu.com/p/385673899)