DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Eunji | - |
dc.contributor.author | Park, Jaewoo | - |
dc.contributor.author | Koo, Hyung Il | - |
dc.contributor.author | Cho, Nam Ik | - |
dc.date.issued | 2022-02-01 | - |
dc.identifier.issn | 1573-7721 | - |
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/32460 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85122058382&origin=inward | - |
dc.description.abstract | Table structure recognition is a key component in document understanding. Many prior methods have addressed this problem with three sequential steps: table detection, table component extraction, and structure analysis based on pairwise relations. However, they have limitations in handling tables with complex structure and/or practical scenarios (e.g., scanned documents). In this paper, we propose a novel graph-based table structure recognition framework. To handle complex tables, we formulate tables as planar graphs whose faces are cell regions. We then compute vertex (junction) confidence maps and line fields with heatmap regression networks that have a small number of parameters (about 1M), and reconstruct tables by solving a constrained optimization problem. We demonstrate the robustness of the proposed system through experiments on the ICDAR 2019 dataset and on challenging table images. Experimental results show that the proposed method outperforms the conventional method across a range of scenarios and delivers good generalization performance. | - |
dc.description.sponsorship | This work was supported in part by the Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. 2021-0-01062, Development of personal information processing technology for collection/utilization of high-quality and trusted training data for autonomous driving), and in part by LG AI Research. | - |
dc.language.iso | eng | - |
dc.publisher | Springer | - |
dc.subject.mesh | Component extraction | - |
dc.subject.mesh | Deep learning | - |
dc.subject.mesh | Document understanding | - |
dc.subject.mesh | Documents analysis | - |
dc.subject.mesh | Graph-based | - |
dc.subject.mesh | Graph-based approach | - |
dc.subject.mesh | Structure recognition | - |
dc.subject.mesh | Table detection | - |
dc.subject.mesh | Table structure | - |
dc.subject.mesh | Table understanding | - |
dc.title | Deep-learning and graph-based approach to table structure recognition | - |
dc.type | Article | - |
dc.citation.endPage | 5848 | - |
dc.citation.number | 4 | - |
dc.citation.startPage | 5827 | - |
dc.citation.title | Multimedia Tools and Applications | - |
dc.citation.volume | 81 | - |
dc.identifier.bibliographicCitation | Multimedia Tools and Applications, Vol.81 No.4, pp.5827-5848 | - |
dc.identifier.doi | 2-s2.0-85122058382 | - |
dc.identifier.scopusid | 2-s2.0-85122058382 | - |
dc.identifier.url | https://link.springer.com/journal/11042 | - |
dc.subject.keyword | Deep learning | - |
dc.subject.keyword | Document analysis | - |
dc.subject.keyword | Graph-based approach | - |
dc.subject.keyword | Table understanding | - |
dc.type.other | Article | - |
dc.identifier.pissn | 1380-7501 | - |
dc.description.isoa | false | - |
dc.subject.subarea | Software | - |
dc.subject.subarea | Media Technology | - |
dc.subject.subarea | Hardware and Architecture | - |
dc.subject.subarea | Computer Networks and Communications | - |