To participate go to: https://competitions.codalab.org/competitions/26979 Table recognition is a well-studied problem in document analysis, and many academic and commercial approaches have been developed to recognize tables in several document formats, including plain text, scanned page images, and born-digital, object-based formats such as PDF. There are several works that can convert tables in text-based PDF format into structured representations. However, there is limited work on image-based table content recognition. The proposed challenge aims at assessing the ability of state-of-the-art methods to recognize scientific tables in LaTeX format. In particular, the problem would be split up into two subtasks: Subtask I: : Reconstructing the structure of a table in the form of LaTeX symbols and code Table structure reconstruction (S) Subtask II: : Reconstructing and recognizing the content of a table in the form of LaTeX symbols and code Table content reconstruction © Tasks Our shared task has two subtasks. Subtask-1 and Subtask-2 focus on evaluating machine-learning models’ performance with regard to two broader table recognition tasks. Subtask-I: Table structure reconstruction In this subtask, you are given an image of a table and its corresponding LaTeX code. You need to construct the LaTeX structural tokens that define the table in LaTeX. Subtask-II: Table content reconstruction In this subtask, you are given an image of a table and its corresponding LaTeX code. You need to construct the LaTeX content tokens that belong to the table in LaTeX. Frequently Asked Questions Q1. What is the size of the dataset with specific numbers for each task (training set — test — validation set)? Size of the dataset for both the subtasks is given as follows: A1. We abbreviate Table structure reconstruction task dataset as TSRD and Table content reconstruction task dataset as TCRD. For the TSR dataset, we take tables having less than 250 tokens and for TCR dataset we take tables having less than 500 tokens. Q2. Will the code of the competitors be available for the research community (reproducibility of the results)? It would be mandatory for participants to make their code available for reproducibility. The dataset provided for this task would be licensed under international license, and the evaluation script would be provided under MIT License. A2. CC BY-NC-SA 4.0 Q3. Will there be an award for all the proposed Tasks? We would be awarding both the proposed subtasks: A3. Table structure reconstruction taskTable content reconstruction task Q4. What are some Examples for the two tasks? Examples: A4. Table Structure Reconstruction: { | c c | } \\hline \\multicolumn { 2 } { | c | } CELL \\\\ \\hline \\multicolumn { 2 } { | c | } CELL \\\\ \\multicolumn { 2 } { | c | } CELL \\\\ \\multicolumn { 2 } { | c | } CELL \\\\ \\multicolumn { 2 } { | c | } CELL \\\\ \\multicolumn { 2 } { | c | } CELL \\\\ \\hline Table Content Reconstruction: $ T _ { \mathbf { D } 1 } = p _ { 1 1 ¦ } \frac { t _ { \mathbf { A } } + \mathbf { p } — \frac { \mathbf { r } } { 2 } } { 2 t ¦ _ { \mathbf { D } } } + p _ { 1 2 ¦ } \frac { t _ { \mathbf { D } } + \mathbf { p — d — r } } { 2 t ¦ _ { \mathbf { D } } } + $ \\ $ p _ { 1 3 ¦ } \frac { t _ { \mathbf { A } } + t _ { \mathbf { D } } — 2 \mathbf { r + p — d } } { 4 t ¦ _ { \mathbf { D } } } . $ Timeline: Registration Period: 15th Oct 2020 to 28th Feb 2021Release of training and validation set: 20th Oct 2020Release of test set: 01st Mar 2021Submission Deadline: 31st Mar 2021Post-Evaluation Phase Starts: 01st Apr 2021 Evaluation For both the subtasks, the participants would be required to submit the prediction files as per the submission format. The tasks would be scored by Exact Match Accuracy and Exact Match Accuracy @ 95% similarity as common evaluation metrics. Also, task-specific metrics include: Row Prediction Accuracy and Column Prediction Accuracy for Table structure reconstruction taskAlpha-Numeric characters Prediction Accuracy, LaTeX Token Accuracy, LaTex Symbol Accuracy, and Non-LaTeX Symbols Prediction Accuracy for Table content reconstruction task The description of each metric is as follows: : Fraction of predictions which match exactly with the ground truth Exact Match Accuracy : Fraction of predictions with at least 95% similarity between ground truth Exact Match Accuracy @ 95% similarity : Fraction of predictions with a count of rows equal to the count of rows in the ground truth Row Prediction Accuracy : Fraction of predictions with a count of cell alignment (‘c’, ‘r’, ‘l’) tokens equal to the count of cell alignment tokens in the ground truth Column Prediction Accuracy : Fraction of predictions which has the same alphanumeric characters as in the ground truth Alpha-Numeric Characters Prediction Accuracy : Fraction of predictions which has the same LaTeX tokens as in the ground truth LaTeX Token Accuracy : Fraction of predictions which has the same LaTeX Symbols as in the ground truth LaTeX Symbol Accuracy : Fraction of predictions which has the same Non-LaTeX Symbols as in the ground truth Non-LaTeX Symbol Prediction Accuracy Example: For the given image, to calculate Exact Match Accuracy @ 95% similarity between the ground truth target sequence and predicted target sequence, we use the Longest Common Subsequence algorithm to find the similarity percentage and set the similarity percentage minimum threshold to 95%. The ground truth target sequence (G) for Table structure recognition task is (No. of tokens = 37) { c | c c c } & \milticolumn { 3 } { c } \\ & & & \\ \hline \hline & & \\ & & & \\ \hline \multicolumn { 3 } { c } and the predicted target sequence (P) is (No. of tokens = 36) { c | c c } & \milticolumn { 2 } { c } \\ & & & \\ \hline \hline & & \\ & & & \\ \hline \multicolumn { 3 } { c } The longest common subsequence between G and P is . } { c } \\ & & & \\ \hline \hline & & \\ & & & \\ \hline \multicolumn { 3 } { c } Thus, the percentage similarity calculated is (26/.37). 70.27% Please post your queries as comments. Also published at https://medium.com/@pratik.kayal/competition-latex-code-generation-from-table-images-1261a1650810

ICDAR 2021 Competition: Detecting Tables Using Image Recognition

About Author

Comments

TOPICS

THIS ARTICLE WAS FEATURED IN

Related Stories

5 Best AI Articles of the Month

6 Effective Ways to Stay Ahead of Your Competition Using Social AI

77 Stories To Learn About Ai Top Story

A Deep Learning Overview: NLP vs CNN

A Detailed Overview of How AI Detectors Work

A Quick Introduction to Federated Learning of Cohorts [FloC]

5 Best AI Articles of the Month

6 Effective Ways to Stay Ahead of Your Competition Using Social AI

77 Stories To Learn About Ai Top Story

A Deep Learning Overview: NLP vs CNN

A Detailed Overview of How AI Detectors Work

A Quick Introduction to Federated Learning of Cohorts [FloC]

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps