Skip to content

Commit c8b5915

Browse files
committed
Update tests
Signed-off-by: Christoph Auer <[email protected]>
1 parent bd30b46 commit c8b5915

34 files changed

+267
-133
lines changed

tests/data/groundtruth/docling_v1/2203.01017v2.doctags.txt

Lines changed: 29 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -153,20 +153,41 @@
153153
</table>
154154
<paragraph><location><page_8><loc_9><loc_89><loc_10><loc_90></location>- a.</paragraph>
155155
<paragraph><location><page_8><loc_11><loc_89><loc_82><loc_90></location>- Red - PDF cells, Green - predicted bounding boxes, Blue - post-processed predictions matched to PDF cells</paragraph>
156-
<caption><location><page_8><loc_9><loc_87><loc_70><loc_88></location>Japanese language (previously unseen by TableFormer): Example table from FinTabNet:</caption>
157-
<caption><location><page_8><loc_9><loc_73><loc_63><loc_74></location>b. Structure predicted by TableFormer, with superimposed matched PDF cell text:</caption>
156+
<subtitle-level-1><location><page_8><loc_9><loc_87><loc_46><loc_88></location>Japanese language (previously unseen by TableFormer):</subtitle-level-1>
157+
<subtitle-level-1><location><page_8><loc_50><loc_87><loc_70><loc_88></location>Example table from FinTabNet:</subtitle-level-1>
158158
<figure>
159159
<location><page_8><loc_8><loc_76><loc_49><loc_87></location>
160-
<caption>Japanese language (previously unseen by TableFormer): Example table from FinTabNet:b. Structure predicted by TableFormer, with superimposed matched PDF cell text:</caption>
161160
</figure>
161+
<caption><location><page_8><loc_9><loc_73><loc_63><loc_74></location>b. Structure predicted by TableFormer, with superimposed matched PDF cell text:</caption>
162162
<figure>
163-
<location><page_8><loc_9><loc_63><loc_49><loc_72></location>
163+
<location><page_8><loc_50><loc_77><loc_91><loc_88></location>
164+
<caption>b. Structure predicted by TableFormer, with superimposed matched PDF cell text:</caption>
164165
</figure>
166+
<table>
167+
<location><page_8><loc_9><loc_63><loc_49><loc_72></location>
168+
<row_0><col_0><body></col_0><col_1><body></col_1><col_2><col_header>論文ファイル</col_2><col_3><col_header>論文ファイル</col_3><col_4><col_header>参考文献</col_4><col_5><col_header>参考文献</col_5></row_0>
169+
<row_1><col_0><col_header>出典</col_0><col_1><col_header>ファイル 数</col_1><col_2><col_header>英語</col_2><col_3><col_header>日本語</col_3><col_4><col_header>英語</col_4><col_5><col_header>日本語</col_5></row_1>
170+
<row_2><col_0><row_header>Association for Computational Linguistics(ACL2003)</col_0><col_1><body>65</col_1><col_2><body>65</col_2><col_3><body>0</col_3><col_4><body>150</col_4><col_5><body>0</col_5></row_2>
171+
<row_3><col_0><row_header>Computational Linguistics(COLING2002)</col_0><col_1><body>140</col_1><col_2><body>140</col_2><col_3><body>0</col_3><col_4><body>150</col_4><col_5><body>0</col_5></row_3>
172+
<row_4><col_0><row_header>電気情報通信学会 2003 年総合大会</col_0><col_1><body>150</col_1><col_2><body>8</col_2><col_3><body>142</col_3><col_4><body>223</col_4><col_5><body>147</col_5></row_4>
173+
<row_5><col_0><row_header>情報処理学会第 65 回全国大会 (2003)</col_0><col_1><body>177</col_1><col_2><body>1</col_2><col_3><body>176</col_3><col_4><body>150</col_4><col_5><body>236</col_5></row_5>
174+
<row_6><col_0><row_header>第 17 回人工知能学会全国大会 (2003)</col_0><col_1><body>208</col_1><col_2><body>5</col_2><col_3><body>203</col_3><col_4><body>152</col_4><col_5><body>244</col_5></row_6>
175+
<row_7><col_0><row_header>自然言語処理研究会第 146 〜 155 回</col_0><col_1><body>98</col_1><col_2><body>2</col_2><col_3><body>96</col_3><col_4><body>150</col_4><col_5><body>232</col_5></row_7>
176+
<row_8><col_0><row_header>WWW から収集した論文</col_0><col_1><body>107</col_1><col_2><body>73</col_2><col_3><body>34</col_3><col_4><body>147</col_4><col_5><body>96</col_5></row_8>
177+
<row_9><col_0><body></col_0><col_1><body>945</col_1><col_2><body>294</col_2><col_3><body>651</col_3><col_4><body>1122</col_4><col_5><body>955</col_5></row_9>
178+
</table>
165179
<caption><location><page_8><loc_62><loc_62><loc_90><loc_63></location>Text is aligned to match original for ease of viewing</caption>
166-
<figure>
180+
<table>
167181
<location><page_8><loc_50><loc_64><loc_90><loc_72></location>
168182
<caption>Text is aligned to match original for ease of viewing</caption>
169-
</figure>
183+
<row_0><col_0><body></col_0><col_1><col_header>Shares (in millions)</col_1><col_2><col_header>Shares (in millions)</col_2><col_3><col_header>Weighted Average Grant Date Fair Value</col_3><col_4><col_header>Weighted Average Grant Date Fair Value</col_4></row_0>
184+
<row_1><col_0><body></col_0><col_1><col_header>RS U s</col_1><col_2><col_header>PSUs</col_2><col_3><col_header>RSUs</col_3><col_4><col_header>PSUs</col_4></row_1>
185+
<row_2><col_0><row_header>Nonvested on Janua ry 1</col_0><col_1><body>1. 1</col_1><col_2><body>0.3</col_2><col_3><body>90.10 $</col_3><col_4><body>$ 91.19</col_4></row_2>
186+
<row_3><col_0><row_header>Granted</col_0><col_1><body>0. 5</col_1><col_2><body>0.1</col_2><col_3><body>117.44</col_3><col_4><body>122.41</col_4></row_3>
187+
<row_4><col_0><row_header>Vested</col_0><col_1><body>(0. 5 )</col_1><col_2><body>(0.1)</col_2><col_3><body>87.08</col_3><col_4><body>81.14</col_4></row_4>
188+
<row_5><col_0><row_header>Canceled or forfeited</col_0><col_1><body>(0. 1 )</col_1><col_2><body>-</col_2><col_3><body>102.01</col_3><col_4><body>92.18</col_4></row_5>
189+
<row_6><col_0><row_header>Nonvested on December 31</col_0><col_1><body>1.0</col_1><col_2><body>0.3</col_2><col_3><body>104.85 $</col_3><col_4><body>$ 104.51</col_4></row_6>
190+
</table>
170191
<caption><location><page_8><loc_8><loc_54><loc_89><loc_59></location>Figure 5: One of the benefits of TableFormer is that it is language agnostic, as an example, the left part of the illustration demonstrates TableFormer predictions on previously unseen language (Japanese). Additionally, we see that TableFormer is robust to variability in style and content, right side of the illustration shows the example of the TableFormer prediction from the FinTabNet dataset.</caption>
171192
<figure>
172193
<location><page_8><loc_8><loc_44><loc_35><loc_52></location>
@@ -275,7 +296,7 @@
275296
<paragraph><location><page_13><loc_10><loc_35><loc_45><loc_37></location>Figure 8: Example of a table with multi-line header.</paragraph>
276297
<caption><location><page_13><loc_50><loc_59><loc_89><loc_61></location>Figure 9: Example of a table with big empty distance between cells.</caption>
277298
<figure>
278-
<location><page_13><loc_51><loc_63><loc_91><loc_87></location>
299+
<location><page_13><loc_51><loc_63><loc_70><loc_68></location>
279300
<caption>Figure 9: Example of a table with big empty distance between cells.</caption>
280301
</figure>
281302
<caption><location><page_13><loc_51><loc_13><loc_89><loc_14></location>Figure 10: Example of a complex table with empty cells.</caption>
@@ -298,11 +319,7 @@
298319
<location><page_14><loc_52><loc_55><loc_87><loc_89></location>
299320
<caption>Figure 13: Table predictions example on colorful table.</caption>
300321
</figure>
301-
<caption><location><page_14><loc_56><loc_13><loc_83><loc_14></location>Figure 14: Example with multi-line text.</caption>
302-
<figure>
303-
<location><page_14><loc_52><loc_25><loc_85><loc_31></location>
304-
<caption>Figure 14: Example with multi-line text.</caption>
305-
</figure>
322+
<paragraph><location><page_14><loc_56><loc_13><loc_83><loc_14></location>Figure 14: Example with multi-line text.</paragraph>
306323
<figure>
307324
<location><page_15><loc_9><loc_69><loc_46><loc_83></location>
308325
</figure>
@@ -318,9 +335,6 @@
318335
<caption>Figure 15: Example with triangular table.</caption>
319336
</figure>
320337
<figure>
321-
<location><page_15><loc_53><loc_72><loc_86><loc_85></location>
322-
</figure>
323-
<figure>
324338
<location><page_15><loc_53><loc_41><loc_86><loc_54></location>
325339
</figure>
326340
<caption><location><page_15><loc_50><loc_15><loc_89><loc_18></location>Figure 16: Example of how post-processing helps to restore mis-aligned bounding boxes prediction artifact.</caption>

tests/data/groundtruth/docling_v1/2203.01017v2.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

tests/data/groundtruth/docling_v1/2203.01017v2.md

Lines changed: 27 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -219,18 +219,40 @@ Table 4: Results of structure with content retrieved using cell detection on Pub
219219

220220
- Red - PDF cells, Green - predicted bounding boxes, Blue - post-processed predictions matched to PDF cells
221221

222-
Japanese language (previously unseen by TableFormer): Example table from FinTabNet:
222+
## Japanese language (previously unseen by TableFormer):
223223

224-
b. Structure predicted by TableFormer, with superimposed matched PDF cell text:
224+
## Example table from FinTabNet:
225225

226-
Japanese language (previously unseen by TableFormer): Example table from FinTabNet:b. Structure predicted by TableFormer, with superimposed matched PDF cell text:
227-
<!-- image -->
228226

227+
<!-- image -->
229228

229+
b. Structure predicted by TableFormer, with superimposed matched PDF cell text:
230230
<!-- image -->
231231

232+
233+
234+
| | | 論文ファイル | 論文ファイル | 参考文献 | 参考文献 |
235+
|----------------------------------------------------|-------------|----------------|----------------|------------|------------|
236+
| 出典 | ファイル 数 | 英語 | 日本語 | 英語 | 日本語 |
237+
| Association for Computational Linguistics(ACL2003) | 65 | 65 | 0 | 150 | 0 |
238+
| Computational Linguistics(COLING2002) | 140 | 140 | 0 | 150 | 0 |
239+
| 電気情報通信学会 2003 年総合大会 | 150 | 8 | 142 | 223 | 147 |
240+
| 情報処理学会第 65 回全国大会 (2003) | 177 | 1 | 176 | 150 | 236 |
241+
| 第 17 回人工知能学会全国大会 (2003) | 208 | 5 | 203 | 152 | 244 |
242+
| 自然言語処理研究会第 146 〜 155 回 | 98 | 2 | 96 | 150 | 232 |
243+
| WWW から収集した論文 | 107 | 73 | 34 | 147 | 96 |
244+
| | 945 | 294 | 651 | 1122 | 955 |
245+
232246
Text is aligned to match original for ease of viewing
233-
<!-- image -->
247+
248+
| | Shares (in millions) | Shares (in millions) | Weighted Average Grant Date Fair Value | Weighted Average Grant Date Fair Value |
249+
|--------------------------|------------------------|------------------------|------------------------------------------|------------------------------------------|
250+
| | RS U s | PSUs | RSUs | PSUs |
251+
| Nonvested on Janua ry 1 | 1. 1 | 0.3 | 90.10 $ | $ 91.19 |
252+
| Granted | 0. 5 | 0.1 | 117.44 | 122.41 |
253+
| Vested | (0. 5 ) | (0.1) | 87.08 | 81.14 |
254+
| Canceled or forfeited | (0. 1 ) | - | 102.01 | 92.18 |
255+
| Nonvested on December 31 | 1.0 | 0.3 | 104.85 $ | $ 104.51 |
234256

235257
Figure 5: One of the benefits of TableFormer is that it is language agnostic, as an example, the left part of the illustration demonstrates TableFormer predictions on previously unseen language (Japanese). Additionally, we see that TableFormer is robust to variability in style and content, right side of the illustration shows the example of the TableFormer prediction from the FinTabNet dataset.
236258
<!-- image -->
@@ -436,7 +458,6 @@ Figure 13: Table predictions example on colorful table.
436458
<!-- image -->
437459

438460
Figure 14: Example with multi-line text.
439-
<!-- image -->
440461

441462

442463
<!-- image -->
@@ -451,9 +472,6 @@ Figure 15: Example with triangular table.
451472
<!-- image -->
452473

453474

454-
<!-- image -->
455-
456-
457475
<!-- image -->
458476

459477
Figure 16: Example of how post-processing helps to restore mis-aligned bounding boxes prediction artifact.

tests/data/groundtruth/docling_v1/2203.01017v2.pages.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)