We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
就像是PubLayNet一样 但是PubLayNet最局限的地方在于分类太少了 很难满足实际需求
The text was updated successfully, but these errors were encountered:
看起来很难, 因为这个数据是token级别的, 你想要的应该是block级别的
Sorry, something went wrong.
Allen 的 vila 项目里提供了从 tokenlevel 合成 blocklevel 的数据,但是看起来质量不高
目前找下来最好的还是 doclaynet 这个数据集
No branches or pull requests
就像是PubLayNet一样
但是PubLayNet最局限的地方在于分类太少了
很难满足实际需求
The text was updated successfully, but these errors were encountered: