Skip to content

Commit

Permalink
Skip single character & too long pages (closes #1 & felixonmars#2)
Browse files Browse the repository at this point in the history
  • Loading branch information
felixonmars authored and jactry committed May 24, 2020
1 parent 16400de commit d7384c0
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion convert.py
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,11 @@
if not HANZI_RE.match(line):
continue

# Skip list page.
# Skip single character & too long pages
if not 1 < len(line) < 9:
continue

# Skip list pages
if line.endswith('\u5217\u8868'):
continue

Expand Down

0 comments on commit d7384c0

Please sign in to comment.