Skip to content

Commit

Permalink
Skip single character & too long pages (closes felixonmars#1 & felixo…
Browse files Browse the repository at this point in the history
  • Loading branch information
felixonmars committed May 23, 2020
1 parent 4aeb12f commit 8105228
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion convert.py
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,11 @@
if not HANZI_RE.match(line):
continue

# Skip list page.
# Skip single character & too long pages
if not 1 < len(line) < 9:
continue

# Skip list pages
if line.endswith('\u5217\u8868'):
continue

Expand Down

0 comments on commit 8105228

Please sign in to comment.