Skip to content

Commit

Permalink
formatting and slight typo
Browse files Browse the repository at this point in the history
  • Loading branch information
tmthyln committed Jan 16, 2024
1 parent 5025726 commit b0f44aa
Showing 1 changed file with 6 additions and 4 deletions.
10 changes: 6 additions & 4 deletions src/frequency_datasets.jl
Original file line number Diff line number Diff line change
Expand Up @@ -26,12 +26,14 @@ const LCMC_CATEGORIES = Dict(
"""
SimplifiedLCMC([categories])
A character frequency dataset: Lancaster Corpus for Mandarin Chinese, simplified terms only,
A word frequency dataset: Lancaster Corpus for Mandarin Chinese, simplified terms only,
based on simplified text corpus. See their
[website](https://www.lancaster.ac.uk/fass/projects/corpus/LCMC/default.htm) for more details about the corpus.
[website](https://www.lancaster.ac.uk/fass/projects/corpus/LCMC/default.htm)
for more details about the corpus.
The character frequency can be based only on selected categories (see `CJKFrequencies.LCMC_CATEGORIES` for valid
category keys and corresponding category names). Any invalid categories will be ignored.
The word frequency can be based only on selected categories
(see `CJKFrequencies.LCMC_CATEGORIES` for valid category keys and corresponding category names).
Any invalid categories will be ignored.
## Examples
Loading all the categories:
Expand Down

0 comments on commit b0f44aa

Please sign in to comment.