v0.3.1
Main changes
- Remove preparation scripts and distribute precompiled binaries #87
- Add DualConnector, a faster and smaller dictionary format #86
Precompiled dictionary files
We provide precompiled dictionaries for Vibrato, allowing you to get started with tokenization easily. You can download them from Assets in this release.
The following five variants are distributed:
- ipadic-mecab-2_7_0/system.dic from IPADIC v2.7.0
- jumandic-mecab-7_0/system.dic from mecab-jumandic-utf8 v7.0
- naist-jdic-mecab-0_6_3b/system.dic from NAIST Japanese Dictionary v0.6.3b
- unidic-mecab-2_1_2/system.dic from UniDic v2.1.2
- unidic-cwj-3_1_1/system.dic from UniDic v3.1.1
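As a minimal sketch of locating one of these files after download, the snippet below maps a variant name to its `system.dic` path. It assumes the asset was unpacked under a local `dict` directory with the `<variant>/system.dic` layout shown in the list. The helper `system_dic_path` and the `dict` directory are hypothetical names chosen for illustration and are not part of Vibrato.

```rust
use std::path::{Path, PathBuf};

/// Variant directory names listed in this release.
const VARIANTS: [&str; 5] = [
    "ipadic-mecab-2_7_0",
    "jumandic-mecab-7_0",
    "naist-jdic-mecab-0_6_3b",
    "unidic-mecab-2_1_2",
    "unidic-cwj-3_1_1",
];

/// Hypothetical helper (not a Vibrato API): returns `<root>/<variant>/system.dic`
/// when `variant` is one of the names above, and `None` otherwise.
fn system_dic_path(root: &Path, variant: &str) -> Option<PathBuf> {
    VARIANTS
        .contains(&variant)
        .then(|| root.join(variant).join("system.dic"))
}

fn main() {
    // Assumption: the downloaded asset was unpacked under ./dict.
    let root = Path::new("dict");
    match system_dic_path(root, "ipadic-mecab-2_7_0") {
        Some(path) if path.is_file() => println!("found {}", path.display()),
        Some(path) => println!("expected a dictionary at {}", path.display()),
        None => println!("unknown variant"),
    }
}
```

The resolved path is what one would open and hand to the dictionary loader. That call is left out here because its exact signature depends on the crate version.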
These system dictionaries were compiled and modified as described in compile.md and map.md. We trained the connection-id mappings using license-expired data obtained from Aozora Bunko, following the guideline.
The licenses are contained in each file.