Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
1 parent 45e793a, commit 764cf0c
Showing 5 changed files with 395 additions and 1 deletion.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
# Tokenizer Viewer | ||
|
||
Tokenizer Viewer is a tool for quickly previewing a tokenizer. | ||
|
||
Its features include: | ||
|
||
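- [x] View the vocabulary contents and character-count statistics. | ||
- [x] Search whether the tokenizer contains a given token. | ||
- [x] Test encoding (encode) and decoding (decode). | ||
- [x] Compare the token differences between 2 tokenizers. | ||
- [ ] Merge 2 different tokenizers. | ||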
|
||
Use `strat.sh` to launch the app; the port can be changed with `--server.port`. | ||
|
||
```sh | ||
streamlit run web_ui.py --server.port 8001 | ||
``` | ||
|
||
<div align='center'> | ||
|
||
<img src='assets/preview.png'> | ||
|
||
</div> |
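
The vocabulary statistics, token search, and tokenizer comparison features listed in the README can be sketched in plain Python. This is a minimal illustration, not the code from this commit: the helper names `vocab_stats`, `contains_token`, and `diff_vocabs` are hypothetical, and the toy vocabularies stand in for real ones. With a Hugging Face tokenizer, a comparable token-to-id mapping would typically come from `tokenizer.get_vocab()`; whether `web_ui.py` does this is an assumption.

```python
from collections import Counter


def vocab_stats(vocab):
    """Return the vocabulary size and a histogram of token lengths (in characters)."""
    lengths = Counter(len(token) for token in vocab)
    return {"size": len(vocab), "length_histogram": dict(lengths)}


def contains_token(vocab, token):
    """Check whether an exact token string is present in the vocabulary."""
    return token in vocab


def diff_vocabs(vocab_a, vocab_b):
    """Split the tokens of two vocabularies into A-only, B-only, and shared."""
    a, b = set(vocab_a), set(vocab_b)
    return {
        "only_in_a": sorted(a - b),
        "only_in_b": sorted(b - a),
        "shared": sorted(a & b),
    }


# Toy vocabularies (hypothetical data) mapping token -> id.
vocab_a = {"[PAD]": 0, "[UNK]": 1, "hello": 2, "world": 3}
vocab_b = {"[PAD]": 0, "[UNK]": 1, "hello": 2, "token": 3, "##izer": 4}

stats = vocab_stats(vocab_a)
diff = diff_vocabs(vocab_a, vocab_b)
print(stats)
print(contains_token(vocab_a, "hello"))
print(diff)
```

Comparing on token strings rather than ids keeps the diff meaningful when two tokenizers assign different ids to the same token.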
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
streamlit run web_ui.py --server.port 8001 |