Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

修正簡繁轉換及地區用字轉換:冢=>冢/塚 #515

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

danny0838
Copy link
Contributor

之前的討論參見 #430

「手塚」「鬼塚」等日本人名或許可以考慮加入 STPhrases.txt。

@@ -12,6 +12,7 @@
兒 児
內 内
兩 両
冢 塚
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

該轉換不屬於新字體。

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

「塚」在《常用漢字表》中,「冢」不在《常用漢字表》、《人名用漢字表》、《表外漢字字體表》中,加上兩字是異體字關係,可以相信日本原則上會用「塚」取代「冢」,應比照其他「OpenCC 正體轉日本標準」的實例處理,如「內=>内」、「鬥=>闘」、「剎=>刹」。

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

一般來說,日本傾向於不轉換不常見的異體字。我傾向於不變。

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

一般來說,日本傾向於不轉換不常見的異體字。我傾向於不變。

這說法和 JPVariants.txt 表的慣例不符。目前 JPVariants.txt 表有包括 OpenCC 正體轉日本新字體,比如「內=>内」(參見 eeb5339)。而很顯然地,OpenCC 正體在日本都是「不常見的異體字」(日本人原則上會使用標準字,而不使用它們)。

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

內 内 屬於字形不同的異體字,「冢 塚」還有古今關係。

日文有用「冢」的專有名詞。而沒有用「內」「剎」的專有名詞。

https://ja.wikipedia.org/wiki/%E6%B1%B2%E5%86%A2%E6%9B%B8
https://ja.wikipedia.org/wiki/%E5%86%A2%E7%94%B0%E5%A4%A7%E5%B3%AF

Copy link
Contributor Author

@danny0838 danny0838 Mar 4, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

你說的也對,但這樣就會造成比如台灣正體「塚」=>OpenCC正體「冢」=>日本新字體「冢」的問題,反而無法顧及大多數常用字。

比照其他OpenCC正體轉地區正體的慣例,理論上應該是要為少數古字另外建詞庫處理才對。原轉換表可改成「冢=>塜冢」。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants