多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 #275

Roseleaves · 2022-04-20T09:29:27Z

运行环境

操作系统（Linux/macOS/Windows）：Windows
Python 版本：3.9.7
pypinyin 版本：0.46.0

问题描述

我希望构建一个韵母表或者音节表，来显示每个韵母或者音节中包含了文章的多少个字。多音字一律重复计数。
以音节表为例。它的行是声母，列是韵母。然后每格表示这个音节。
但是多音字模式并不能通过遍历由
pinyin(ch,style=Style.INITIALS,heteronym=True), pinyin(ch,style=Style.FINALS,heteronym=True) 所组成的 tuple 来确定每一个音节的行列。因为多音统计时被合并过，所以它不能一一对应。

另一个解决方案是直接记录 f = pinyin(ch,style=Style.TONE3,heteronym=True)，然后试图套用 finals(f[0][0]) 来得到这个音节的韵母。但是这个音节转韵母的函数并不存在。

问题复现步骤

>>> from pypinyin import *
>>> pinyin('噷',style=Style.TONE,heteronym=True)
[['hm', 'xīn', 'hēn']]
>>> pinyin('噷',style=Style.FINALS,heteronym=True)
[['in', 'en']]
>>> pinyin('噷',style=Style.INITIALS,heteronym=True)
[['h', 'x']]

我希望看到的输出

>>> from pypinyin import *
>>> pinyin('噷',style=Style.TONE,heteronym=True)
[['hm', 'xīn', 'hēn']]
>>> pinyin('噷',style=Style.FINALS,heteronym=True)
[['', 'in', 'en']]
>>> pinyin('噷',style=Style.INITIALS,heteronym=True)
[['h', 'x', 'h']]

我希望看到的另一种输出

>>> from pypinyin import *
>>> pinyin('噷',style=Style.TONE,heteronym=True)
[['hm', 'xīn', 'hēn']]
>>> pinyin('噷',style=Style.FINALS,heteronym=True)
[['m', 'in', 'en']]
>>> pinyin('噷',style=Style.INITIALS,heteronym=True)
[['h', 'x', 'h']]

The text was updated successfully, but these errors were encountered:

mozillazg · 2022-04-20T13:15:15Z

你的这个需求，可以考虑用 #225 (comment) 这里的方法对拼音做二次处理去获取相应的声母和韵母。

Roseleaves · 2022-04-25T11:05:26Z

哦哦哦，看到转换函数了！okk
此外，对这些现代汉语中已经统读的字或者已经消亡的读音，不知道是应该怎么处置。
它似乎对我的输入法造成了不小的麻烦。

>>> pinyin('之', heteronym=True)
[['zhī', 'zhū', 'zhì']]
>>> pinyin('怕', heteronym=True)
[['pà', 'bó']]
>>> pinyin('跑', heteronym=True)
[['pǎo', 'páo', 'bó']]
>>> pinyin('重', heteronym=True)
[['zhòng', 'chóng', 'tóng']]

mozillazg · 2022-04-25T13:52:50Z

可以看看这个 issue 里提到的方法: #198

mozillazg added the question label Apr 20, 2022

Roseleaves changed the title ~~多音字的各style元素不对应，或者不能由拼音提取声母/韵母~~ 多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 Apr 25, 2022

mozillazg closed this as completed May 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 #275

多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 #275

Roseleaves commented Apr 20, 2022

mozillazg commented Apr 20, 2022

Roseleaves commented Apr 25, 2022 •

edited

Loading

mozillazg commented Apr 25, 2022

多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 #275

多音字的各style拼音序列的元素不对应，或者不知如何由拼音提取声母/韵母 #275

Comments

Roseleaves commented Apr 20, 2022

运行环境

问题描述

问题复现步骤

我希望看到的输出

我希望看到的另一种输出

mozillazg commented Apr 20, 2022

Roseleaves commented Apr 25, 2022 • edited Loading

mozillazg commented Apr 25, 2022

Roseleaves commented Apr 25, 2022 •

edited

Loading