Skip to content

Commit e0a941a

Browse files
committed
Merge branch 'master' of https://github.com/acnq/Crystal
2 parents 6d3579b + d129e54 commit e0a941a

File tree

205 files changed

+108930
-96
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

205 files changed

+108930
-96
lines changed

README.md

Lines changed: 0 additions & 96 deletions
Original file line numberDiff line numberDiff line change
@@ -1,96 +0,0 @@
1-
# Crystal Text-to-Visual-Speech (TTVS) Engine
2-
3-
Crystal TTVS engine is a real-time audio-visual Multilingual (Mandarin, Cantonese and English) speech synthesizer with a 3D expressive avatar.
4-
5-
The avatar model is parameterized according to the MPEG-4 facial animation standard, which offers a compact set of facial animation parameters (FAPs) and feature points (FPs) to enable realization of 20 visemes and 7 facial expressions. A set of TTVS engines (including Mandarin, Cantoense and English) converts the input phoneme sequence with time information into visemes and then further into FAP sequence. The 3D avatar animation is then rendered according to the FAP sequence by the Xface open source toolkit.
6-
7-
### Framework
8-
9-
* Real-time facial animation to generate synchronized lip movements on the avatar's face
10-
* Natual facial expression and head movement with human-like avatar
11-
* Platform independent framework for both stationary and embedded applications
12-
13-
<p align="center">
14-
<img width="700" src="./TTVS/docs/framework.jpg"/>
15-
</p>
16-
17-
### Reference
18-
19-
Please use the following papers for reference to this project:
20-
21-
* Zhiyong WU, Shen ZHANG, Lianhong CAI, Helen MENG, "[Real-time Synthesis of Chinese Visual Speech and Facial Expressions using MPEG-4 FAP Features in a Three-dimensional Avatar](http://www1.se.cuhk.edu.hk/~hccl/publications/pub/200609_Real-timeSynthesis.pdf)," [in] *International Conference on Spoken Language Processing (Interspeech2006, ICSLP)*, pp. 1802-1805. Pittsburgh, USA, 17-21 September 2006.
22-
23-
* Shen ZHANG, Zhiyong WU, Helen M. MENG, Lianhong CAI, "[Facial Expression Synthesis Using PAD Emotional Parameters for a Chinese Expressive Avatar](http://www1.se.cuhk.edu.hk/~hccl/publications/pub/ACII2007.pdf)," [in] *International Conference on Affective Computing and Intelligent Interaction (ACII2007)*, pp. 24-35. Lisbon, Portugal, 12-14 September 2007.
24-
25-
* Shen ZHANG, Zhiyong WU, Helen M. MENG, Lianhong CAI, "[Head Movement Synthesis based on Semantic and Prosodic Features for a Chinese Expressive Avatar](http://www1.se.cuhk.edu.hk/~hccl/publications/pub/ICASSP2007_ZHANG.pdf)," [in] *International Conference on Acoustics, Speech and Signal Processing (ICASSP2007)*, pp. 837-840. Hawaii, USA, April 15-20 2007.
26-
27-
28-
### Support of Multilingual
29-
30-
The engine supports TTVS in (but not limited to) the following languages: Chinese Mandarin, Cantonese, and English. You can actually implement your TTVS engine by overriding the **CSTHead::FapTTVS** (*/TTVS/FapTTVS.h/.cpp*), just like **CSTHead::FapMandarin**, **CSTHead::FapCantonese**, or **CSTHead::FapEnglish**.
31-
32-
### Support of Cross-platform
33-
34-
It is also possible to run the TTVS engine on different platforms, as the following figures illustrate.
35-
36-
<p align="center">
37-
<img width="500" src="./TTVS/docs/Crystal-Win.jpg"/>
38-
</p>
39-
40-
<p align="center">
41-
<img width="500" src="./TTVS/docs/Crystal-Android.jpg"/>
42-
</p>
43-
44-
### Support of Expressive Head Animation and Facial Expression
45-
46-
Six basic expressions of the 3D avatar:
47-
48-
<p align="center">
49-
<img width="500" src="./TTVS/docs/expressions.jpg"/>
50-
</p>
51-
52-
Head movement on the 3D avatar:
53-
54-
<p align="center">
55-
<img width="500" src="./TTVS/docs/headmovement.jpg"/>
56-
</p>
57-
58-
### Compiling the Projects
59-
60-
#### Compiling
61-
62-
1. Compile TinyXML
63-
- Open "TinyXML" solution in "TinyXML" directory
64-
- Set "tinyxmlSTL" project as "StartUp Project"
65-
- Set "Solution Platforms" to "Win32"
66-
- Compile "tinyxmlSTL" project in "Release" version with "Multi-threaded DLL (/MD)" option
67-
- Compile "tinyxmlSTL" project in "Debug" version (Optional) with "Multi-threaded debug DLL (/MDd)" option
68-
69-
2. Complie Xface
70-
- Open "XFace.sln" solution in "XFace" directory
71-
- Set "XFace" project as "StartUp Project"
72-
- Set "Solution Platforms" to "Win32"
73-
- Compile "Xface" project in "Release" version
74-
- Compile "Xface" project in "Debug" version
75-
76-
3. Compile TTVS
77-
- Open "TTVSTest.sln" solution in "TTVSTest" directory
78-
- Set "TTVSTest" project as "Startup Project"
79-
- Set "Solution Platforms" to "Win32"
80-
- Compile "TTVS" and "TTVSTest" project in "Release" version
81-
- Compile "TTVS" and "TTVSTest" project in "Debug" version
82-
83-
#### Runing the demo
84-
85-
- Go to "binary/win32" directory
86-
- Go to "release" or "debug" directory
87-
- Run TTVSTest.exe
88-
- Select menu "File/Open" to open the "fdp" file in "data/talkinghead/Song/song.fdp"
89-
- Select menu "TTVS/Mandarin", "TTVS/Cantonese", "TTVS/English" to switch between different langauges.
90-
- Select menu "TTVS/Screen Animation" for demo animation.
91-
92-
### References
93-
94-
* Motion Pictures Expert Group, ISO/IEC 14496-2:1999/Amd. 1: 2000(E). International Standard, Information Technology - Coding of Audio-Visual Objects. Part 2: Visual; Amendment 1: Visual Extensions.
95-
* Balci, K., "Xface: MPEG-4 based Open Source Toolkit for 3D Facial Animation", [in] *Advance Visual Interfaces*, 399-402, 2004.
96-
* [Xface 1.0 open source tools](http://xface.fbk.eu/)

binary/Win32.Debug/CrystalTest.bat

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
CrystalTest.exe config.xml

binary/Win32.Debug/config.xml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
<?xml version="1.0" encoding="UTF-8" ?>
2+
<modules xml:lang="zh-cmn">
3+
<dlib file="zh.text.dll" />
4+
<dlib file="zh.hts.dll" />
5+
<textdata path="..\..\data\putonghua\text" loaddata="true" />
6+
<voicedata path="..\..\data\putonghua\hts" />
7+
</modules>

binary/Win32.Release/CrystalTest.bat

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
CrystalTest.exe config.xml

binary/Win32.Release/config.xml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
<?xml version="1.0" encoding="UTF-8" ?>
2+
<modules xml:lang="zh-cmn">
3+
<dlib file="zh.text.dll" />
4+
<dlib file="zh.hts.dll" />
5+
<textdata path="..\..\data\putonghua\text" loaddata="true" />
6+
<voicedata path="..\..\data\putonghua\hts" />
7+
</modules>

data/putonghua/hts/hts.cfg

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
-td voice/tree-dur.inf
2+
-tm voice/tree-mgc.inf
3+
-tf voice/tree-lf0.inf
4+
-tl voice/tree-lpf.inf
5+
-md voice/dur.pdf
6+
-mm voice/mgc.pdf
7+
-mf voice/lf0.pdf
8+
-ml voice/lpf.pdf
9+
-mu voice/unvoiced.prop
10+
-dm voice/mgc.win1
11+
-dm voice/mgc.win2
12+
-dm voice/mgc.win3
13+
-df voice/lf0.win1
14+
-df voice/lf0.win2
15+
-df voice/lf0.win3
16+
-dl voice/lpf.win1
17+
-s 16000
18+
-p 80
19+
-a 0.42
20+
-g 0
21+
-b 0.4
22+
-l
23+
-r 1
24+
-vp

data/putonghua/hts/voice/_readme.txt

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
This is the voice files for HTS engine for Chinese Mandarin and English from HCSI.
2+
3+
(with initial-final format for Chinese Mandarin)
4+
(with phonemes for English)

data/putonghua/hts/voice/dur.pdf

128 KB
Binary file not shown.

data/putonghua/hts/voice/lf0.pdf

1.31 MB
Binary file not shown.

data/putonghua/hts/voice/lf0.win1

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
1 1.0

0 commit comments

Comments
 (0)