Multimodal-Calculator

A calculator with voice-recognition and button input, built with several Python libraries and the PAC (Presentation-Abstraction-Control) design pattern.

Running the program

To run the program, double-click the provided .exe file. A text-to-speech (TTS) voice assistant greets the user and asks whether they want to enter voice mode. Say ‘yes’ to enter voice mode; say nothing, or anything else, to go to the main interface. The voice assistant then prompts the user and guides them through voice mode.
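The startup choice described above can be sketched as a small decision function. This is only an illustration, not the project's actual code: the names `wants_voice_mode` and `choose_start_mode` are hypothetical, and it assumes the recognizer returns the reply as a string (or `None` when nothing was heard) and that only the single word "yes" counts as consent.

```python
from typing import Optional


def wants_voice_mode(transcript: Optional[str]) -> bool:
    """Return True only when the recognized reply is the single word 'yes'.

    `transcript` is the recognized text, or None when nothing was heard.
    Hypothetical helper; the matching rule is an assumption.
    """
    if transcript is None:
        return False
    # Normalize case and strip surrounding whitespace and punctuation.
    normalized = transcript.strip().lower().strip(".!?,")
    return normalized == "yes"


def choose_start_mode(transcript: Optional[str]) -> str:
    """Map the reply to a start mode: 'voice' or 'main' (labels are illustrative)."""
    return "voice" if wants_voice_mode(transcript) else "main"


if __name__ == "__main__":
    for reply in ["Yes", "no thanks", None]:
        print(repr(reply), "->", choose_start_mode(reply))
```

Treating silence and every other reply the same way keeps the default path (the main interface) reachable without any spoken input.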

Description of all software packages used

threading – runs a function in a separate thread alongside the GUI. It is used to keep the GUI from freezing while Speech Mode is in use.
tkinter – Python's standard library for building interactive GUIs. Everything visual in the project comes from this package: the main window, buttons, text boxes, and on-click handlers.
speech_recognition – a third-party Python library (installed as SpeechRecognition) that wraps speech-to-text engines and APIs from Microsoft, Google, IBM, CMU Sphinx, and more. It lets the program record user input and convert it into text.
gTTS – a Python library for text-to-speech that uses Google Translate's text-to-speech service.
os – provides access to operating-system functionality. Used in the program to delete .mp3 files soon after they are created, to minimize disk usage.
playsound – lets the program play .mp3 files.
pyinstaller – packages .py files into executables.
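As a rough sketch of why threading keeps the GUI responsive: a blocking speech-to-text call can run on a background thread and hand its result back through a queue, which the GUI thread checks without waiting. The code below is an assumption-laden illustration, not the project's implementation. `start_listener` and `drain` are hypothetical names, and the `listen` argument stands in for whatever blocking recognition call the program makes.

```python
import queue
import threading
from typing import Callable, List


def start_listener(listen: Callable[[], str],
                   results: "queue.Queue[str]") -> threading.Thread:
    """Run the blocking `listen` call on a background thread.

    `listen` is a placeholder for the program's speech-to-text call;
    its return value is passed back through `results`.
    """
    def worker() -> None:
        results.put(listen())

    # daemon=True so a stuck listener does not keep the process alive.
    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    return thread


def drain(results: "queue.Queue[str]") -> List[str]:
    """Collect any finished results without blocking the caller."""
    items: List[str] = []
    while True:
        try:
            items.append(results.get_nowait())
        except queue.Empty:
            return items


if __name__ == "__main__":
    results: "queue.Queue[str]" = queue.Queue()
    # A lambda stands in for real microphone input in this demo.
    thread = start_listener(lambda: "two plus two", results)
    thread.join()  # a GUI would poll periodically instead of joining
    print(drain(results))
```

In a tkinter program, `drain` would typically be called from a callback scheduled with the widget `after` method, so that all widget updates stay on the GUI thread.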
