
ZAS-REP-TOOLS

This tool was developed within the framework of the linguistics project "The Pragmatic Status of Iconic Meaning in Spoken Communication: Gestures, Ideophones, Prosodic Modulations" (PSIMS) as a Bachelor's thesis.



ZAS-REP-TOOLS is a bundle of tools for the automatic extraction and quantification of repetitions from unstructured textual data collections in different languages, with an additional search engine wrapped around the extracted data and an on-board Twitter streamer for collecting real-time tweets.

(This README is still a work in progress. If you have any suggestions for improvement, please contact Egor Savin.)


For a quick start, first download and install all dependencies, then install the tool, and afterwards go to the WorkFlow and Tutorials sections to begin.




  1. Hardware Requirements

  2. Dependencies Installation

    • On Linux
    • On Windows
    • On MacOS
  3. Setting up

  4. Definitions

  5. Functionality

  6. WorkFlow

  7. Tutorials

    • Python Package Tutorial
    • Command line Tutorial
  8. Input/Output

    • File Formats
    • Columns Explanation in the Output Tables
  9. Restrictions

  10. Citing ZAS-REP-TOOLS

  11. Possible errors and warnings

  12. Bugs

  13. Data-Examples

  14. Acknowledgements







Back to top

1. Hardware Requirements

        Minimum:          Average:
CPU     2 Core 2 GHz      4 Core 2.6 GHz
RAM     8 GB              16 GB






Back to top

2. Dependencies

In order to use ZAS-REP-TOOLS you'll need the following dependencies installed in addition to the source code provided here:

Dependencies Installation

The following installation commands should be seen as a rough guide and may become outdated over time. What matters is that all of the listed dependencies are installed before you set up the tool.

The '$' symbol marks the beginning of a command that should be copy-pasted into the terminal window.

On Linux (Ubuntu 16.04.5 LTS)
  1. open Terminal/Bash/Shell

  2. Add other repositories

     $ sudo add-apt-repository ppa:jonathonf/python-3.6
     $ sudo add-apt-repository ppa:linuxuprising/java
     $ sudo add-apt-repository ppa:git-core/ppa
    
  3. Upgrade default linux tools

     $ sudo apt-get update
     $ sudo apt-get upgrade
    
  4. Install additional software

     $ sudo apt-get install python-setuptools python-dev  build-essential autoconf libtiff5-dev libjpeg8-dev zlib1g-dev libfreetype6-dev liblcms2-dev libwebp-dev tcl8.6-dev tk8.6-dev python-tk  libtool pkg-config python-opengl python-pyrex python-pyside.qtopengl idle-python2.7 qt4-dev-tools qt4-designer libqtgui4 libqtcore4 libqt4-xml libqt4-test libqt4-script libqt4-network libqt4-dbus python-qt4 python-qt4-gl libgle3 python-dev libssl-dev g++ openssl git
    
  5. Python Installation

     -> python + pip 
         $ sudo apt install python2.7 python-pip python3.6 python3-pip 
         $ sudo -H pip2 install --upgrade pip setuptools
         $ sudo -H pip3 install --upgrade pip setuptools
             --- ensure you have Python 3.6: http://ubuntuhandbook.org/index.php/2017/07/install-python-3-6-1-in-ubuntu-16-04-lts/
         $ alias python3=python3.6
         $ alias pip=pip2
    
     -> Additional Python3 packages, which will not be installed automatically 
         $ sudo -H python3 -m pip install somajo someweta
    
  6. Sqlite + Pysqlcipher

     $ sudo apt-get install sqlite3 sqlcipher
     $ sudo -H python2 -m pip install pysqlcipher --install-option="--bundled"
    
  7. JAVA

     $ sudo apt-get install oracle-java11-installer 
    

On Windows (Win10)

(this tool can only be used on Windows 10 or later)

1. Microsoft Visual C++ Compiler for Python 2.7
   (https://www.microsoft.com/en-us/download/details.aspx?id=44266)
2. Enable in Features - "Windows Subsystem for Linux"
    (https://docs.microsoft.com/en-us/windows/wsl/install-win10)
3. Enable in Settings - "Developer Mode"
    (https://www.wikihow.com/Enable-Developer-Mode-in-Windows-10)
4. Install Ubuntu 16.04 from the Windows Store
    (https://devtidbits.com/2017/11/09/ubuntu-linux-on-windows-10-how-to/)
5. Go to the Ubuntu Bash 
6. Now follow the Linux instructions above
*** root path to the Ubuntu directory on Windows: C:\Users\<username>\AppData\Local\Packages\CanonicalGroupLimited.UbuntuonWindows_79rhkp1fndgsc\LocalState\rootfs\home

On macOS (10.13.6)
  1. open Terminal

  2. Install brew

     $ /usr/bin/ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"
    
  3. Install Python

     -> python + pip (2+3)
    
         $ brew install python2 python3
         $ sudo python2 -m ensurepip
         $ sudo python3 -m ensurepip
    
      Ensure you have Python 3.6 and not a later version:
    
         $ brew unlink python3
         $ brew install https://raw.githubusercontent.com/Homebrew/homebrew-core/f2a764ef944b1080be64bd88dca9a1d80130c558/Formula/python.rb
         $ brew link python3
         $ pip2 install --upgrade pip setuptools wheel 
         $ pip3 install --upgrade pip setuptools wheel
         $ alias pip=pip2
    
     -> Additional python packages (for Python3)
         (which will not be installed automatically)  
    
         $ sudo python3 -m pip install somajo someweta
    
  4. Sqlite + Pysqlcipher

         $ brew install sqlite openssl sqlcipher git-lfs
         $ sudo -H python2 -m pip install pysqlcipher --install-option="--bundled"
    
  5. Latest Java version (for the TweetNLP Tokenizer and POS Tagger)

         $ brew cask install java
    






Back to top

3. Setting up

Set the background color of your terminal to dark (e.g. black, dark blue, etc.)

1. Package Installation

   1. open Terminal
   2. $ sudo python2 -m pip install zas-rep-tools

2. User Configuration

Before you can test and work with zas-rep-tools, you need to configure it. To set the current directory as your project folder, enter '.' during the setup process.

    $ cd <path_to_the_project_folder>
    $ sudo zas-rep-tools configer prjdir print
3. Package Tester

To be sure that your installation works error-free on the current system, please run the tests with the following command. Be aware that this can take around 10-20 min.

   $ sudo zas-rep-tools testspath | sudo xargs -I {} sudo python2 -m  nose -s -v --rednose {}






Back to top

4. Definitions

  • Repetitions (rep) We differentiate between two main types of repetitions:
 * **Replications** (repl)
    -> every repetition on the letter level, starting from 3 repetitions of the same letter (web links excluded)
    -       ex. 'veeeeeerrrry'
                (The following letters were replicated: 'e', 'r')
 * **Reduplications** (redu)
    -> every repetition on the word level
    -       ex. 'very very very'
                (The following word was reduplicated: 'very')

And one additional compound repetition type:
* **Replications in Reduplications**
    -        ex. 'veeeerrrryyy veeery very vvvverryyy'
                (Here we see one reduplication of the word 'very' with length 4, as well as 3 uniq and 6 exhausted replications)
  • Length

    • Replications -> the number of replicated letters
      •   ex: 'verrrry'
              ('r' was replicated and has a length of 4)
        
    • Reduplications -> the number of words in one reduplication
      •   ex: 'very very very much much much much'
              (There are two reduplications: 'very' and 'much'. The reduplication of 'very' has a length of 3 and the reduplication of 'much' a length of 4.)
        
  • Uniq vs. Exhausted Re(du)plication (Every repetition can be quantified in a uniq or an exhausted way)

    • Replications -> Every replicated letter is counted once as an exhausted replication, and each word containing one or more replications is counted once as a uniq replication.

      •    ex: 'vvvvveeery'
              **Exhausted**
                  = 2 ('v' and 'e' were replicated)
              **Uniq**
                  = 1 (the word 'very' contains replications, but the word is counted just once in this category)
        
    • Reduplications -> Every reduplicated word is counted once as a uniq reduplication, and the length of the reduplication is counted as the exhausted reduplication.

      •   ex: 'very very very or not very very but very'
              **Uniq**
                  for 'very' = 2 (not 3) 
              **Exhausted**
                  for 'very' = 5 (not 6)
        
  • Syntagma Combination/group of words according to the rules of syntax for the current language (each word can also be combined with an empty word). - ex: 'very much', 'very'

  • Scope The length of a syntagma

    •   ex: 'very much' = 2; 'very' = 1;
      
  • Full Repetitiveness (Each syntagma with scope > 1 has an additional attribute - full repetitiveness.)
 - **Replications**
    If every element of a syntagma was replicated, then full repetitiveness for this syntagma is True.
        -   ex: 'iiii llllovvveee verrry muuuuch', 'veryyyy verrryy muccchhhh', 'verrrrryyy muchhhhhh'
 - **Reduplications**
    If every element of a syntagma was reduplicated, then full repetitiveness for this syntagma is True.
        -   ex: 'very very very much much', "veeeerrryyy veeerryy mmmuuccc muuucch much "

This behavior can be switched on/off for a StatsDB with the option '--full_repetativ_syntagma'. If this option is on, then just the fully repetitive syntagmas will be available/matched by the export function. If it is off, then all syntagmas that contain at least one re(du)plication can be matched/exported.
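
To make the counting concrete, here is a minimal Python sketch of the two counting modes (an illustration only, not the tool's actual implementation; the threshold of 3 mirrors the replication definition above, and tokenization is done naively with split()):

    import re
    from itertools import groupby

    def count_replications(word, repl_up=3):
        # Exhausted: every letter run of length >= repl_up counts once.
        # Uniq: a word containing at least one such run counts once.
        runs = [m.group(0) for m in re.finditer(r"(.)\1*", word)
                if len(m.group(0)) >= repl_up]
        return (1 if runs else 0), len(runs)

    def count_reduplications(tokens):
        # Uniq: every run of >= 2 identical adjacent words counts once.
        # Exhausted: the lengths of these runs are summed up per word.
        counts = {}
        for word, group in groupby(tokens):
            n = len(list(group))
            if n >= 2:
                uniq, exh = counts.get(word, (0, 0))
                counts[word] = (uniq + 1, exh + n)
        return counts

    print(count_replications("vvvvveeery"))                   # -> (1, 2)
    print(count_reduplications(
        "very very very or not very very but very".split()))  # -> {'very': (2, 5)}

Both outputs match the 'vvvvveeery' and 'very very very or not very very but very' examples above.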






Back to top

5. Functionality

Back to top

5.1 CLI-Commands

The current command line interface supports interaction for the following entry points:

  • configer Set/delete/reset all user information
  • corpora Work with corpus databases
  • stats Work with statistics databases
  • streamer Stream Twitter data

The following part gives a small explanation of each entry point:

  •       $ zas-rep-tools configer
    
    • prjdir The folder where all user data will be saved for the current copy of the tool.
      • clean
      • set
      • reset
      • print
    • error_track The tool supports error tracking and reporting to the developers
      • set
      • reset
      • respeak
      • print
    • twitter Sets the Twitter credentials for the streamer
      • set
      • reset
      • respeak
      • print
    • email To get error reports from the Twitter streamer by email, please provide your email addresses
      • set
      • reset
      • respeak
      • print
    • user_data
      • clean
      • location
      • print

  •       $ zas-rep-tools corpora
    
    • add create a new corpus from a given text collection and add it to the project folder
    • del delete an existing corpus (by name) from the project folder
    • names print the names of all existing corpora in the project folder
    • meta print all meta-data for a corpus (by corpus name)
    • basic_stats print general statistics for a corpus (by corpus name)
    • update_attr change meta-data for a corpus (by corpus name)
    • export export a corpus into another file type (xml, csv, json)
    • used_tools print all used NLP methods and tools
    • clean_dir delete corrupted corpora from the project folder
    • cols print the column names for a given table in a corpus
    • doc print the whole content of a given doc (by doc_id)
    • ids print the ids of all docs in a corpus (by corpus name)

  •       $ zas-rep-tools stats
    
    • compute compute a new stats-db (from a given corpus) and place it in the project folder
    • del delete an existing stats-db (by name) from the project folder
    • names print the names of all existing stats-dbs in the project folder
    • meta print all meta-data for a stats-db (by name)
    • basic_stats print general statistics for a stats-db (by name)
    • update_attr change meta-data in a stats-db (by name)
    • export export statistics as csv, xml, json
    • clean_dir delete corrupted stats-dbs from the project folder
    • recompute recompute a stats-db with a different full-repetitiveness marker
    • optimize space and speed optimization by freezing the stats-db
    • recreate_indexes recreate the indexes in a stats-db for better performance
  •       $ zas-rep-tools streamTwitter
    
  •       $ zas-rep-tools streamerInfo
    
    • enc Supported Encodings
    • lang Supported Languages for Streamer
    • nltk_lang Supported Languages for NLTK
    • twitter_lang Supported Languages for TwitterAPI
    • classiefier_lang Supported Languages for Language Classifier
    • stop_words Predefined Stopwords
    • platforms Supported Platforms
  •       $ zas-rep-tools help
    

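Calls combine an entry point with one or more commands, as in the usage lines of the next section. For example, managing the project folder (cf. the Setting up section; 'set' is expected to prompt for the path, where '.' selects the current directory):

    $ zas-rep-tools configer prjdir set
    $ zas-rep-tools configer prjdir print
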

Back to top

5.2 CLI-Options

  • zas-rep-tools configer --help

      Usage: zas-rep-tools configer [OPTIONS] COMMAND1 COMMAND2
    
      Options:
        -m, --mode [error|test|dev|dev+|dev-|prod|free|prod+t|test+s+|test+s-|silent|prod+|prod-|blind]
                                        Set one of the Tool Modus
        -ld, --logdir TEXT              Choose the name of the Directory for log
                                        data.
        --help                          Show this message and exit.
    

  • zas-rep-tools corpora --help

      Usage: zas-rep-tools corpora [OPTIONS] COMMAND1
    
      Options:
        -sb, --status_bar BOOLEAN       Enable/Disable the status bar
        -uefm, --use_end_file_marker BOOLEAN
                                        Enable/Disable the end-file marker, which
                                        changes the counter unit of the status bar
                                        from rows to files
        -tscc, --tok_split_camel_case BOOLEAN
                                        Enable/Disable the tokenizer option to
                                        convert and split CamelCase (ex.
                                        'CamelCase')
        -backup, --make_backup BOOLEAN  Enable/Disable making a backup of the whole
                                        corpus before new insertions
        -lb, --lazyness_border INTEGER  Set the threshold that determines when
                                        exactly the data collector saves data to
                                        disk. If you have a lot of RAM, select a
                                        high number to ensure high performance.
        -rw, --rewrite BOOLEAN          Enable/Disable the rewrite option, which
                                        ensures that files are replaced/rewritten
                                        during export if the same filename is found
                                        in the same directory.
        -uc, --use_cash BOOLEAN         Enable/Disable whether the insertion process
                                        writes directly to disk or first into a
                                        cache. It is a good performance booster, but
                                        only with a lot of RAM.
        -opt, --optimizer TEXT          Enable/Disable the DB optimizer, which makes
                                        the current DB much faster, but less safe.
        -optps, --optimizer_page_size INTEGER
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optcs, --optimizer_cache_size INTEGER
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optlm, --optimizer_locking_mode [normal|exclusive]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optsyn, --optimizer_synchronous [1|0|3|2|normal|off|extra|full]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optjm, --optimizer_journal_mode [delete|truncate|persist|memory|wal|off]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optts, --optimizer_temp_store [1|0|2|file|default|memory]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -gr, --gready BOOLEAN           If False -> stop the process immediately if
                                        an error was returned. If True -> try to
                                        keep the script running as long as possible,
                                        without stopping the main process.
        -cn, --corp_fname TEXT          File name of the CorpusDB (with or without
                                        extension)
        -lang, --language [test|de|en|False]
                                        Give the language acronym according to
                                        standard ISO639_2.
        -vis, --visibility [extern|intern|False]
                                        Is this an intern or extern corpus?
        -platf, --platform_name TEXT
        -encrkey, --encryption_key TEXT
                                        To encrypt the current DB, please give a
                                        key. If no key is given, the DB will not be
                                        encrypted.
        -cname, --corp_intern_dbname TEXT
                                        Intern name of the DB, which will be saved
                                        as a tag inside the DB.
        -src, --source TEXT             Source of the text collection.
        -lic, --license TEXT            License, under which this corpus will be
                                        used.
        -templ, --template_name [blogger|twitter|False]
                                        Templates are there to initialize the
                                        document table in the DB. Every column in
                                        the document table should be initialized.
                                        For this you can use templates (which
                                        contain predefined information) or
                                        initialize those columns manually with the
                                        '--cols_and_types_in_doc' option.
        -ver, --version INTEGER         Version number of the DB
        -additcols, --cols_and_types_in_doc TEXT
                                        Additional columns from the input text
                                        collections. Every column in the document
                                        table should be initialized. Every document
                                        table already has two default columns (id,
                                        text); if you want to insert other columns
                                        as well, please define them here together
                                        with their type names. The column names
                                        should correspond to the column names in
                                        the input text data and be given in the
                                        following form:
                                        'colname1:coltype1,colname2:coltype2,colname3:coltype3'
        -cid, --corpus_id_to_init TEXT  Manually given corpid
        -tok, --tokenizer [somajo|nltk|False|True]
                                        Select Tokenizer by name
        -ptager, --pos_tagger [someweta|tweetnlp|False|True]
                                        Select POS-Tagger by name
        -sentim, --sentiment_analyzer [textblob|False|True]
                                        Select Sentiment Analyzer by name
        -sentspl, --sent_splitter [pystemmer|False|True]
                                        Select Stemmer by name
        -preproc, --preprocession BOOLEAN
                                        Enable/disable preprocessing of the text
                                        elements.
        -langclas, --lang_classification TEXT
                                        Enable/disable Language Classification
        -durl, --del_url BOOLEAN        Enable/disable Hiding of all URLs
        -dpnkt, --del_punkt BOOLEAN     Enable/disable Hiding of all Punctuation
        -dnum, --del_num BOOLEAN        Enable/disable Hiding of all Numbers
        -dment, --del_mention BOOLEAN   Enable/disable Hiding of all Mentions
        -dhash, --del_hashtag BOOLEAN   Enable/disable Hiding of all Hashtags
        -dhtml, --del_html BOOLEAN      Enable/disable cleaning of all  not needed
                                        html tags
        -case, --case_sensitiv BOOLEAN  Enable/disable the case sensitivity in the
                                        Corpus during initialization.
        -emojnorm, --emojis_normalization BOOLEAN
                                        Enable/disable restructure of all Emojis.
                                        (could cost much time)
        -texname, --text_field_name TEXT
                                        If the new input data uses a different name
                                        for the text field, use this option to
                                        ensure the correct use of the data.
        -idname, --id_field_name TEXT   If the new input data uses a different name
                                        for the id field, use this option to ensure
                                        the correct use of the data.
        -heal, --heal_me_if_possible BOOLEAN
                                        If '--template_name' and
                                        '--cols_and_types_in_doc' weren't selected,
                                        then with this option the DB will try to
                                        initialize this information automatically.
                                        But be careful with this option, because it
                                        could also lead to unexpected errors.
        -ptr, --path_to_read TEXT       Path to the folder with the text collection
                                        that should be collected and transformed
                                        into a CorpusDB.
        -readtyp, --file_format_to_read [txt|json|xml|csv|False]
                                        File format that should be read.
        -readregextempl, --reader_regex_template [blogger|False]
                                        Name of the template for reading the TXT
                                        files.
        -readregexpattern, --reader_regex_for_fname TEXT
                                        Regex pattern for the extraction of the
                                        columns from the filenames.
        -zipread, --read_from_zip BOOLEAN
                                        Enable/Disable the possibility to also
                                        search and read automatically from *.zip
                                        archives.
        -formatter, --formatter_name [twitterstreamapi|sifter|False]
                                        Give the name of the predefined Formatters
                                        and Preprocessors for different text
                                        collections.
        -retweetsignr, --reader_ignore_retweets BOOLEAN
                                        Ignore retweets, if the original JSON tweet
                                        was given.
        -minfile, --min_files_pro_stream INTEGER
                                        The limit at which multiprocessing starts
                                        to create a new stream.
        -csvd, --csvdelimiter TEXT      CSV files often have different dialects and
                                        delimiters. With this option it is possible
                                        to set a delimiter that ensures the correct
                                        processing of the CSV file data.
        -enc, --encoding [bz2_codec|cp1140|rot_13|cp932|euc_jisx0213|cp037|hex_codec|cp500|uu_codec|big5hkscs|mbcs|euc_jis_2004|iso2022_jp_3|iso2022_jp_2|iso2022_jp_1|gbk|iso2022_jp_2004|quopri_codec|cp424|iso2022_jp|mac_iceland|hp_roman8|iso2022_kr|euc_kr|cp1254|utf_32_be|gb2312|cp850|shift_jis|cp852|cp855|utf_16_le|cp857|cp775|cp1026|mac_latin2|utf_32|mac_cyrillic|base64_codec|ptcp154|euc_jp|hz|utf_8|utf_32_le|mac_greek|utf_7|mac_turkish|cp949|zlib_codec|big5|iso8859_9|iso8859_8|iso8859_5|iso8859_4|iso8859_7|iso8859_6|iso8859_3|iso8859_2|gb18030|shift_jis_2004|mac_roman|cp950|utf_16|iso8859_15|iso8859_14|tis_620|iso8859_16|iso8859_11|iso8859_10|iso8859_13|ascii|cp869|utf-8|cp860|cp861|cp862|cp863|cp864|cp865|cp866|shift_jisx0213|cp1255|latin_1|cp1257|cp1256|cp1251|cp1250|cp1253|cp1252|cp437|cp1258|tactis|koi8_r|utf_16_be|johab|iso2022_jp_ext|cp858]
                                        All text files are encoded with the help of
                                        encoding tables. If your input files are
                                        not unicode-compatible, please give the
                                        name of the encoding that was used to
                                        encode the input data.
        -docid, --doc_id TEXT           Document ID in the Corpus DB.
        -attr, --attr_name TEXT         Stats and corpus DBs have intern attributes.
                                        For changing or getting them you need to
                                        give the name of the attribute.
        -val, --value TEXT              The new value to set for an attribute.
        -exptyp, --type_to_export [sqlite|json|xml|csv|False]
                                        FileType for the export function.
        -expdir, --export_dir TEXT      Directory where exports will be saved. If
                                        False, they will be saved in the default
                                        project directory.
        -expname, --export_name TEXT    File name for the export data.
        -rowlim, --rows_limit_in_file INTEGER
                                        Maximum number of rows per exported file.
        -sn, --stream_number INTEGER    Enable or disable multiprocessing. If the
                                        number is > 1, the tool tries to compute
                                        everything in parallel. This can bring much
                                        better performance on PCs with multiple
                                        cores and a lot of main memory.
        -m, --mode [error|test|dev|dev+|dev-|prod|free|prod+t|test+s+|test+s-|silent|prod+|prod-|blind]
                                        Set one of the tool modes. Modes control
                                        the communication behavior of this tool.
        -ld, --logdir TEXT              Choose the name of the Directory for log
                                        data.
        --help                          Show this message and exit.
    

  • zas-rep-tools stats --help

      Usage: zas-rep-tools stats [OPTIONS] COMMAND1
    
      Options:
        -sb, --status_bar BOOLEAN       Enable/Disable the status bar
        -uefm, --use_end_file_marker BOOLEAN
                                        Enable/Disable the end-file marker, which
                                        changes the counter unit of the status bar
                                        from rows to files
        -backup, --make_backup BOOLEAN  Enable/Disable making a backup of the whole
                                        corpus before new insertions
        -lb, --lazyness_border INTEGER  Set the threshold that determines when
                                        exactly the data collector saves data to
                                        disk. If you have a lot of RAM, select a
                                        high number to ensure high performance.
        -rw, --rewrite BOOLEAN          Enable/Disable the rewrite option, which
                                        ensures that files are replaced/rewritten
                                        during export if the same filename is found
                                        in the same directory.
        -uc, --use_cash BOOLEAN         Enable/Disable whether the insertion process
                                        writes directly to disk or first into a
                                        cache. It is a good performance booster, but
                                        only with a lot of RAM.
        -opt, --optimizer TEXT          Enable/Disable the DB optimizer, which makes
                                        the current DB much faster, but less safe.
                                        See more: https://www.sqlite.org/pragma.html
        -optps, --optimizer_page_size INTEGER
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optcs, --optimizer_cache_size INTEGER
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optlm, --optimizer_locking_mode [normal|exclusive]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optsyn, --optimizer_synchronous [1|0|3|2|normal|off|extra|full]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optjm, --optimizer_journal_mode [delete|truncate|persist|memory|wal|off]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -optts, --optimizer_temp_store [1|0|2|file|default|memory]
                                        Setting for the DB optimizer. See the help
                                        text for optimizer.
        -gr, --gready BOOLEAN           If False -> stop the process immediately if
                                        an error was returned. If True -> try to
                                        keep the script running as long as possible,
                                        without stopping the main process.
        -cn, --corp_fname TEXT          File name of the CorpusDB (with or without
                                        extension)
        -sn, --stream_number INTEGER    Enable or disable multiprocessing. If the
                                        number is > 1, the tool tries to compute
                                        everything in parallel. This can bring much
                                        better performance on PCs with multiple
                                        cores and a lot of main memory.
        -crtix, --create_indexes BOOLEAN
                                        For better performance it is highly
                                        recommended to create indexes. But their
                                        creation costs time (once, during creation)
                                        and also space.
        -freeze, --freeze_db BOOLEAN    Freeze the current DB and close it for all
                                        future insertions of new data. This option
                                        also triggers the DB optimization process,
                                        which can cost a lot of time, but makes the
                                        DB much more space- and time-efficient.
                                        Once this process is done, it cannot be
                                        undone.
        -optlongsyn, --optimized_for_long_syntagmas BOOLEAN
                                        If you are planning to search in big
                                        syntagmas, set this to True. It will
                                        optimize the DB to be fast with long
                                        syntagmas.
        -minfile, --min_files_pro_stream INTEGER
                                        The limit at which multiprocessing starts
                                        to create a new stream.
        -basdelim, --baseline_delimiter TEXT
                                        Delimiter for syntagmas in the intern
                                        baseline table. Change this only if you
                                        really know that you need it.
        -sfn, --stats_fname TEXT        File name of the StatsDB.
        -vis, --visibility [extern|intern|False]
                                        Is this an intern or extern corpus?
        -encrkey, --encryption_key TEXT
                                        To encrypt the current DB, please give a
                                        key. If no key is given, the DB will not be
                                        encrypted.
        -ver, --version INTEGER         Version number of the DB
        -stats_id, --stats_id TEXT      Possibility to set the stats id manually.
                                        Otherwise it will be set automatically.
        -cname, --stats_intern_dbname TEXT
                                        Intern name of the DB, which will be saved
                                        as a tag inside the DB.
        -conlen, --context_lenght INTEGER
                                        How many tokens left and right of each
                                        found re(du)plication will be captured and
                                        saved. This number should be >= 3.
        -fullrep, --full_repetativ_syntagma BOOLEAN
                                        Disable/Enable full repetitiveness. If True,
                                        just fully repetitive syntagmas will be
                                        considered. A fully repetitive syntagma is
                                        one where all words were either replicated
                                        or reduplicated throughout. (ex.:
                                        FullRepRedu: 'klitze klitze kleine kleine',
                                        FullRepRepl: 'kliiitzeee kleeeinee') (See
                                        more about it in Readme -> Definitions)
        -ru, --repl_up INTEGER          From this number on, the tool recognizes
                                        repeated letters as a replication.
        -ignht, --ignore_hashtag BOOLEAN
                                        Enable/disable hiding of all hashtags, if it
                                        wasn't done during the corpus creation
                                        process.
        -case, --case_sensitiv BOOLEAN  Enable/disable case sensitivity during the
                                        stats computation process.
        -ignurl, --ignore_url BOOLEAN   Enable/disable hiding of all URLs, if it
                                        wasn't done during the corpus creation
                                        process.
        -ignment, --ignore_mention BOOLEAN
                                        Enable/disable hiding of all mentions, if it
                                        wasn't done during the corpus creation
                                        process.
        -ignp, --ignore_punkt BOOLEAN   Enable/disable hiding of all punctuation, if
                                        it wasn't done during the corpus creation
                                        process.
        -ignnum, --ignore_num BOOLEAN   Enable/disable hiding of all numbers, if it
                                        wasn't done during the corpus creation
                                        process.
        -bliti, --baseline_insertion_border INTEGER
                                        The limit at which syntagmas will be deleted
                                        from the cache and saved to disk.
        -expdir, --export_dir TEXT      Set the path to the export dir. If it is
                                        not given, all exports will be saved into
                                        the project folder.
        -exp_fname, --export_name TEXT  Set the file name for export files.
        -syn, --syntagma_for_export TEXT
                                        Set syntagmas to search/extract. Default:
                                        '*' - match all syntagmas. Example:
                                        'very|huge|highly,pitty|hard|happy,man|woman|boy|person'
                                        ('|' - delimiter within a paradigm; ',' -
                                        delimiter between syntagma parts.) Notice:
                                        no white space is allowed.
        -repl, --exp_repl BOOLEAN       Disable/Enable Replications Extraction
        -redu, --exp_redu BOOLEAN       Disable/Enable Reduplications Extraction
        -styp, --exp_syntagma_typ [pos|lexem]
                                        The type of the given components in
                                        syntagma_for_export. It is possible to
                                        search in pos-tags or in lexems.
        -sent, --exp_sentiment [neutral|positive|negative|False]
                                        Search in sentiment-tagged data.
        -ftyp, --export_file_type [csv|json|xml]
        -rowlim, --rows_limit_in_file INTEGER
                                        Maximum number of rows per exported file.
        -exp_encrkey, --encryption_key_corp TEXT
                                        For exporting additional columns
                                        (--additional_doc_cols) from an encrypted
                                        CorpusDB, or for computing a new StatsDB
                                        from an encrypted CorpusDB
        -ott, --output_table_type [exhausted|sum]
        -doccols, --additional_doc_cols TEXT
                                        For exporting stats with additional document
                                        columns from the CorpusDB. Don't forget to
                                        also give the file name of the CorpusDB for
                                        which the current StatsDB was computed
                                        (--corp_fname). Please give it in the
                                        following form: 'gender,age' (NO WHITE
                                        SPACES ALLOWED)
        -mscope, --max_scope TEXT       Upper limit of the syntagma length to
                                        search. Example: if max_scope = 1, the tool
                                        will search just in those syntagmas which
                                        contain exactly 1 word.
        -stemm, --stemmed_search BOOLEAN
                                        Search in lemmatized/stemmed syntagmas. Be
                                        careful not to give different conjugations
                                        of one lemma if this option is True,
                                        because you could get duplicates.
        -conleft, --context_len_left TEXT
                                        The length of the left context in the
                                        output tables. Can also be disabled (False).
        -conright, --context_len_right TEXT
                                        The length of the right context in the
                                        output tables. Can also be disabled (False).
        -sepsyn, --separator_syn TEXT   Separator inside a syntagma in the baseline.
        -wordex, --word_examples_sum_table BOOLEAN
                                        Enable/disable word examples in the exported
                                        output. (Just for sum output tables)
        -ignsym, --ignore_symbol TEXT   Enable/disable symbols in the exported
                                        outputs. (Just for sum output tables)
        -recflag, --recompute_flag TEXT
                                        For the 'recompute' command, which
                                        recomputes the full repetitiveness in the
                                        given StatsDB. True - full_repetativnes,
                                        False - no_full_repetativnes/all_syntagmas
        -attr, --attr_name TEXT         Stats and corpus DBs have intern attributes.
                                        For changing or getting them you need to
                                        give the name of the attribute.
        -val, --value TEXT              The new value to set for an attribute.
        -m, --mode [error|test|dev|dev+|dev-|prod|free|prod+t|test+s+|test+s-|silent|prod+|prod-|blind]
                                        Set one of the Tool Modus
        -ld, --logdir TEXT              Choose the name of the Directory for log
                                        data.
        --help                          Show this message and exit.
    

  • zas-rep-tools streamTwitter --help

      Usage: zas-rep-tools streamTwitter [OPTIONS] PATH_TO_SAVE
    
      Options:
        -l, --language [en|it|ar|id|es|ru|nl|pt|no|tr|th|pl|fr|de|da|fa|hi|fi|hu|ja|he|ko|sv|ur|False]
        -sw, --stop_words TEXT
        -t, --terms TEXT
        -e, --encoding [bz2_codec|cp1140|rot_13|cp932|euc_jisx0213|cp037|hex_codec|cp500|uu_codec|big5hkscs|mbcs|euc_jis_2004|iso2022_jp_3|iso2022_jp_2|iso2022_jp_1|gbk|iso2022_jp_2004|quopri_codec|cp424|iso2022_jp|mac_iceland|hp_roman8|iso2022_kr|euc_kr|cp1254|utf_32_be|gb2312|cp850|shift_jis|cp852|cp855|utf_16_le|cp857|cp775|cp1026|mac_latin2|utf_32|mac_cyrillic|base64_codec|ptcp154|euc_jp|hz|utf_8|utf_32_le|mac_greek|utf_7|mac_turkish|cp949|zlib_codec|big5|iso8859_9|iso8859_8|iso8859_5|iso8859_4|iso8859_7|iso8859_6|iso8859_3|iso8859_2|gb18030|shift_jis_2004|mac_roman|cp950|utf_16|iso8859_15|iso8859_14|tis_620|iso8859_16|iso8859_11|iso8859_10|iso8859_13|ascii|cp869|cp860|cp861|cp862|cp863|cp864|cp865|cp866|shift_jisx0213|cp1255|latin_1|cp1257|cp1256|cp1251|cp1250|cp1253|cp1252|cp437|cp1258|tactis|koi8_r|utf_16_be|johab|iso2022_jp_ext|cp858]
        -irt, --ignore_rt BOOLEAN
        -f, --filter_strategie [t|t+l|False]
                                        Set the filter strategy. 1) 't' - just
                                        search for terms/stop_words; 2) 't+l' -
                                        search for stop_words and language
                                        (recommended)
        -sut, --save_used_terms BOOLEAN
        -m, --mode [error|test|dev|dev+|dev-|prod|free|prod+t|test+s+|test+s-|silent|prod+|prod-|blind]
                                        Set one of the Tool Modus
        -ld, --logdir TEXT              Choose the name of the Directory for log
                                        data.
        --help                          Show this message and exit.
    
  • zas-rep-tools streamerInfo --help

      Usage: zas-rep-tools streamerInfo [OPTIONS] COMMAND
    
      Options:
        -m, --mode [error|test|dev|dev+|dev-|prod|free|prod+t|test+s+|test+s-|silent|prod+|prod-|blind]
                                        Set one of the Tool Modus
        -ld, --logdir TEXT              Choose the name of the Directory for log
                                        data.
        --help                          Show this message and exit.
    


Back to top

5.3 CLI-Usage

Usage examples for each CLI command with the options belonging to it. Notice: this tool is quite user-friendly; if something goes wrong, it tries to predict what was wrong and to give useful information about how to solve the problem.

Necessary Options

List of the minimum/necessary options needed to run each command.

  •       $ zas-rep-tools corpora
    
    • add

      • --path_to_read
      • --file_format_to_read
      • --corp_intern_dbname
      • --language
      • --visibility
      • --platform_name
    • del

      • --corp_fname
    • names no additional options

    • meta

      • --corp_fname
    • basic_stats

      • --corp_fname
    • update_attr

      • --corp_fname
      • --attr_name
      • --value
    • export

      • --corp_fname
      • --type_to_export
    • used_tools no additional options

    • clean_dir no additional options

    • cols

      • --corp_fname
    • doc

      • --corp_fname
      • --doc_id
    • ids

      • --corp_fname

  •       $ zas-rep-tools stats
    
    • compute

      • --corp_fname
      • --stats_intern_dbname
      • --visibility
    • del

      • --stats_fname
    • names no additional options

    • meta

      • --stats_fname
    • basic_stats

      • --stats_fname
    • update_attr

      • --stats_fname
      • --attr_name
      • --value
    • export

      • --stats_fname
      • --export_file_type
    • clean_dir no additional options

    • recompute

      • --stats_fname
      • --recompute_flag
    • optimize

      • --stats_fname
    • recreate_indexes

      • --stats_fname
  •       $ zas-rep-tools streamTwitter
    
    • --path_to_save
    • --filter_strategie


Exhaustive Options

List of all exhaustive/additional options that can be used to run each command. (Commands that don't have any additional options are listed only in the section before, not here.)

  •       $ zas-rep-tools corpora
    
    • add

      • --path_to_read
      • --file_format_to_read
      • --reader_regex_template
      • --reader_regex_for_fname
      • --end_file_marker
      • --use_end_file_marker
      • --stop_process_if_possible
      • --formatter_name
      • --text_field_name
      • --id_field_name
      • --reader_ignore_retweets
      • --mode
      • --status_bar
      • --tok_split_camel_case
      • --make_backup
      • --lazyness_border
      • --rewrite
      • --stop_if_db_already_exist
      • --use_cash
      • --optimizer
      • --optimizer_page_size
      • --optimizer_cache_size
      • --optimizer_locking_mode
      • --optimizer_synchronous
      • --optimizer_journal_mode
      • --optimizer_temp_store
      • --heal_me_if_possible
      • --corp_intern_dbname
      • --language
      • --visibility
      • --encryption_key
      • --corp_fname
      • --source
      • --license
      • --template_name
      • --version
      • --cols_and_types_in_doc
      • --corpus_id_to_init
      • --tokenizer
      • --pos_tagger
      • --sentiment_analyzer
      • --sent_splitter
      • --preprocession
      • --lang_classification
      • --del_url
      • --del_punkt
      • --del_num
      • --del_mention
      • --del_hashtag
      • --del_html
      • --case_sensitiv
      • --emojis_normalization
      • --stream_number
      • --min_files_pro_stream
      • --csvdelimiter
      • --encoding
      • --read_from_zip
    • meta

      • --corp_fname
      • --encryption_key
    • basic_stats

      • --corp_fname
      • --encryption_key
    • update_attr

      • --corp_fname
      • --attr_name
      • --value
      • --encryption_key
    • export

      • --corp_fname
      • --type_to_export
      • --encryption_key
      • --export_dir
    • cols

      • --corp_fname
      • --encryption_key
    • doc

      • --corp_fname
      • --doc_id
      • --encryption_key
    • ids

      • --corp_fname
      • --encryption_key

  •       $ zas-rep-tools stats
    
    • compute

      • --corp_fname
      • --encryption_key_corp
      • --mode
      • --status_bar
      • --make_backup
      • --lazyness_border
      • --rewrite
      • --stop_if_db_already_exist
      • --use_cash
      • --optimizer
      • --optimizer_page_size
      • --optimizer_cache_size
      • --optimizer_locking_mode
      • --optimizer_synchronous
      • --optimizer_journal_mode
      • --optimizer_temp_store
      • --stats_intern_dbname
      • --visibility
      • --encryption_key
      • --stats_fname
      • --gready
      • --version
      • --context_lenght
      • --full_repetativ_syntagma
      • --repl_up
      • --ignore_hashtag
      • --case_sensitiv
      • --ignore_url
      • --ignore_mention
      • --ignore_punkt
      • --ignore_num
      • --baseline_delimiter
      • --min_files_pro_stream
      • --create_indexes
      • --freeze_db
      • --baseline_insertion_border
      • --optimized_for_long_syntagmas
    • meta

      • --stats_fname
      • --encryption_key
    • basic_stats

      • --stats_fname
      • --encryption_key
    • update_attr

      • --stats_fname
      • --attr_name
      • --value
      • --encryption_key
    • export

      • --mode
      • --status_bar
      • --stats_fname
      • --encryption_key
      • --export_dir
      • --syntagma_for_export
      • --exp_repl
      • --exp_redu
      • --exp_syntagma_typ
      • --exp_sentiment
      • --encryption_key_corp
      • --output_table_type
      • --additional_doc_cols
      • --path_to_corpdb
      • --max_scope
      • --stemmed_search
      • --context_len_left
      • --context_len_right
      • --separator_syn
      • --word_examples_sum_table
      • --ignore_num
      • --ignore_symbol
    • recompute

      • --stats_fname
      • --recompute_flag
      • --encryption_key
    • optimize

      • --stats_fname
      • --encryption_key
    • recreate_indexes

      • --stats_fname
      • --encryption_key

  •       $ zas-rep-tools streamTwitter <path_to_save>
    
    • --language
    • --filter_strategie
    • --stop_words
    • --terms
    • --encoding
    • --ignore_rt
    • --save_used_terms
    • --mode
    • --logdir

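A minimal invocation combining the required argument and options above (the output path is illustrative):

    $ zas-rep-tools streamTwitter my_tweets/ --filter_strategie t+l --language de --ignore_rt True
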

Back to top

5.4 Multiprocessing

The current tool supports multiprocessing. Just set the 'stream_number' option to more than 1 to have the process executed in parallel, as in the example below.
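
For example, reusing the StatsDB computation example from the Tutorials section (the corpus file name is taken from there), four parallel streams would be requested like this:

    $ zas-rep-tools stats compute --corp_fname 7728_corpus_twitter_sifter_de_intern_plaintext.db --stats_intern_dbname sifter --visibility intern --stream_number 4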


Back to top

5.5 NLP-Methods

Used NLP-Methods:

  • Tokenization
  • Sent-Segmentation
  • POS-Tagging
  • Sentiment Analysis
  • Stemming
  • RLE (run length encoding)
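
To illustrate the last method: run-length encoding turns a word into letter runs, which makes replications directly countable. A minimal Python sketch (not the tool's actual code):

    from itertools import groupby

    def rle(word):
        # 'veeeeeerrrry' -> [('v', 1), ('e', 6), ('r', 4), ('y', 1)]
        return [(ch, len(list(group))) for ch, group in groupby(word)]

    print(rle("veeeeeerrrry"))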


Back to top

5.6 InternDataBase-Structure (SQLite)

  • Corpus

    • Tables:
      • Documents
      • Info
    • Meta-Data: (id,name,platform_name,template_name,version,language,created_at,source,license,visibility,typ,tokenizer,sent_splitter,pos_tagger,sentiment_analyzer,preprocession,del_url,del_punkt,del_num,del_mention,del_hashtag,del_html,case_sensitiv,lang_classification,emojis_normalization,sent_num,token_num,doc_num,text_field_name,id_field_name,locked)
  • Stats

    • Tables:
      • Replications
      • Reduplications
      • Baseline
      • Info
    • Meta-Data: (id, corpus_id, name, version, created_at, visibility, typ, db_frozen, context_lenght, language, repl_up, ignore_hashtag, ignore_url, ignore_mention, ignore_punkt, ignore_num, force_cleaning, case_sensitiv, full_repetativ_syntagma, min_scope_for_indexes, locked, pos_tagger, sentiment_analyzer, baseline_delimiter)
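
Since both DB types are plain SQLite files (optionally encrypted via sqlcipher), they can also be inspected directly. A minimal sketch for an unencrypted corpus DB (the file name is illustrative, taken from the usage examples below):

    import sqlite3

    conn = sqlite3.connect("7614_corpus_blogs_bloggerCorpus_test_extern_plaintext.db")
    cur = conn.cursor()
    # List the tables (expected per the structure above: Documents, Info)
    cur.execute("SELECT name FROM sqlite_master WHERE type='table'")
    print(cur.fetchall())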


Back to top

5.7 Additional Features

  • Formatters Are there to help with better reading of unstructured data. Existing formatters: ["twitterstreamapi", "sifter"]

  • Templates Predefinitions of the corpus DB for different existing projects, which contain information about the columns of the documents table. Existing templates: ["twitter","blogger"]







Back to top

6. WorkFlow

Step 1: Corpus Data Base Creation

    $ zas-rep-tools corpora add

Step 2: Stats DataBase Computation

    $ zas-rep-tools stats compute

Step 3: Export of the computed Statistics

    $ zas-rep-tools stats export






Back to top

7. Tutorials

Notice

1. Don't use the following symbols in names: '-,;:=)({}¢[];
2. All databases will be saved into the given project folder;
3. To stop execution, please use 'Ctrl+C';
4. To ensure fast corpus creation, try to use a minimum of preprocessing 
functions;
5. The following commands are just examples, meant to inspire the user;

Python Package Tutorial

-work_in_progress- (The API can be found in the tests folder. If you are interested in using this tool as a Python package, please contact the developer and ask for a better API explanation.)




Command line Tutorial


Add/Create Corpus

The following examples demonstrate the corpus creation process from the current directory and with maximal preprocessing steps.

  • From Certain (predefined) Sources Predefinition is done through the following options: '--formatter_name', '--reader_regex_template', '--template_name'

    • Sifter-Twitter-Data (csv)

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read csv --corp_intern_dbname sifter_twitter_2014 --language de  --visibility intern --platform_name twitter --read_from_zip True --mode prod --heal_me_if_possible True --formatter_name sifter --sent_splitter True --pos_tagger True  --read_from_zip True   --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False
        
    • Blogger Authorship Corpus (txt)

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read txt --corp_intern_dbname blogger_txt --language en --visibility extern --platform_name blogger  --reader_regex_template blogger --sent_splitter True --pos_tagger True --del_html True --mode prod+ --read_from_zip True  --source LanguageGoldMine  --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False
        
    • Twitter Stream API (json)

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read json --corp_intern_dbname twitter_streamed_2019 --language en --visibility extern --platform_name twitter --template_name twitter --stream_number 1 --formatter_name twitterstreamapi --sent_splitter True --pos_tagger True --mode prod+ --read_from_zip True  --source TwitterAPI --license Twitter_Developer_Agreement_and_Policy --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False
        
  • From Scratch

    • txt This tool can only work with TXT text collections that have all meta-data in the filename, where it can be matched with a regex (see the regex sketch after this list).

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read txt --corp_intern_dbname txt_corp --language en --visibility extern --platform_name blogger --del_html True --reader_regex_for_fname "(?P<id>[\d]*)\.(?P<gender>[\w]*)\.(?P<age>\d*)\.(?P<working_area>.*)\.(?P<star_constellation>[\w]*)" --sent_splitter True --pos_tagger True --mode prod+ --read_from_zip True  --source LanguageGoldMine  --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False
        
    • csv/json/xml

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read json --corp_intern_dbname twitter_streamed_2019 --language en --visibility extern --platform_name twitter --stream_number 1 --sent_splitter True --pos_tagger True --mode prod+ --read_from_zip True  --source TwitterAPI --license Twitter_Developer_Agreement_and_Policy --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False --heal_me_if_possible False --cols_and_types_in_doc 't_created_at:TEXT,t_language:TEXT,t_used_client:TEXT,u_created_at:TEXT,u_description:TEXT,u_favourites:INTEGER,u_followers:INTEGER,u_friends:INTEGER,u_id:INTEGER'
        

      or (let the tool extract the columns from the input text collection automatically with '--heal_me_if_possible True'; but if the input data is not consistent and every document has a different number of columns, this can lead to unpredictable errors)

      •    $ zas-rep-tools corpora add --path_to_read . --file_format_to_read csv --corp_intern_dbname twitter_streamed_2019 --language en --visibility extern --platform_name twitter --stream_number 1 --sent_splitter True --pos_tagger True --mode prod+ --read_from_zip True  --source unknown --license Twitter_Developer_Agreement_and_Policy --version 1 --sentiment_analyzer True --del_url True --del_punkt True --del_num True --del_html True --case_sensitiv False  --heal_me_if_possible True
        


Compute StatsDB

  • with Preprocessing

                  $ zas-rep-tools stats compute --corp_fname 7728_corpus_twitter_sifter_de_intern_plaintext.db --stats_intern_dbname sifter --visibility intern --full_repetativ_syntagma True --optimizer True --use_cash True --status_bar True --context_lenght 5 --ignore_url True --ignore_punkt True --ignore_num True
    
  • without Preprocessing

                  $ zas-rep-tools stats compute --corp_fname 7728_corpus_twitter_sifter_de_intern_plaintext.db --stats_intern_dbname sifter --visibility intern --full_repetativ_syntagma True --optimizer True --use_cash True --status_bar True 
    
  • Compute for non-full-repetativ syntagmas (see the Definitions section)

                  $ zas-rep-tools stats compute --corp_fname 7728_corpus_twitter_sifter_de_intern_plaintext.db --stats_intern_dbname sifter --visibility intern --full_repetativ_syntagma False --optimizer True --use_cash True --status_bar True
    


Export Statistics from StatsDB

Exhausted Output-Tables

  • For scope = 1 (only those syntagmas with length/scope = 1)

      $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_en_extern_plaintext.db --export_file_type csv --output_table_type exhausted --exp_redu True --exp_repl True --max_scope 1
    
  • For all syntagmas

      $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_en_extern_plaintext.db  --export_file_type csv --output_table_type exhausted --exp_redu True --exp_repl True
    
  • With additional columns from CorpusDB

      $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_en_extern_plaintext.db  --export_file_type csv --exp_redu True --exp_repl True --max_scope 1 --additional_doc_cols 'gender,age' --corp_fname 7614_corpus_blogs_bloggerCorpus_test_extern_plaintext.db
    
  • Search in certain syntagmas ('|' = 'or'; ',' = delimiter between words in a syntagma)

    • Stemmed-Search (in the lexical base form; syntagma_for_export will be stemmed first)

           $  zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_de_extern_plaintext.db --export_file_type csv --exp_repl True --exp_redu True --output_table_type exhausted --syntagma_for_export 'klitze,kleine' --exp_syntagma_typ lexem --stemmed_search True
      
    • POS-search (search in part of speech tags)

           $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_de_extern_plaintext.db --export_file_type csv  --exp_repl True --exp_redu True --max_scope 1 --output_table_type exhausted --syntagma_for_export 'EMOIMG|EMOASC,number' --exp_syntagma_typ pos
      
    • Normal-search (search in non-stemmed lexems)

           $  zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_de_extern_plaintext.db --export_file_type csv  --exp_repl True --exp_redu True --output_table_type exhausted --syntagma_for_export 'klitze,kleine' --exp_syntagma_typ lexem
      
    • Sentiment Search In addition to each export command, you can use the following option to restrict the search to a certain sentiment:

       '--exp_sentiment'
      

      Three different sentiment polarities are implemented:

       ["neutral", "positive","negative"]
      

Summary Output-Tables

  • Replications (Letters)

    • Normal search

      $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_en_extern_plaintext.db  --export_file_type csv --output_table_type sum --exp_repl True --word_examples_sum_table True
      
    • POS-search

      $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_de_extern_plaintext.db --export_file_type csv --exp_redu False --exp_repl True --max_scope 1 --output_table_type sum --syntagma_for_export 'EMOIMG|EMOASC' --exp_syntagma_typ pos
      
    • Stemmed-Search

      $  zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_de_extern_plaintext.db --export_file_type csv --exp_repl True --output_table_type sum --syntagma_for_export 'klitze,kleine' --exp_syntagma_typ lexem --stemmed_search True
      
    • Sentiment Search: see above

  • Reduplications (Words)

    • Normal search

       $ zas-rep-tools stats export --stats_fname 7614_3497_stats_bloggerCorpus_en_extern_plaintext.db  --export_file_type csv --output_table_type sum --exp_redu True  --word_examples_sum_table True
      
    • POS-search + Stemmed-Search + Sentiment Search: see above



Stream Twitter

        $ zas-rep-tools streamTwitter .   --language de --filter_strategie "t+l"
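
The first argument is presumably the directory where the streamed tweets are saved (here the current directory '.'). To stream English tweets instead, swap the language flag (hypothetical variant of the same command):

        $ zas-rep-tools streamTwitter . --language en --filter_strategie "t+l"

The collected json files can afterwards be ingested with the 'corpora add' command from the 'Twitter Stream API (json)' example above.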






Back to top

8. Input/Output

Input

  • FileTypes:

    • csv
    • xml
    • json
    • txt
    • zip

Output

  • FileTypes:

    • csv
    • xml
    • json
  • Columns in the Output Tables

    • Baseline

      • syntagma The search syntagma
      • stemmed The stemmed syntagma
      • occur_syntagma_all Number of occurrences of the current syntagma in the whole corpus
      • occur_repl_uniq Number of occurrences of unique replications in the current syntagma
      • occur_repl_exhausted Number of occurrences of exhausted replications in the current syntagma
      • occur_redu_uniq Number of occurrences of unique reduplications in the current syntagma
      • occur_redu_exhausted Number of occurrences of exhausted reduplications in the current syntagma
      • occur_full_syn_repl Number of occurrences of the full-repetativ syntagma with respect to replications
      • occur_full_syn_redu Number of occurrences of the full-repetativ syntagma with respect to reduplications
    • Document

      • doc_id ID-Number of the current Document
      • redufree_len Length of the reduplication-free text element of the current document
    • Word

      • normalized_word Repetition-free word
      • rle_word Run-length-encoded word
      • stemmed Stemmed word
      • pos Part-of-speech tag
      • polarity Polarity/sentiment of the word itself and of its context
    • Repl (see the illustration after this list)

      • id Replication ID number
      • index_in_corpus Address of the current word in the corpus
      • index_in_redufree Address of the current word in the reduplication-free text element
      • repl_letter The replicated letter
      • repl_length Length of the replication
      • index_of_repl Index of the replicated letter in the current normalized_word, counted from 0
      • in_redu If the current word containing a replication is also part of a reduplication, this column holds the address of that reduplication
    • Redu

      • id Reduplications ID Number
      • index_in_corpus Address of the current reduplication in the corpus
      • index_in_redufree Address of the current reduplication in the reduplication-free text element
      • orig_words The rle_words contained in the current reduplication, with their occurrence counts
      • redu_length Length of the current reduplication
    • context

      • contextL{number} Context word to the left of the current token
      • context_infoL{number} Additional data for the context word to the left of the current token
      • contextR{number} Context word to the right of the current token
      • context_infoR{number} Additional data for the context word to the right of the current token
  • Precomputed Example Tables

    In the folder 'zas_rep_tools/examples' you can find precomputed examples of the output tables as well as of the input text collections.
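
    As a purely hypothetical illustration of the Repl columns (the exact tokenization and RLE notation depend on the pipeline), an input token like 'suuuper' would roughly decompose as:

        normalized_word = 'super'
        repl_letter     = 'u'
        repl_length     = 3
        index_of_repl   = 1    (position of 'u' in 'super', counted from 0)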







Back to top

9. Restrictions

Input:

  • TXT: currently this tool-set supports only those txt files that have all metadata in the filename, where it can be captured with a regex expression.






Back to top

10. Citing ZAS-REP-TOOLS

How do I cite ZAS-REP-TOOLS in my articles?

If you cite the program, the following format is recommended (adjusting retrieval dates and versions as necessary):

  • Savin, E., Fuchs, S., Ćwiek, A. (2018). ZAS-REP-TOOLS: A tool for automatic extraction and quantification of the repetition from the written language. [Computer program]. Version 0.1, retrieved August 2018 from https://github.com/savin-berlin/zas-rep-tools.







Back to top

11. Possible errors and warnings

1. Error: 'Too many open files:'

Solutions:

  1. Increase the maximum number of open files in the system:
macOS:
$ sudo launchctl limit maxfiles 3000000 3000000
$ sysctl -a | grep kern.maxfiles

(If it still doesn't work, test every component separately.)
Ubuntu:
$ cat /proc/sys/fs/file-max
$ ulimit -n 300000
$ ulimit -n
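
To make the higher limit persistent across sessions on Ubuntu, it can additionally be registered in /etc/security/limits.conf (standard Linux configuration, not specific to this tool):
$ echo '* soft nofile 300000' | sudo tee -a /etc/security/limits.conf
$ echo '* hard nofile 300000' | sudo tee -a /etc/security/limits.conf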

2. Error: "MemoryError"

This means that your computer does not have enough RAM or that the swap space is too small. Try increasing the swap space on your computer.

Ubuntu:
$ size="8G" && file_swap=/swapfile_$size.img && sudo touch $file_swap && sudo fallocate -l $size /$file_swap && sudo mkswap /$file_swap && sudo swapon -p 20 /$file_swap

$ sudo swapon --show
$ free -h
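
To keep the swap file active after a reboot, it can additionally be registered in /etc/fstab (standard Linux configuration, not specific to this tool; the path assumes size="8G" as above):
$ echo '/swapfile_8G.img none swap sw 0 0' | sudo tee -a /etc/fstab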

3. Error: In Windows environment - commands take too long

If during the installation process a command takes longer than 5-10 minutes, try pressing the ENTER key to force the command line to refresh.

4. UnicodeDecodeError: 'ascii' codec can't decode byte in position : ordinal not in range(128)

- It could be a problem with the chosen encoding,
- or the input command wasn't recognized correctly (corrupt syntax).

5. 'Permission Error' or 'UserConfigDBGetterError:', or 'OSError: [Errno 13] Permission denied:'

- Execute the command with admin rights (prefix it with "sudo").

6. "E: Could not get lock /var/lib/dpkg/lock - open (11: Resource temporarily unavailable)"

See here: https://askubuntu.com/questions/15433/unable-to-lock-the-administration-directory-var-lib-dpkg-is-another-process






Back to top

12. Bugs

1. CashedWriterError: Given CommitNumber'8' is not exist. It wasn't possible to write cashed Insertion into DB.

Then set the option "--use_cash" to False.

2. The status bar doesn't work properly if CSV is given as input.







Back to top

13. Data-Examples

The following examples are also available:

  • StatsDBs and CorpusDBs You can copy-paste this data into your project folders and experiment with it.

    •   'zas_rep_tools/data/tests_data/testDBs/testFolder'
      
  • Output tables (csv)

    •   'zas_rep_tools/examples'
      






Back to top

14. Acknowledgements

A big thank you to the following people, who made this work and the current results possible:


This research was supported by DFG and XPRAG.de grants
to the Leibniz-Center General Linguistics.
