Syriac Corpus Application

A TEI publishing application for Syriac texts and manuscripts, built on a ultra-reliable version of the Gaddel framework.

Overview

This application provides a digital platform for publishing and exploring Syriac corpus materials encoded in TEI XML. Originally based on the Gaddel framework developed for Syriaca.org, it has been adapted for Syriac manuscript and text collections.

Features

Multi-lingual Interface - Browse and search in multiple languages
TEI Processing - Convert TEI XML to JSON for searching, browsing, and display
TEI Processing - Convert TEI XML to HTML for item pages
Faceted Search - Filter and browse by author, date, and catalog
Full-text Search - Search within Syriac texts and translations
Multi-format Export - HTML, TEI, JSON formats
SPARQL Integration - RDF triplestore and SPARQL endpoint support

Requirements

Python 3.7+ (for TEI processing and testing)

Quick Start

# Clone repository
git clone <repository-url>
cd syriac-corpus-app

# Install Python dependencies
pip install lxml pytest

TEI Requirements

TEI files must include a unique identifier:

<tei:publicationStmt>
  <tei:idno type="URI">unique-identifier</tei:idno>
</tei:publicationStmt>

Project Structure

syriac-corpus-app/
├── resources/          # CSS, JS, fonts, images
├── siteGenerator/      # XSL templates and components
├── documentation/      # API and wiki documentation
├── exampleData/        # Sample TEI, JSON, HTML files
├── tei2json.py        # TEI to JSON converter
├── index.html         # Main entry point
└── *.html             # Page templates

Configuration

repo-config.xml - Configure data paths and unique identifiers
controller.xql - Define URL routing and request handling

Development

See DEV_PROCESS.md for detailed development workflow, testing procedures, and contribution guidelines.

Data Format

The application extracts the following from TEI files:

Title and author information
Work and catalog URIs (Syriaca.org references)
Composition dates
Section divisions with Syriac text
Full-text content for search indexing in any language

License

See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3,047 Commits
.github/workflows		.github/workflows
api-documentation		api-documentation
documentation		documentation
exampleData		exampleData
infrastructure		infrastructure
resources		resources
siteGenerator		siteGenerator
.gitattributes		.gitattributes
.gitignore		.gitignore
301.html		301.html
404.html		404.html
500.html		500.html
DEV_PROCESS.md		DEV_PROCESS.md
LICENSE		LICENSE
README.md		README.md
about-srophe.html		about-srophe.html
about-syriac.html		about-syriac.html
about.html		about.html
alt-search.html		alt-search.html
browse.html		browse.html
build-index.html		build-index.html
contact-us.html		contact-us.html
editorial-board.html		editorial-board.html
entry.html		entry.html
favicon.ico		favicon.ico
history.html		history.html
icon.png		icon.png
index.html		index.html
manuscripts.json		manuscripts.json
media-releases.html		media-releases.html
project-partners.html		project-partners.html
project-team.html		project-team.html
record.html		record.html
search-facets.html		search-facets.html
search.html		search.html
submissions.html		submissions.html
tei2json.py		tei2json.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Syriac Corpus Application

Overview

Features

Requirements

Quick Start

TEI Requirements

Project Structure

Configuration

Development

Data Format

License

Links

About

Uh oh!

Releases 3

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Syriac Corpus Application

Overview

Features

Requirements

Quick Start

TEI Requirements

Project Structure

Configuration

Development

Data Format

License

Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages