General and Technical Docs for eBL

How to set up Gitpod
Contribution guidelines
How to run migrations

Guide for eBL project

The "Electronic Babylonian Literature" (eBL) project, based at Ludwig Maximilian University of Munich (LMU) and the Bavarian Academy of Sciences (BAdW), involves the digital compilation and analysis of Babylonian texts using AI and computer vision technologies. This project leverages several advanced techniques to identify and match manuscript fragments.

The algorithms and techniques used in this project include:

Image Matching and Fragment Identification: The eBL project uses computer vision techniques to digitally reconstruct cuneiform tablets from scattered fragments. This involves capturing high-resolution images of the fragments and using pattern recognition algorithms to identify matching pieces based on the cuneiform signs and their arrangement. This process is enhanced by custom-built software applications specifically designed for Assyriological research.
N-gram Matching for Text Overlap: The project employs n-gram matching algorithms to identify overlapping text segments within the corpus of transliterated texts. This method involves breaking down the text into n-grams (substrings of n characters or words) and comparing these across different fragments to find overlaps that indicate they are parts of the same original document.
Optical Character Recognition (OCR) for Cuneiform: OCR technology adapted for cuneiform script is used to digitize and transliterate the texts from the high-resolution images of the tablets. This process involves training machine learning models to recognize and interpret the ancient script accurately.
Machine Learning and AI: Advanced machine learning algorithms are utilized to analyze the patterns and features of the cuneiform signs, facilitating the identification and classification of fragments. This helps in piecing together the fragmented texts and improving the accuracy of the digital reconstructions.

The project leverages Optical Character Recognition (OCR) and Natural Language Processing (NLP) technologies to read and match the texts. Specifically, using OCR to convert the cuneiform signs from images into machine-readable text. Then apply algorithms to detect and match overlapping segments of different manuscripts, aiding in the reconstruction of fragmented texts.

For more detailed information, you can explore the project’s official website and the related publications from LMU and the International Association for Assyriology, which provide insights into the methodologies and technological innovations employed in the eBL project and IAAssyriology.

Primary data

To date, thousands of additional cuneiform fragments have been photographed in collaboration with the British Museum in London and the Iraq Museum in Baghdad and CDLI.

Process

eBL GitHub organisation includes various tools, such as:

ebl-api: The API for accessing the Electronic Babylonian Literature database.
cuneiform-ocr: Tools for performing OCR on cuneiform texts.
ngram-matcher: Algorithms for matching n-grams to identify overlapping text segments.

These tools collectively enable the identification and matching of text fragments by comparing pixel patterns and character sequences, allowing researchers to piece together ancient manuscripts from scattered fragments like a text puzzle.

For more information about eBL project, read this article on Medium and Prof Enrique Jiménez YouTube speech.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
gitpod-setup		gitpod-setup
guides		guides
.gitignore		.gitignore
Babylon_Project_modified.atn		Babylon_Project_modified.atn
README.md		README.md
STITCHING_Modified_EJ_LH.jsx		STITCHING_Modified_EJ_LH.jsx
db_design.gv		db_design.gv
db_design.pdf		db_design.pdf
upload_images_collection_name.png		upload_images_collection_name.png
upload_images_filenames_without_paths.png		upload_images_filenames_without_paths.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

General and Technical Docs for eBL

Guide for eBL project

Primary data

Process

About

Releases

Packages

Contributors 5

Languages

ElectronicBabylonianLiterature/generic-documentation

Folders and files

Latest commit

History

Repository files navigation

General and Technical Docs for eBL

Guide for eBL project

Primary data

Process

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages