Pietro Liuzzo edited this page Jan 31, 2017 · 4 revisions

vel ሃሌ፡ ሉያ፡ (etiam siglo ሌ፡ ሉያ፡ vel ለያ፡ scriptum) 1) Ἀλληλούϊα, Halleluja הַלְלוּיָה Apoc. 19,1; Tob. 13,19; in titulis psalmorum 105 seq.; sexcenties in Deguâ, Mavâset, Genzat et aliis libris liturgicis legitur, nam ingentem Allelujarum summam et in gaudio et in luctu intonant Abyssini (Ludolfi hist. III,6,105) 2) Cum ut in aliis festis ita per septimanam Paschae ab Abyssinis hymni et antiphonae hallelujarum solemniter decantentur, in versione Octateuchi Deut. 16,16 pro ἑορτὴ τῶν ἀζύμων በዓለ፡ ሃሌሉያ፡ et Ex. 13,4 pro μὴν τῶν νέων ወርኀ፡ ሃሌሉያ፡ ኔሳን፡ intrusum est. {{DiL.1425} Vid. ሐሌ፡ col. 69}.

mark strings in Greek, Hebrew, fidel, etc. with <foreign>

initial Txt file preprocessing

add root Term

  • substitute \nL with </term><term>, then add the first opening and the last closing tag by hand. BETTER done in Text Edit due to the file size!
  • isolate id and form
    • search (L[0-9A-Za-z]+)($)([\p{InEthiopic}\s\w.,]+)($)
    • replace <id>\1</id><form>\3</form>
  • search each language by its Unicode block (e.g. \p{InGreekExtended}) and wrap the matches into <foreign> with the matching xml:lang code. BE CAREFUL WITH SUBSTITUTIONS NOT TO LOSE DATA!
    • gez
    • grc
    • he
    • ar
    • syr
    • cop
  • (([\p{InEthiopic}\p{InEthiopicExtended}\p{InEthiopicSupplement}\p{InEthiopicExtended-A}]+\s?)+) --> <foreign xml:lang="gez">\1</foreign>
  • (([\p{InGreek}\p{InGreekExtended}]+(\s+)?)+) --> <foreign xml:lang="grc">\1</foreign>
    • tested also for Avestan, Phoenician, Armenian, Georgian: none found.
  • search all quotations and substitute with <ref cRef="Job" loc="1,1">Job 1,1</ref>. THIS leaves plenty of dirty remains:
    • search ((((\d\s)?([\p{javaUpperCase}][a-z]+.?\s?)+([A-Za-z]+.?\s?)?))(\s+)((\d+)[,.\s]?(\d+)?))
    • replace <ref cRef="\2" loc="\8">\1</ref> (group 7 is only the whitespace between book and numbers; the chapter and verse are in group 8)
    • matches e.g. "0 Matt" when a preceding number from other information is there
    • matches any string at the end of which there might be a number...
    • if other abbreviations like Acc. precede the citation, these are also matched, generating a big mess.
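
The two <foreign> substitutions recorded above can be tried outside the editor with a short Java program, since the patterns use Java block names such as \p{InEthiopic}. This is only a sketch: the class name ForeignTagger and the method tag are made up for illustration, only the gez and grc patterns are covered because those are the only ones written down in these notes, and the sample string is given as Unicode escapes so the source file stays plain ASCII.

```java
import java.util.regex.Pattern;

// Hypothetical helper: applies the gez and grc patterns from the notes above.
class ForeignTagger {
    // Ethiopic blocks, pattern copied from the notes.
    static final Pattern GEZ = Pattern.compile(
        "(([\\p{InEthiopic}\\p{InEthiopicExtended}\\p{InEthiopicSupplement}"
        + "\\p{InEthiopicExtended-A}]+\\s?)+)");

    // Greek blocks, pattern copied from the notes.
    static final Pattern GRC = Pattern.compile(
        "(([\\p{InGreek}\\p{InGreekExtended}]+(\\s+)?)+)");

    // Wraps each Ethiopic run, then each Greek run, in a <foreign> element.
    static String tag(String text) {
        String out = GEZ.matcher(text)
            .replaceAll("<foreign xml:lang=\"gez\">$1</foreign>");
        return GRC.matcher(out)
            .replaceAll("<foreign xml:lang=\"grc\">$1</foreign>");
    }

    public static void main(String[] args) {
        // Latin text with one Greek word (U+03BC U+1F74 U+03BD) and one
        // Ethiopic word ending in the Ethiopic wordspace (U+1361).
        String sample = "pro \u03BC\u1F74\u03BD \u12C8\u122D\u1280\u1361 intrusum est.";
        System.out.println(tag(sample));
    }
}
```

Because both patterns allow whitespace after each run of letters, the space that follows a Greek or Ethiopic string ends up inside the <foreign> element; it may be worth trimming it in a later pass.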

to exclude the known abbreviations, substitute the grammatical forms by hand, e.g. search: abs. replace with: \1

tags used in Dillmann

  • amh. for amharicus, referring to the following string
  • adv.
  • ap. and apocr. cannot be treated, as Dan ap. 1,1, Dan. apoc. 1.1 and dan 1.1 apoc. all occur; they will be caught by the citations catcher.

gramm. \1

((((\s\d{1}\s)?([\p{javaUpperCase}][a-z]+.\s?)+([A-Za-z]+.?\s?)?))(\s+)((\d+(?![)\s]))[,.\s]?(\d+)?))

\1
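
The refined citation pattern above can be checked against a fragment of a real entry with a few lines of Java. This is a sketch under assumptions: the class name CitationTagger and the method tag are hypothetical, and the replacement string is modelled on the <ref cRef="Job" loc="1,1"> target mentioned earlier in these notes. By counting parentheses in the pattern, group 1 is the whole citation, group 2 the book abbreviation and group 8 the chapter and verse, so those are the groups used here.

```java
import java.util.regex.Pattern;

// Hypothetical helper: wraps biblical citations in <ref> using the refined pattern.
class CitationTagger {
    // Refined citation pattern, copied from the notes above.
    static final Pattern CITATION = Pattern.compile(
        "((((\\s\\d{1}\\s)?([\\p{javaUpperCase}][a-z]+.\\s?)+([A-Za-z]+.?\\s?)?))(\\s+)"
        + "((\\d+(?![)\\s]))[,.\\s]?(\\d+)?))");

    // $1 = whole citation, $2 = book abbreviation, $8 = chapter and verse.
    static String tag(String text) {
        return CITATION.matcher(text)
            .replaceAll("<ref cRef=\"$2\" loc=\"$8\">$1</ref>");
    }

    public static void main(String[] args) {
        // Two citations taken from the sample entry at the top of the page.
        System.out.println(tag("Apoc. 19,1; Tob. 13,19"));
    }
}
```

With this replacement the cRef value keeps the trailing period of the abbreviation (Apoc. rather than Apoc), so a clean-up step would still be needed if the attribute is meant to hold a bare book name.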
