Pietro Liuzzo edited this page Feb 9, 2017 · 6 revisions

Ignore diacritics

Description

When searching for negast, also search for negaśt.

Notes

Before Lucene parsing and sanitization, add the alternative spellings as options and change the search mode to any.

Implementation

app:createquery

At the moment it replaces:

  • (replace($query-string, 's', 'ś'), replace($query-string, 's', 'š'))
  • (replace($query-string, 'a', 'ä'), replace($query-string, 'a', 'ā'))
  • replace($query-string, 'e', 'ǝ')
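The effect of these three rules can be sketched outside the application. The block below is an illustrative Python rendering, not the XQuery in app:createquery; the function name `current_variants` and the list return type are assumptions made for the example. Each rule replaces every occurrence of its letter, and the rules are applied separately rather than combined.

```python
# Illustrative Python rendering of the three replacement rules listed above.
# The function name and return type are assumptions, not the XQuery code.

def current_variants(query_string):
    """Return the query plus the spellings produced by each rule on its own."""
    candidates = [
        query_string,
        query_string.replace("s", "ś"),
        query_string.replace("s", "š"),
        query_string.replace("a", "ä"),
        query_string.replace("a", "ā"),
        query_string.replace("e", "ǝ"),
    ]
    # A rule that matches nothing returns the input unchanged, so drop
    # duplicates while keeping the original order.
    return list(dict.fromkeys(candidates))


print(current_variants("negast"))
```

For negast this yields six spellings, including negaśt. A spelling that needs two rules at once, such as nǝgaśt, is not produced, because each rule is applied to the original string only.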

Letters used in transcription:

  • consonants: h, l, ḥ, m, ś, r, s, q, b, t, ḫ, n, ʾ, k, w, ʿ, z, y, d, g, ġ, ṭ, ṗ, ṣ, ḍ, ḏ, f, p, š, ǧ, č, č̣, ñ, ḵ, ṯ, ž, ẓ, q̱
  • vowels: a, u, i, ā, e, ə, o; also ē, ī, ō, ū

For search mapping (some are already mapped by Unicode, some not):

  • h finds h ḥ ḫ ḵ
  • p finds p ṗ
  • s finds s ś ṣ ḍ š
  • d finds d ḍ ḏ
  • k finds k ḵ
  • n finds n ñ
  • c finds c č č̣
  • g finds g ǧ ġ
  • t finds t ṯ ṭ
  • z finds z ž ẓ
  • ' finds ʾ ʿ
  • e finds e ə ē
  • o finds o ō
  • u finds u ū
  • q finds q q̱
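One way to see which entries of the search mapping above might already be covered by Unicode is to decompose each letter and drop its combining marks. The sketch below is in Python rather than the application's XQuery, and it assumes that "mapped by Unicode" means canonical decomposition (NFD); the names `SEARCH_MAP`, `strip_marks` and `needs_explicit_mapping` are hypothetical. Whether the Lucene analyzer in use folds letters the same way is not stated here and would need checking.

```python
import unicodedata

# The search mapping listed above: typed key -> letters it should find.
SEARCH_MAP = {
    "h": ["h", "ḥ", "ḫ", "ḵ"],
    "p": ["p", "ṗ"],
    "s": ["s", "ś", "ṣ", "ḍ", "š"],
    "d": ["d", "ḍ", "ḏ"],
    "k": ["k", "ḵ"],
    "n": ["n", "ñ"],
    "c": ["c", "č", "č̣"],
    "g": ["g", "ǧ", "ġ"],
    "t": ["t", "ṯ", "ṭ"],
    "z": ["z", "ž", "ẓ"],
    "'": ["ʾ", "ʿ"],
    "e": ["e", "ə", "ē"],
    "o": ["o", "ō"],
    "u": ["u", "ū"],
    "q": ["q", "q̱"],
}


def strip_marks(letter):
    """Decompose to NFD and drop combining marks (assumed meaning of 'mapped by Unicode')."""
    decomposed = unicodedata.normalize("NFD", letter)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def needs_explicit_mapping(search_map):
    """Return, per key, the variants that mark stripping does NOT reduce to the key."""
    remaining = {}
    for key, variants in search_map.items():
        unfolded = [v for v in variants if strip_marks(v) != key]
        if unfolded:
            remaining[key] = unfolded
    return remaining


for key, variants in needs_explicit_mapping(SEARCH_MAP).items():
    print(key, "still needs:", " ".join(variants))
```

Under this assumption, the pairs that need an explicit rule are the ones where the target is a different base letter (ḵ under h, ḍ under s) or has no decomposition at all (ə, ʾ, ʿ); the remaining letters reduce to their key once the marks are removed.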

It would be better to have a function that calculates the options each time from any input and returns the alternatives.
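A minimal sketch of such a function, in Python rather than the application's XQuery, could build the options per character and combine them. The names `VARIANTS` and `alternatives` are hypothetical, and the table is only an excerpt holding the three rules currently listed for app:createquery; the full table would come from the list of needed variants.

```python
from itertools import product

# Excerpt only: the three replacement rules currently listed for
# app:createquery. The table name and shape are assumptions.
VARIANTS = {
    "s": ["ś", "š"],
    "a": ["ä", "ā"],
    "e": ["ǝ"],
}


def alternatives(query_string, variants=VARIANTS):
    """Return every spelling obtained by choosing, for each character,
    either the character itself or one of its mapped variants."""
    # One option list per character; unmapped characters have a single option.
    options = [[ch] + variants.get(ch, []) for ch in query_string]
    # The Cartesian product combines the choices across all positions,
    # with the unchanged input as the first result.
    return ["".join(combo) for combo in product(*options)]


print(len(alternatives("negast")))
print(alternatives("negast")[:4])
```

Unlike separate whole-string replacements, this also produces mixed spellings such as nǝgaśt. The number of alternatives is the product of the option counts per character, so long inputs with a full mapping table can grow quickly; a cap or a per-word limit may be needed before the result is passed on as an any-mode query.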

Prerequisites

A list with all needed variants.

back to list
