Skip to content

A synonym search engine based on XML and Perl.

Notifications You must be signed in to change notification settings

old-zet/synonym_engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

synonym_engine

French synonym search engine based on Perl regex and XML traversal. The chosen XML corpus (cf. rali.iro.umontreal.ca/DEM//DEM-1_1.xml.zip) encodes extensively around 70k French words, including their domain use, semantic features, and syntactic particularities.

The XML tree traversal algorithm retrieves the relevant information for the input word then overlays it against every XML entry to find similarities. Overlaying domain, semantic, and syntactic properties, encapsulated in the , and tags, gives fairly reliable results, more for monosemantic words (1 word = 1 meaning, e.g. house) than polysemantic ones (1 word = n>1 meanings, e.g. mouse (animal VS PC accessory)).

The output is an HTML file dividing the entry word synonyms into 3 categories from least to most relevant considering the occurrence rate and the weight of given tags.

About

A synonym search engine based on XML and Perl.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published