
A Feature Based Approach towards Lexical Simplification


dhruvp-8/word-complexity


Lexical Simplification (Feature Based Approach)

This project performs lexical simplification of a sentence using a feature-based approach. The model is based on the one described in the paper "SemEval-2012 Task 1: English Lexical Simplification" by Lucia Specia and Sujay Kumar Jauhar.

The Model

The model consists of three modules:

  1. Feature Extraction Module (FEM)
  2. Word Substitution Ranking Module (WSRM)
  3. Result Fetching Module (RFM)
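The way the three modules hand data to one another can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the candidate substitutes are supplied by hand, and word length stands in for whatever features the real FEM computes.

```python
# Hypothetical three-stage pipeline; every name here is illustrative
# and not taken from the project's code.

def extract_features(word):
    """FEM stand-in: describe a word with a simple surface feature."""
    return {"word": word, "length": len(word)}


def rank_substitutes(candidates):
    """WSRM stand-in: order candidates from shortest to longest."""
    features = [extract_features(c) for c in candidates]
    return [f["word"] for f in sorted(features, key=lambda f: f["length"])]


def fetch_result(sentence, target, candidates):
    """RFM stand-in: swap the target word for the top-ranked candidate."""
    best = rank_substitutes(candidates)[0]
    return " ".join(best if tok == target else tok for tok in sentence.split())


simplified = fetch_result(
    "John revoked his admit",
    target="revoked",
    candidates=["rescinded", "withdrew", "invalidated"],
)
print(simplified)
```

With these hand-picked candidates the script prints `John withdrew his admit`. A real ranking would likely combine several features rather than length alone.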


Prerequisites

The following are required to run the software:

python 3.x
nltk
bs4
urllib3
pyphen
pywsd
wiktionaryparser
ast, string, operator (part of the Python standard library; no installation needed)

Installing

Follow these steps to set up a development environment.

Install Python 3.6 and add it to your PATH

Install pip

Install pip, the package manager for Python, by following this link: [Download pip](https://pip.pypa.io/en/stable/installing/).

Install nltk

pip install nltk

Standard-library modules

ast and operator ship with Python and do not need to be installed with pip.

Install the remaining third-party prerequisites in the same way.
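To see which third-party prerequisites are still missing, a small check like the one below may help. It is a sketch, not part of the project: the `REQUIRED` list and the `missing_packages` helper are names chosen here, and it assumes each pip package name matches its import name, which is not always the case (bs4, for example, is usually installed as beautifulsoup4).

```python
import importlib.util
import sys

# Import names of the third-party prerequisites named in this README.
REQUIRED = ["nltk", "bs4", "urllib3", "pyphen", "pywsd", "wiktionaryparser"]


def missing_packages(names):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    # Print the command instead of running it, so nothing is installed silently.
    print(f"{sys.executable} -m pip install " + " ".join(missing))
else:
    print("All prerequisites are importable.")
```

Using `sys.executable -m pip` ties the suggested command to the interpreter that ran the check, which avoids installing into a different Python installation.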

Feature Extraction Module Data

The JSON object returned by the FEM is shown in the figures below:

[Figures: JSON object returned by the FEM]
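As a rough illustration of what a per-word feature record could look like as JSON, consider the sketch below. The field names (`length`, `syllables`) and the vowel-run syllable heuristic are assumptions made for this example, not the project's actual schema; the real module presumably relies on libraries such as pyphen for syllable information.

```python
import json
import re


def count_syllables(word):
    """Rough syllable estimate: count runs of vowels (not real hyphenation)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def feature_record(word):
    """Build one JSON-serializable record; the field names are made up."""
    return {
        "word": word,
        "length": len(word),
        "syllables": count_syllables(word),
    }


records = [feature_record(w) for w in ["revoked", "withdrew"]]
print(json.dumps(records, indent=2))
```

Keeping each record a plain dictionary of strings and numbers means it can be serialized with `json.dumps` and passed to a ranking step without any custom encoding.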

Final Output of the model

For an input sentence, the model produces the following final output:

Input Sentence is: John revoked his admit


Output Sentence is: John withdrew his admit

Running the tests

There are currently no tests.

Deployment

Not Deployed.

Versioning

I use SemVer for versioning. For the versions available, see the tags on this repository.

Authors

License

Acknowledgments