Skip to content

Classify antioxidant property on protein sequence based on protein sequence feature

Notifications You must be signed in to change notification settings

gitakartika/antioxidant-protein-classification

Repository files navigation

antioxidant-protein-classification

This project goal is to classify either the protein sequences have antioxidant property or not. To achieve this goal, we perform:

  1. Feature extraction: To obtain protein feature based on their sequences
  2. Feature selection: To select only impactful feature, there are two ways to get these features: a) Eliminate high correlated features b) Perform RFECV
  3. Analyze optimal SVM parameter to determined which parameter we used on hyperparameter tuning
  4. Hyperparameter Tuning using SVM
  5. Evaluate model using data testing
  6. Analyze whether the model is overfit or not.