Skip to content

EESH-843/stackoverflow_tag_prediction

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stack Overflow Tag Prediction

Stack Overflow Logo

Overview

Welcome to the "Stack Overflow Tag Prediction" repository! This project is dedicated to predicting tags (or labels) for questions posted on Stack Overflow. The primary objective is to develop a machine learning model capable of automatically assigning relevant tags to user-submitted questions. This prediction system aims to simplify the process of finding answers and experts in specific topics on the platform.

Why Tag Prediction Matters:

Effective tagging is essential on Stack Overflow to ensure that questions reach the right audience and receive accurate responses. However, manual tagging can be cumbersome and time-consuming for users. Our project tackles this challenge by automating the tagging process, making it easier for both question askers and answerers to navigate the vast knowledge repository of Stack Overflow.

The Impact:

With accurate tag predictions, users can find answers to their questions more quickly, and experts can discover questions aligned with their expertise. This leads to a more efficient and collaborative community, ultimately enhancing the Stack Overflow experience for everyone.

Technologies used:

  • React js
  • Node js
  • Express js
  • MongoDb
  • Redux
  • Json web token
  • Machine Learning
  • NLP

Getting Started

To begin working with this project, follow these steps:

  1. Clone this repository to your local machine:

    git clone https://github.com/tk-pranav/stackoverflow_tag_prediction.git
    
  2. Install required dependencies using the command:

    npm install
    
  3. Install the required Python packages:

    pip install -r requirements.txt
    
  4. Start the App using the command:

    npm start
    
  5. Explore the Jupyter notebooks for data analysis and model development.

  6. Train and evaluate the tag prediction model using the dataset and code.

Dataset

We used a Kaggle dataset for our project.

Link: https://www.kaggle.com/datasets/stackoverflow/stacksample

Model

Our approach involves employing machine learning techniques, including natural language processing (NLP) and classification algorithms, to predict tags for Stack Overflow questions. The model is trained on historical question data and can predict relevant tags based on the content of the questions.

Results

For detailed insights into the model's performance and evaluation metrics, please refer to the Jupyter notebook. We continuously strive to enhance the model's accuracy and efficiency.

Contributing

We enthusiastically welcome contributions from the open-source community. If you wish to contribute to this project, please adhere to these guidelines:

Fork the repository.

  • Create a new branch for your changes.
  • Implement your contributions.
  • Submit a pull request with a clear description of your changes.

Happy Coding!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 74.9%
  • JavaScript 19.6%
  • CSS 4.9%
  • HTML 0.6%