BEE is an open-source GitHub app that analyzes the issues that users submit to GitHub repositories. BEE supports issues written in any form, any format, and for any software system.
BEE focuses on analyzing three elements in bug reports (i.e., issues that describe software bugs):
- The system's observed (unexpected) behavior (OB)
- The system's expected behavior (EB)
- The steps to reproduce the bug (S2R)
BEE provides three main features:
- Detect the type of an issue: a bug report, an enhancement, or a question
- Detect when the OB, EB, and/or S2R are not provided by the user in their bug reports.
- Structure bug descriptions, by identifying and labeling the sentences that describe the OB, EB, and/or S2R
BEE can be installed as an GitHub app on any GitHub repository. BEE also offers a public web API for identification of the OB, EB, and S2R in any piece of text
A video demonstration of BEE can found here.
BEE is meant to help users provide better bug reports, increase the productivity of developers, and help researchers in their investigations.
As a developer, you can use BEE to:
- prioritize and manage incoming issues in your software projects
- check whether bug reports are complete or incomplete
- prompt reporters to provide the missing information when bug reports are incomplete
- easily understand and spot the main elements in bug descriptions (the OB, EB, and S2R)
- automate issue/bug report analysis or develop new tools using BEE's web API
As a researcher, you can use BEE's web API to investigate bug descriptions and to automate tasks that rely on them, such as bug reproduction, localization, duplicate detection, bug report quality assessment, etc.
Using BEE is simple. All you need to do is adding BEE to your GitHub repositories by following this link. Once installed, BEE will analyze any incoming issue within your repositories.
Besides, you can use BEE's web API in your script, app, or tool, by sending an HTTP POST request to http://rocco.cs.wm.edu:21203/api
, in the following format:
{
"text":"I experienced a crash when opening the settings menu"
}
The API would return a response with the sentences labeled as OB, EB, and/or S2R if they convey these elements:
{
"code":200,
"status":"success",
"bug_report":{
"0":{
"text":"I experienced a crash when opening the settings menu",
"labels":[
"OB",
"S2R"
]
}
}
}
BEE is implemented using Node.js, under the following architecture:
The main components of BEE are the following:
- GitHub Controller: it handles the communication with GitHub and controls how other BEE components interact.
- Web API Controller: it is the web interface and controller of BEE's API.
- Issue Classifier: it classifies a GitHub issue as a bug, an enhancement, or a question. The classifier is a multi-class linear neural model based on fastText.
- Sentence Classifier: it classifies each sentence of a bug report as OB, EB, or S2R (i.e., the sentence describes any of these elements. Three binary Support Vector Machines (SVMs) implemented on SVM light are used for sentence classification. The Stanford CoreNLP library is used to preprocess the bug report.
- Missing Elements Detector: based on sentence classification, it detects if a bug report lacks the OB, EB, and S2R.
We are more than happy to receive your contributions (any kind of contributions). If you have an idea of a feature or enhancement, or if you find a bug, please open an issue (or a pull request). If you have questions, feel free to contact us: Yang Song (ysong10 at email.wm.edu) & Oscar Chaparro (oscarch at wm.edu)
First, fork the repository and make sure you:
- use java 8
- install nodejs-v14.21.3. and npm to compile/install dependencies
- create and register a Github App of your own and install it in your repositories
After cloning the forked repo, create a .env file in the src folder and set the right environment variables as described here. This is the format of the file:
GITHUB_APP_ID = xxxxx
GITHUB_PRIVATE_KEY = "-----BEGIN RSA PRIVATE KEY-----
...
HkVN9...
...
-----BEGIN RSA PRIVATE KEY-----"
GITHUB_WEBHOOK_SECRET = xxx (optional)
PORT = xxxx
Download Stanford CoreNLP 3.9.0 and run the server it provides:
# Run the server using all jars in the current directory (e.g., the CoreNLP home directory),
java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000 -timeout 15000
CoreNLP connects by default via StanfordCoreNLPServer, using port 9000. You can use a different port if you want.
Install the dependencies and run the app:
# Install dependencies
npm install
# Run the app server
npm start
You may need to use ngrok
to expose your app to the Internet (see this article from information).
Note: the binary files for OB, EB, and S2R in /github-app/models
can be executed on Linux(64-bit). If you want to
run it on other systems, you can download the binaries from here, and then train your own OB, EB, and S2R models.
BEE is an open-source GitHub app that analyzes the issues that users submit to GitHub repositories.
Copyright (C) 2020 Yang Song, Oscar Chaparro
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program. If not, see <http://www.gnu.org/licenses/>.
- Ticket Tagger by Rafael Kallis
BEE is developed by Yang Song and Oscar Chaparro from the College of William & Mary.