-
Notifications
You must be signed in to change notification settings - Fork 12
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
dbc3b75
commit 72b8c34
Showing
3 changed files
with
30 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,14 +1,21 @@ | ||
# Guidelines for Git and GitHub Workflows | ||
In this series of documents, we present what we consider best practices for executing data science projects. It’s important to note that these practices are tailored specifically to the work of the <span style="color:#3EACAD">Data Lab</span>. While they may not be universally applicable to all data science projects, we believe they remain highly valuable. | ||
This section provides essential guidelines for using Git and GitHub effectively, ensuring a structured and collaborative workflow for all team members in a project. By following these practices—such as consistently ignoring the "data" folder to protect sensitive information, avoiding direct pushes to the main branch, creating descriptive branch names, and submitting pull requests once work on a branch is complete—we can maintain a clean, organized codebase and promote efficient collaboration. These guidelines help uphold version control best practices, streamline teamwork, and reduce the potential for errors in project repositories. | ||
|
||
These documents will cover the following topics: | ||
|
||
## Branch Names and Other General Practices | ||
- **Branch names**. After joining the project and cloning the repository, create a concise, descriptive branch name for your work and ensure you switch to that branch before beginning any work on your machine. | ||
- **Update branches**. Avoid creating new update branches; instead, push your changes and resolve any conflicts directly. For instance, if bots in the repository modify your code (e.g., adjusting indentations), simply pull these changes before pushing your own updates. | ||
- **Pull requests (PR)**. When you believe your changes are final, create a pull request and assign the project lead as the reviewer. | ||
|
||
## Folders and Files to Ignore | ||
As all data science repos in the Data Lab use this template, the project repo will come with ```.ignore``` file prepopulated with most files and folders which need to be ignored. However, once you join the project and create your own branch. You will have to make sure that the following folders are being ignored. | ||
- Data folder | ||
- Virtual environments (```.venv```) | ||
- Environment (```.env```) | ||
Feel free to add any other files (e.g., system files specific to your OS) to the ```.gitignore``` | ||
|
||
|
||
1.**Folder Structure and Naming Conventions for Project Setup** | ||
|
||
2. **Git and GitHub Workflow Standards and Guidelines** | ||
|
||
3.**Standards for Documenting and Styling Analytical Notebooks** | ||
|
||
4.**Guidelines for Communicating and Presenting Data Outputs.** | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
# Guidelines for Documenting and Styling Analytical Notebooks | ||
This section provides best practices for structuring analytical notebooks to enhance readability. The guidelines include recommendations for hiding code cells to maintain a clean appearance in Jupyter Book, incorporating references where relevant, and organizing content logically to ensure clarity for readers. | ||
|
||
- **Structure**. In all the Data Lab projects, please follow [this analytics structure](https://github.com/worldbank/sudan-poverty-monitoring/blob/main/docs/2-analytics.md). | ||
- **Editing _toc.yml** | ||
- **Removing/hiding cell blocks** All notebooks will be rendered in Jupyter Book. To enhance readability, ensure code cells are hidden or removed using cell tags. In some cases, you may use the hide-input cell tag. | ||
|
||
|
||
|
||
|