Get started with Bioinfo Tools in 5 minutes!
# Clone the repository
git clone https://github.com/Mxrcon/Bioinfo-python-scripts.git
cd Bioinfo-python-scripts
# Install in development mode
pip install -e .
# Verify installation
bioinfo-tools --version
# Or: python -m bioinfo_tools --version

# Install pixi (if not already installed)
curl -fsSL https://pixi.sh/install.sh | bash
# Clone and use
git clone https://github.com/Mxrcon/Bioinfo-python-scripts.git
cd Bioinfo-python-scripts
pixi install
pixi shell

Let's extract some genes from GenBank files!
Create a gene list file, genes.txt, with one gene name per line:
dnaA
rpoB
recA
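
If the gene names come from another script, the list file can be generated rather than typed by hand. The following is a minimal sketch; the helper name write_gene_list is hypothetical and not part of bioinfo-tools, and it assumes the one-name-per-line layout shown above.

```python
from pathlib import Path


def write_gene_list(genes, path="genes.txt"):
    """Write gene names to `path`, one per line.

    Hypothetical helper (not part of bioinfo-tools). Blank entries and
    exact (case-sensitive) duplicates are dropped; input order is kept.
    """
    unique = []
    for gene in genes:
        name = gene.strip()
        if name and name not in unique:
            unique.append(name)
    Path(path).write_text("\n".join(unique) + "\n", encoding="utf-8")
    return unique
```

Calling write_gene_list(["dnaA", "rpoB", "recA"]) would produce the same genes.txt as the listing above.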
bioinfo-tools extract-genes -i path/to/genbank_files/ \
    -g genes.txt \
    -o output_genes/

That's it! Your extracted sequences are now in output_genes/.
bioinfo-tools extract-proteins -i genbank_files/ -g genes.txt -o proteins/

Output structure:
proteins/
├── dnaA/
│   └── genome1.fasta
├── rpoB/
│   └── genome1.fasta
└── recA/
    └── genome2.fasta
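
To check which genes were found in which genomes, the output directory can be summarized with a few lines of Python. This is only a sketch: it assumes the one-directory-per-gene layout shown above with files ending in .fasta, and summarize_output is a hypothetical helper, not a bioinfo-tools command.

```python
from pathlib import Path


def summarize_output(output_dir):
    """Map each gene subdirectory to the sorted FASTA file names inside it.

    Assumes the layout <output_dir>/<gene>/<genome>.fasta shown above.
    """
    summary = {}
    for gene_dir in sorted(Path(output_dir).iterdir()):
        if gene_dir.is_dir():
            summary[gene_dir.name] = sorted(p.name for p in gene_dir.glob("*.fasta"))
    return summary
```

For example, summarize_output("proteins/") returns a dictionary keyed by gene name, so a gene with an empty list is one for which no sequence was written.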
bioinfo-tools extract-cds -i genbank_files/ -g genes.txt -o filtered_gbk/

bioinfo-tools blast -q query_sequences/ \
    -d database_sequences/ \
    -t nucl \
    -b blastn \
    -e 1e-5

# General help
bioinfo-tools --help
# Help for a specific command
bioinfo-tools extract-genes --help

Verify everything works:
# Using the test script
python tests/test_scripts.py
# Or with Python module syntax
python -m unittest tests.test_scripts

- Read the README.md for detailed information
- Check COMMAND_REFERENCE.md for all available options
- See MIGRATION.md if you're upgrading from old scripts
- Read CONTRIBUTING.md if you want to add features
Install the package:
pip install -e .

Or use Python module syntax:
python -m bioinfo_tools extract-genes --help

Make sure your GenBank files have .gbk, .gb, or .genbank extensions.
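
A quick way to spot misnamed input files is to split a directory's contents by extension before running a command. The sketch below is an illustration only: split_by_extension is a hypothetical helper, and it matches the three extensions exactly as written above (lowercase), since whether the tool also accepts other capitalizations is not stated here.

```python
from pathlib import Path

# Extensions listed in this guide; exact-match comparison is an assumption.
GENBANK_EXTENSIONS = {".gbk", ".gb", ".genbank"}


def split_by_extension(input_dir):
    """Return (recognized, other) file names in `input_dir`, each sorted."""
    recognized, other = [], []
    for path in sorted(Path(input_dir).iterdir()):
        if not path.is_file():
            continue  # subdirectories are ignored in this sketch
        if path.suffix in GENBANK_EXTENSIONS:
            recognized.append(path.name)
        else:
            other.append(path.name)
    return recognized, other
```

Any file that lands in the second list may need to be renamed before it is used as input.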
- Open an issue: https://github.com/Mxrcon/Bioinfo-python-scripts/issues
- Email: davijosuemarcon@gmail.com
Complete example from GenBank to BLAST results:
# 1. Filter GenBank files to keep only genes of interest
bioinfo-tools extract-cds -i raw_genbank/ -g important_genes.txt -o filtered_gbk/
# 2. Extract protein sequences
bioinfo-tools extract-proteins -i filtered_gbk/ -g important_genes.txt -o proteins/
# 3. Run BLAST against a reference database
bioinfo-tools blast -q proteins/dnaA/ \
    -d reference_db/ \
    -t prot \
    -b blastp \
    -e 1e-10 \
    -o blast_results/

That's all you need to get started! 🚀
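
The three workflow steps can also be chained from Python so that a failure in one step stops the rest. This is a rough sketch, not part of the package: build_workflow and run_workflow are hypothetical names, the commands and flags are copied from the example above, and bioinfo-tools is assumed to be on the PATH.

```python
import subprocess


def build_workflow(raw_dir, gene_list, gene, reference_db):
    """Return the three commands from the workflow example as argument lists."""
    return [
        ["bioinfo-tools", "extract-cds", "-i", raw_dir,
         "-g", gene_list, "-o", "filtered_gbk/"],
        ["bioinfo-tools", "extract-proteins", "-i", "filtered_gbk/",
         "-g", gene_list, "-o", "proteins/"],
        ["bioinfo-tools", "blast", "-q", f"proteins/{gene}/",
         "-d", reference_db, "-t", "prot", "-b", "blastp",
         "-e", "1e-10", "-o", "blast_results/"],
    ]


def run_workflow(commands):
    """Run each command in order; check=True raises on a non-zero exit code."""
    for command in commands:
        subprocess.run(command, check=True)
```

A call such as run_workflow(build_workflow("raw_genbank/", "important_genes.txt", "dnaA", "reference_db/")) reproduces the example; passing argument lists instead of a shell string avoids quoting problems with paths that contain spaces.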