Phylogenetic_Relationships

This project performs an automated phylogenetic analysis of nucleotide sequences retrieved from NCBI. The pipeline includes sequence alignment, concatenation, phylogenetic inference (maximum likelihood, maximum parsimony, and Bayesian), and formatting for visualization.

🔧 Requirements

Software and Tools

MAFFT v7.505 – Multiple sequence alignment.
ModelTest-NG v0.1.7 – Substitution model selection.
RAxML-NG v1.2.2 – Maximum likelihood phylogenetic inference.
MPBoot – Maximum parsimony inference.
MrBayes – Bayesian phylogenetic inference.
FigTree v1.4.4 – Phylogenetic tree visualization.
FASTA2NEX – FASTA to NEXUS converter.

Note: The original code was modified to better fit the workflow of this pipeline. The adapted version is available in this repository.
Concatenate Fasta Tool – Concatenate multiple FASTA files.

Python Libraries

Biopython
pandas

Install with:

pip install biopython pandas

or

micromamba install -n myenv biopython pandas -c conda-forge

📂 Pipeline Structure

1. Download and Align Sequences

python3 Fasta_Entrez.py 'nucleotide' 'ACC_RANGE[accn]' 1000
mv algn_*.fasta algn_NAME.fasta

2. Convert Sequence IDs to Names

python3 names_converter.py algn_NAME.fasta ids_names.csv
mv algn_NAME_mod.fasta algn_NAME.fasta

3. Concatenate FASTA Files

mkdir concat_seq
cp algn_*.fasta ./concat_seq/
python3 Concatenate.py concat_seq/ concat_os.fasta
rm -rf concat_seq/

4. Maximum Parsimony Inference (MPBoot)

mpboot -s algn_NAME.fasta -pre max_parsimony_NAME -bb 4000

5. Evolutionary Model Selection (ModelTest-NG)

modeltest-ng -i algn_NAME.fasta

6. Maximum Likelihood Inference (RAxML-NG)

raxml-ng --msa algn_NAME.fasta --model MODEL --prefix NAME --threads 5 --seed 2
raxml-ng --bootstrap --msa algn_NAME.fasta --model MODEL --prefix NAME --seed 2 --threads 5
raxml-ng --support --tree NAME.raxml.bestTree --bs-trees NAME.raxml.bootstraps --prefix NAME --threads 5

7. Convert to NEXUS Format

python3 fasta2nex.py algn_NAME.fasta > algn_NAME.nexus

8. Bayesian Inference (MrBayes)

mb -i algn_NAME.nexus

Notes

Replace NAME with actual dataset names such as adh1, os1283, os9971, os17357, etc.
The evolutionary models used in RAxML-NG must be chosen based on the output from modeltest-ng.
It is recommended to execute each block using shell scripts or within a controlled environment (e.g., Micromamba).

Collaborators

Aleff Cavalcante
Alexandre Soares
Ana Fernando
Ravi Silva

Credits

Rendrick Carreira for providing the code to transform FASTA to NEXUS

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Concatenate.py		Concatenate.py
Fasta_Entrez.py		Fasta_Entrez.py
LICENSE		LICENSE
Max_likelihood_trees.zip		Max_likelihood_trees.zip
Max_parsimony_trees.zip		Max_parsimony_trees.zip
MrBayes_trees.zip		MrBayes_trees.zip
README.md		README.md
fasta2nex.py		fasta2nex.py
ids_names.csv		ids_names.csv
names_converter.py		names_converter.py
shell_commands.txt		shell_commands.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Phylogenetic_Relationships

🔧 Requirements

Software and Tools

Python Libraries

📂 Pipeline Structure

1. Download and Align Sequences

2. Convert Sequence IDs to Names

3. Concatenate FASTA Files

4. Maximum Parsimony Inference (MPBoot)

5. Evolutionary Model Selection (ModelTest-NG)

6. Maximum Likelihood Inference (RAxML-NG)

7. Convert to NEXUS Format

8. Bayesian Inference (MrBayes)

Notes

Collaborators

Credits

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

AceSCav/Phylogenetic_Relationships

Folders and files

Latest commit

History

Repository files navigation

Phylogenetic_Relationships

🔧 Requirements

Software and Tools

Python Libraries

📂 Pipeline Structure

1. Download and Align Sequences

2. Convert Sequence IDs to Names

3. Concatenate FASTA Files

4. Maximum Parsimony Inference (MPBoot)

5. Evolutionary Model Selection (ModelTest-NG)

6. Maximum Likelihood Inference (RAxML-NG)

7. Convert to NEXUS Format

8. Bayesian Inference (MrBayes)

Notes

Collaborators

Credits

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages