Inferring a phylogeny
There are multiple way to infer a phylogeny. Here, we will follow a maximum likelihood approach, and use the software raxml-ng.
We will use the following command to infer a quick tree with 100 bootstrap replicates:
raxml-ng --all --msa all_loci.fasta --model GTR+G --prefix tree --threads 2 --seed 2 --bs-metric fbp
We can check whether the topplogies converge with:
raxml-ng --rfdist --tree tree.raxml.mlTrees --prefix RF
And we can check the convergence of the bootstrap with:
raxml-ng --bsconverge --bs-trees tree.raxml.bootstraps --prefix t2 --seed 2 --threads 2 --bs-cutoff 0.01
Don’t forget to check raxml-ng tutorial for more options: https://github.com/amkozlov/raxml-ng/wiki/Tutorial
The file ending in .support
shows our phylogeny with bootstrap support. We can open it in FigTree.
You might want to change the label names to species name + accession, instead of only accession. We can do that with phyx, more specifically with pxrlt
(taxon relabelling for trees). We will need list of current and new names.
For the current names, we can simply get from out acc
file, and the new names we can get from the Nicholls et al. 2015 paper:
I_auristellae_FG113
I_cylindrica_FG35
I_nouragensis_FgIntype
I_cinnamomea_KGD465
Zygia_mediana_Zygia917
pxrlt -t tree.raxml.support -c current -n new > tree.raxml.support.names
Now we have a phylogeny with branch support values and proper label names. Enjoy!