The MSP algorithms has been intensively evaluated on several real world DNA sequence datasets. Here we provide the links to the datasets used in our paper.

Budgerigar (bird)

This dataset is the sequence data of Budgerigar (bird) from the Assemblathon website (http://assemblathon.org). You can click here to browse the dataset. Thanks BGI for providing this dataset.

Lake Malawi cichlid (fish)

This dataset is the sequence data of Lake Malawi cichlid (fish), which is also from the Assemblathon website (http://assemblathon.org). You can click here to browse the dataset. Thanks Broad Insitute for providing this dataset.

Bombus impatiens (bee)

This dataset is the sequence data of Bombus impatiens (bee) from the GAGE website (http://gage.cbcb.umd.edu/). You can click here to browse the dataset. Thanks UMD for providing this dataset.