Aligning Protein-Coding Nucleotide Sequences with MACSE - Agropolis Accéder directement au contenu
Chapitre D'ouvrage Année : 2021

Aligning Protein-Coding Nucleotide Sequences with MACSE

Résumé

Most genomic and evolutionary comparative analyses rely on accurate multiple sequence alignments. With their underlying codon structure, protein-coding nucleotide sequences pose a specific challenge for multiple sequence alignment. Multiple Alignment of Coding Sequences (MACSE) is a multiple sequence alignment program that provided the first automatic solution for aligning protein-coding gene datasets containing both functional and nonfunctional sequences (pseudogenes). Through its unique features, reliable codon alignments can be built in the presence of frameshifts and stop codons suitable for subsequent analysis of selection based on the ratio of nonsynonymous to synonymous substitutions. Here we offer a practical overview and guidelines on the use of MACSE v2. This major update of the initial algorithm now comes with a graphical interface providing user-friendly access to different subprograms to handle multiple alignments of protein-coding sequences. We also present new pipelines based on MACSE v2 subprograms to handle large datasets and distributed as Singularity containers. MACSE and associated pipelines are available at: https://bioweb.supagro.inra.fr/macse/.
Fichier principal
Vignette du fichier
Ranwez-MethMolBiol-HAL.pdf (8.67 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03099847 , version 1 (06-01-2021)

Identifiants

Citer

Vincent Ranwez, Nathalie Chantret, Frédéric Delsuc. Aligning Protein-Coding Nucleotide Sequences with MACSE. Methods in Molecular Biology vol. 2231: Multiple Sequence Alignment., 2231, pp.51-70, 2021, ⟨10.1007/978-1-0716-1036-7_4⟩. ⟨hal-03099847⟩
124 Consultations
355 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More