Skip NavigationSkip to Content

dagLogo: An R/Bioconductor package for identifying and visualizing differential amino acid group usage in proteomics data

  1. Author:
    Ou, Jianhong
    Liu, Haibo
    Nirala, Niraj K.
    Stukalov, Alexey
    Acharya,Usha
    Green, Michael R.
    Zhu, Lihua Julie
  2. Author Address

    Univ Massachusetts, Sch Med, Dept Mol Cell & Canc Biol, Worcester, MA 02721 USA.Duke Univ, Sch Med, Regenerat NEXT, Durham, NC USA.Univ Massachusetts, Sch Med, Program Mol Med, Worcester, MA 02721 USA.Tech Univ Munich, Inst Virol, Munich, Germany.Univ Massachusetts, Sch Med, Program Bioinformat & Integrat Biol, Worcester, MA 02721 USA.NCI, Canc & Dev Biol Lab, Frederick, MD 21701 USA.
    1. Year: 2020
    2. Date: NOV 6
  1. Journal: PLOS ONE
  2. PUBLIC LIBRARY SCIENCE,
    1. 15
    2. 11
  3. Type of Article: Article
  4. Article Number: e0242030
  5. ISSN: 1932-6203
  1. Abstract:

    Sequence logos have been widely used as graphical representations of conserved nucleic acid and protein motifs. Due to the complexity of the amino acid (AA) alphabet, rich post-translational modification, and diverse subcellular localization of proteins, few versatile tools are available for effective identification and visualization of protein motifs. In addition, various reduced AA alphabets based on physicochemical, structural, or functional properties have been valuable in the study of protein alignment, folding, structure prediction, and evolution. However, there is lack of tools for applying reduced AA alphabets to the identification and visualization of statistically significant motifs. To fill this gap, we developed an R/Bioconductor package dagLogo, which has several advantages over existing tools. First, dagLogo allows various formats for input sets and provides comprehensive options to build optimal background models. It implements different reduced AA alphabets to group AAs of similar properties. Furthermore, dagLogo provides statistical and visual solutions for differential AA (or AA group) usage analysis of both large and small data sets. Case studies showed that dagLogo can better identify and visualize conserved protein sequence patterns from different types of inputs and can potentially reveal the biological patterns that could be missed by other logo generators.

    See More

External Sources

  1. DOI: 10.1371/journal.pone.0242030
  2. WOS: 000591376400029

Library Notes

  1. Fiscal Year: FY2020-2021
NCI at Frederick

You are leaving a government website.

This external link provides additional information that is consistent with the intended purpose of this site. The government cannot attest to the accuracy of a non-federal site.

Linking to a non-federal site does not constitute an endorsement by this institution or any of its employees of the sponsors or the information and products presented on the site. You will be subject to the destination site's privacy policy when you follow the link.

ContinueCancel