An open-source software suite for processing glycomics and glycoproteomics mass spectral data
Project Number5U01CA221234-03
Former Number1U01OD024841-01
Contact PI/Project LeaderZAIA, JOSEPH
Awardee OrganizationBOSTON UNIVERSITY MEDICAL CAMPUS
Description
Abstract Text
Analytical methods for liquid chromatography-mass spectrometry (LC-MS) of glycans and glycopepeptides are
mature and available to biomedical scientists with access to mass spectrometry instruments. The primary
barrier to the adoption of such methods by non-glycoscientists is the lack of tools for processing the raw LC-
MS data, evaluating the results, and results visualizations that present those findings in a systematic fashion.
Academic research groups have defined algorithmic approaches for interpretation mass spectra in the
glycoscience domain. Despite this effort, there remains a paucity of practical, user accessible tools for
processing glycan and glycopeptide mass spectral data. The first problem is that public domain and
commercial proteomics workflows make peptide-specific assumptions that exclude released glycans and
glycosylated peptides. The second is that available glycoscience mass spectral analysis tools, do not address
the full scope of data processing needed for high throughput LC-MS data sets. Thus, users with access to
mass spectrometry through local proteomics core facilities do not have access to adequate software tools for
processing the liquid-chromatography-mass spectrometry (LC-MS) datasets. The result is a serious bottleneck
for dissemination and use of glycoscience LC-MS methods in biomedicine.
We have developed three software programs for glycoscience mass spectrometry that we now propose to
engineer and disseminate as accessible solutions for biomedical scientists. GlycReSoft processes glycan
profiling and integrated omics of glycoproteins. GAGfinder is a second generation algorithm for sequencing
glycosaminoglycans from electron activated dissociation (ExD) tandem mass spectra. GlycoDeNovo
determines branched glycan topologies and linkage configurations from ExD tandem mass spectra. We
propose to engineer and disseminate open source pipelines for processing glycan and glycopeptide LC-MS
data. This will include all modules necessary to process vendor-independent mass spectrometry data, perform
LC-MS profiling, tandem mass spectrometry assignments and generate publication quality graphical outputs.
We will deliver software that builds on available data standards and MS processing libraries. The software will
use modern glycan representation standards and communicate with the international glycan structure
repository and namespace (GlyTouCan). Users will have the option of running the software as a desktop
application, through a server, or as a GRITS Toolbox plugin. The software tools will be available open source
through a Github repository. In Aim 1, we will develop our GlycReSoft program and related tools into a
complete solution for glycomics and glycoproteomics LC-MS data analysis. In Aim 2, we will develop
GAGfinder software into a complete solution for sequencing glycosaminoglycans from ExD tandem mass
spectra. In Aim 3, we will develop a pipeline based on our GlycoDeNovo software for sequencing N- and O-
glycans.
Public Health Relevance Statement
Project narrative
The present lack of available tools for processing glycoscience mass spectral data sets into biomedical
inference is a serious bottleneck to the democratization of glycoscience technologies. We propose solve this
problem by engineering user-accessible open source pipelines for N- and O-glycans, glycosaminoglycans, and
glycopeptides, respectively.
NIH Spending Category
BioengineeringNetworking and Information Technology R&D (NITRD)
Project Terms
AdoptionAlgorithmsChondroitin SulfatesClientCommunitiesComplexComputer softwareCore FacilityDataData AnalysesData SetDevelopmentDissociationElectron TransportElectronsEngineeringEnvironmentFourier transform ion cyclotron resonanceGenerationsGenesGlycopeptidesGlycoproteinsGlycosaminoglycansHeparitin SulfateImageryIndividualInternationalIsomerismKeratan SulfateLibrariesMass Spectrum AnalysisMethodsModernizationOutputPeptidesPolysaccharidesProblem SolvingProcessProteomicsPublic DomainsPublicationsResearchResourcesRunningSamplingSoftware ToolsStructureTechnologyTranslatingVendorVisualanalytical methodbasebiomedical scientistcohortcommercializationcomputerized data processingdata acquisitionexperimental studyglycoproteomicsgraphical user interfaceinstrumentinterestliquid chromatography mass spectrometryopen sourceoperationpolymerizationprogramsrepositorysoftware developmenttandem mass spectrometrytoolweb server
Bioengineering; Networking and Information Technology R&D (NITRD)
Sub Projects
No Sub Projects information available for 5U01CA221234-03
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 5U01CA221234-03
Patents
No Patents information available for 5U01CA221234-03
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 5U01CA221234-03
Clinical Studies
No Clinical Studies information available for 5U01CA221234-03
News and More
Related News Releases
No news release information available for 5U01CA221234-03
History
No Historical information available for 5U01CA221234-03
Similar Projects
No Similar Projects information available for 5U01CA221234-03