Top-Down Characterization of Nucleic Acids Modified by Structural Probes Using High-Resolution Tandem Mass Spectrometry and Automated Data Interpretation

Abstract
A top-down approach based on sustained off-resonance irradiation collision-induced dissociation (SORI-CID) has been implemented on an electrospray ionization (ESI) Fourier transform mass spectrometer (FTMS) to characterize nucleic acid substrates modified by structural probes. Solvent accessibility reagents, such as dimethyl sulfate (DMS), 1-cyclohexyl-3-(2-morpholinoethyl)carbodiimide metho-p-toluenesulfonate (CMCT), and β-ethoxy-α-ketobutyraldehyde (kethoxal, KT) are widely employed to reveal the position of single- vs double-stranded regions and obtain the footprint of bound proteins onto nucleic acids structures. Established methods require end-labeling of the nucleic acid constructs, probe-specific chemistry to produce strand cleavage at the modified nucleotides, and analysis by polyacrylamide gel electrophoresis to determine the position of the susceptible sites. However, these labor-intensive procedures can be avoided when mass spectrometry is used to identify the probe-induced modifications from their characteristic mass signatures. In particular, ESI-FTMS can be directly employed to monitor the conditions of probe application to avoid excessive alkylation, which could induce unwanted distortion or defolding of the substrate of interest. The sequence position of the covalent modifications can be subsequently obtained from classic tandem techniques, which allow for the analysis of individual target adducts present in complex reaction mixtures with no need for separation techniques. Selection and activation by SORI-CID has been employed to reveal the position of adducts in nucleic acid substrates in excess of 6 kDa. The stability of the different covalent modifications under SORI-CID conditions was investigated. Multiple stages of isolation and activation were employed in MSn experiments to obtain the desired sequence information whenever the adduct stability was not particularly favorable, and SORI-CID induced the facile loss of the modified base. A new program called MS2Links was developed for the automated reduction and interpretation of fragmentation data obtained from modified nucleic acids. Based on an algorithm that searches for plausible isotopic patterns, the data reduction module is capable of discriminating legitimate signals from noise spikes of comparable intensity. The fragment identification module calculates the monoisotopic mass of ion products expected from a certain sequence and user-defined covalent modifications, which are finally matched with the signals selected by the data reduction program. Considering that MS2Links can generate similar fragment libraries for peptides and their covalent conjugates with other peptides or nucleic acids, this program provides an integrated platform for the structural investigation of protein−nucleic acid complexes based on cross-linking strategies and top-down ESI-FTMS.

This publication has 32 references indexed in Scilit: