ProForma: a standardized notation for writing proteoform sequences

IUPAC defines the notation for writing protein sequences, but there is no equivalent definition for proteoform sequences, which combine amino acids with post-translational modifications (PTMs) at specific positions. Such a notation is important for communicating the results of protein and proteoform analyses, so I worked with a subcommittee of the Consortium for Top-Down Proteomics (CTDP) to define a standard notation for writing proteoform sequences. I also contribute to a software development kit, written in C#, that implements this notation, including recent extensions for ambiguity in modification localization.
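To make the idea concrete, here is a minimal Python sketch of reading a sequence in which a modification name in square brackets follows the residue it modifies, as in EM[Oxidation]EVEES[Phospho]PEK. It is not the C# development kit mentioned above and covers only this one simple case; the function name parse_simple_proforma is hypothetical, and terminal modifications, mass shifts, and ambiguity syntax are deliberately rejected rather than handled.

```python
import re

# Illustrative subset only: one uppercase residue, optionally followed by
# a single [tag]. This is an assumption-laden sketch, not a full parser.
TOKEN = re.compile(r"([A-Z])(?:\[([^\[\]]+)\])?")


def parse_simple_proforma(text):
    """Return (bare_sequence, {1-based position: tag}) for a simple tagged sequence."""
    residues = []
    modifications = {}
    pos = 0
    for match in TOKEN.finditer(text):
        # Any gap between consecutive tokens means syntax outside this subset.
        if match.start() != pos:
            raise ValueError(f"unsupported syntax at index {pos}")
        pos = match.end()
        residues.append(match.group(1))
        if match.group(2) is not None:
            modifications[len(residues)] = match.group(2)
    if pos != len(text):
        raise ValueError(f"unsupported syntax at index {pos}")
    return "".join(residues), modifications


sequence, mods = parse_simple_proforma("EM[Oxidation]EVEES[Phospho]PEK")
print(sequence)  # bare amino acid sequence
print(mods)      # modification names keyed by residue position
```

Keeping the bare sequence separate from a position-to-modification map reflects the point of the notation: the same amino acid sequence can correspond to many proteoforms that differ only in which residues carry which PTMs.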


Research Papers

(1) LeDuc, R. D.; Deutsch, E. W.; Binz, P. A.; Fellers, R. T.; Cesnik, A. J.; Klein, J. A.; Van Den Bossche, T.; Gabriels, R.; Yalavarthi, A.; Perez-Riverol, Y.; Carver, J.; Bittremieux, W.; Kawano, S.; Pullman, B.; Bandeira, N.; Kelleher, N. L.; Thomas, P. M.; Vizcaíno, J. A. “Proteomics Standards Initiative’s ProForma 2.0: Unifying the Encoding of Proteoforms and Peptidoforms.” Journal of Proteome Research 2022, 21, 1189–1195. PMC7612572.

(2) LeDuc, R.*; Schwämmle, V.*; Shortreed, M. R.*; Cesnik, A. J.*; Solntsev, S. K.*; Shaw, J.*; Martin, M. J.; Vizcaíno, J. A.; Alpi, E.; Danis, P.; Kelleher, N. L.; Smith, L. M.; Ge, Y.; Agar, J. N.; Chamot-Rooke, J.; Loo, J.; Paša-Tolić, L.; Tsybin, Y. O. “ProForma: a Standard Proteoform Notation.” Journal of Proteome Research 2018, 17, 1321–1325. PMC5837035. *Contributed equally
