Journal article
Expanding the human proteome with microproteins and peptideins
- Abstract:
- A major scientific drive is to characterize the protein-coding genome, which is a primary basis for studying human health. But the fundamental question remains of what has been missed in previous analyses. Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states1, 2–3, with major implications for biomedical science. However, a key gap in knowledge has been which ncORFs produce small microproteins or alternative protein molecules that contribute to the human proteome. Here we report the collaborative efforts of the TransCODE Consortium4 to produce a consensus landscape of protein-level evidence for ncORFs. We show that about 25% of a set of 7,264 ncORFs gives rise to detectable peptides in a large-scale analysis of 95,520 proteomics experiments. We develop an annotation framework for ncORF-encoded microproteins as human proteins and codify the new conceptual model of ‘peptideins’ as microproteins that have indeterminate potential as functional proteins. To probe the biological implications of peptideins, we create an evolutionary analysis approach, termed ORF relative branch length (ORBL), and determine that evolutionary constraint is common and associates with observation of ncORF-derived peptides. We then characterize a pan-essential cellular phenotype for one peptidein from the OLMALINC long non-coding RNA. Overall, we generate public research tools supported by GENCODE and PeptideAtlas and advance biomedical discovery for understudied components of the human proteome.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 19.4MB, Terms of use)
-
(Supplementary materials, Terms of use)
-
- Publisher copy:
- 10.1038/s41586-026-10459-x
Authors
- Publisher:
- Nature Research
- Journal:
- Nature More from this journal
- Volume:
- 654
- Issue:
- 8119
- Pages:
- 813-825
- Publication date:
- 2026-05-06
- Acceptance date:
- 2026-03-27
- DOI:
- EISSN:
-
1476-4687
- ISSN:
-
0028-0836
- Language:
-
English
- Keywords:
- Source identifiers:
-
4240874
- Deposit date:
-
2026-06-17
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2026
If you are the owner of this record, you can report an update to it here: Report update to this record