What is the functional evidence for peptideins (microproteins from non-coding genomic regions) as essential regulators of cancer cell viability, and which candidates show druggability?

What is the functional evidence for peptideins (microproteins from non-coding genomic regions) as essential regulators of cancer cell viability, and which candidates show druggability?
What is the functional evidence for peptideins (microproteins from non-coding genomic regions) as essential regulators of cancer cell viability, and which candidates show druggability?
BioSkepsis

Summary
Systematic CRISPR/Cas9 loss-of-function screens and multi-omic integrative analyses have identified hundreds of non-canonical microproteins (peptideins) that are essential for cancer cell survival and proliferation. These regulators function through diverse mechanisms, including protein-protein interaction (PPI) scaffolding, metabolic reprogramming, and RNA splicing modulation, with several candidates showing promise for targeted therapies using small molecules, inhibitory peptides, or immunotherapy.

Large-Scale Functional Screening for Essentiality

Comprehensive functional genomics has revealed that approximately 10% to 17% of non-canonical open representation frames (ncORFs) are required for cancer cell fitness (Direct, High; PMID: 33510483, PMID: 39379206).
* Essentiality Rates: In a library of 553 sORFs, 57 were identified as essential for cancer cell survival, a rate comparable to canonical protein-coding genes (Direct, High; PMID: 33510483).
* Subtype Specificity: Screens in medulloblastoma identified over 6,700 non-canonical ORFs, with a subset of upstream ORFs (uORFs) being uniquely essential in MYC-driven tumors (Direct, High; PMID: 38176414).
* Validation of Translation: While thousands of sORFs are identified via Ribo-seq, only a fraction show stable protein-level expression. For instance, in a curated set of 7,264 ncORFs, approximately 25% were supported by high-quality mass spectrometry (MS) evidence (Direct, High; DOI: 10.20935/acadmolbiogen7798).

Essential Oncogenic Regulators (Pro-Viability)

Multiple microproteins have been characterized as drivers of cancer progression through specific molecular axes.
* GREP1 (G029442): Identified as a major breast cancer vulnerability, this glycine-rich secreted microprotein regulates the oncogenic cytokine GDF15 to maintain cell viability (Direct, High; PMID: 33510483).
* APPLE (ASH1L-AS1): Expressed from an antisense lncRNA, APPLE promotes MAPK signaling in hepatocellular carcinoma (HCC) by binding to ERK1/2 and preventing its dephosphorylation by PP1/PP2A (Direct, High; PMID: 40646641). It also promotes translation initiation of oncoproteins in acute myeloid leukemia (AML) (Direct, High; PMID: 39135098).
* ASNSD1-uORF (ASDURF): This uORF-encoded microprotein is essential for the survival of MYC-driven medulloblastoma by acting as a subunit of the prefoldin-like (PFDL) chaperone complex, coordinating post-transcriptional proteome regulation (Direct, High; PMID: 38176414).
* SMIMP (ELFN1-AS1): A primate-specific lncRNA-encoded microprotein that promotes colorectal cancer (CRC) by binding to the cohesin subunit SMC1A, leading to the epigenetic repression of tumor suppressors CDKN1A and CDKN2B (Direct, High; DOI: 10.20935/acadmolbiogen7798).
* TPM3P9: This microprotein is upregulated across various cancers and drives ccRCC growth by binding to the splicing factor RBM4, promoting the oncogenic splicing variant of TCF7L2 (Direct, High; PMID: 39865075).

Essential Tumor Suppressors (Inhibitory Regulators)

Dysregulation or loss of certain microproteins facilitates cancer progression.
* pTINCR: An ubiquitin-like microprotein that suppresses tumor growth in squamous cell carcinoma by enhancing the SUMOylation and activation of CDC42, thereby promoting epithelial differentiation (Direct, High; PMID: 36369429).
* HCP5-132aa: Although it can act as an oncoprotein in some contexts, it regulates ferroptosis in gastric cancer by stabilizing SLC7A11 and G6PD mRNAs through the YBX1/ELAVL1 complex (Direct, High; PMID: 39447131).
* ASRPS: Encoded by LINC00908, this peptide inhibits STAT3 phosphorylation, thereby reducing VEGF production and tumor angiogenesis in triple-negative breast cancer (TNBC) (Direct, High; PMID: 40279117).

Druggability and Therapeutic Strategies

Research candidates are increasingly evaluated for clinical translation via several modalities.
* PPI Disruption: Small molecules that disrupt microprotein-protein interactions, such as those involving APPLE-ERK-PP1/PP2A, are identified as potential precision therapy strategies (Direct, High; PMID: 40646641).
* Peptide Mimetic/Inhibitors: A peptide inhibitor targeting the MAVI1-MAVS interaction (PiMAVI1) has been shown to reactivate type I interferon signaling for antiviral and potentially anti-tumor effects (Direct, High; PMID: 40245464).
* Secreted Targets: Microproteins like GREP1 (breast cancer) and MPEP (glioblastoma) are secreted, making them accessible to neutralizing antibodies (Derived, Medium; PMID: 33510483, PMID: 40170020).
* Immunotherapy: ncORFs are a major source of tumor-specific antigens (TSAs) in tumors with low mutational burden, such as HCC. Peptides derived from AC079466.1 and AC098820.3 show strong immunogenicity in HLA-transgenic models (Direct, High; PMID: 38985879).
* Gene Delivery: AAV-mediated delivery of CRISPR components to knock out oncogenic microproteins like HCP5-132aa significantly reduced tumor volume in patient-derived xenograft (PDX) models (Direct, High; PMID: 39447131).

How do the physical interactions of SMIMP with the cohesin complex impact the stability of the 3D chromatin architecture in colorectal cancer?

What are the structural requirements for the APPLE microprotein to selectively inhibit PP1/PP2A phosphatases without affecting other cellular dephosphorylation events?

Which experimental methodologies are currently most effective for identifying cryptic antigens from non-canonical ORFs that can successfully evade immune self-tolerance?


Unverified Citations

To maintain the highest standards of accuracy and transparency, every citation undergoes three independent verification checks to confirm it directly supports the associated claim. The references below did not satisfy all verification stages. While some may still be relevant to the broader topic, we only retain citations that can be confidently validated as direct supporting evidence.

  • PMID:41033461For instance, in a curated set of 7,264 ncORFs, approximately 25% were supported by high-quality mass spectrometry (MS) ...
    Failed: conclusion — The paper mentions the GENCODE initiative but does not report the quantitative finding that 25% of 7,264 ncORFs were supported by MS evidence; that specific finding is in paper index 1.
Generate an empirically testable hypothesis
BioSkepsis

Hypothesis 1

The SF3B1-SRSF2 splicing axis functions as a universal stoichiometric regulator of cancer cell viability by facilitating a transcriptomic switch from canonical tumor-suppressive proteins to pro-survival scaffolding microproteins (peptideins), where mutation or reduced binding of these splicing factors enables the translation of essential regulators like DEDD2-SEP and APPLE that would otherwise be repressed by canonical start codons.

Mechanistic rationale

  • Ribosome profiling and mass spectrometry have identified thousands of novel open reading frames (ORFs) within regions previously annotated as non-coding, including long non-coding RNAs (lncRNAs) and 5' untranslated regions (UTRs). (Derived, Low; PMID: 19213877, PMID: 23160002, PMID: 24705786, PMID: 26687005, PMID: 32139545)
  • Large-scale CRISPR loss-of-function screens have demonstrated that approximately 10% to 30% of these non-canonical microproteins are essential for cancer cell survival and proliferation across diverse lineages. (Derived, Low; PMID: 33510483, PMID: 38176414, PMID: 40124493, PMID: 39379206)
  • The translation of many essential microproteins, such as the anti-apoptotic regulator DEDD2-SEP, is dependent on the production of alternative transcript isoforms that exclude canonical upstream start codons to bypass translational repression. (Derived, Low; PMID: 39484585, PMID: 39753408)
  • The SF3B1-SRSF2 splicing axis specifically controls the production of these alternative isoforms; for example, SRSF2 mutations or reduced binding favor the skipping of canonical exons to produce microprotein-encoding transcripts. (Derived, Medium; PMID: 39484585, PMID: 10.1182/blood-2024-208361, PMID: 37725512)
  • Once translated, these microproteins function as 'stoichiometric adaptors' or scaffolding molecules for critical oncogenic complexes, such as the MAPK axis (APPLE), cohesin (SMIMP), and the PAQosome (ASDURF), to maintain cell viability. (Direct, High; PMID: 40646641, PMID: 37932451, PMID: 38176414)

Predictions

  • Mutations in SF3B1 or knockdown of SRSF2 will lead to a significant increase in the ratio of microprotein-encoding alternative transcripts (e.g., DEDD2-SEP) relative to their canonical counterparts across multiple solid tumor cell lines. (Derived, Low; PMID: 39484585, PMID: 10.1182/blood-2024-208361)
  • The physical interaction between BAG6 and the C-terminal hydrophobic domains of these microproteins will be required for their correct scaffolding at signaling hubs like the ER or nucleus. (Indirect, Low; PMID: 39753408, PMID: 40245354)
  • Pharmacological restoration of canonical splicing will result in a decrease in microprotein-scaffolded complexes and a subsequent loss of viability in tumors overexpressing peptideins like APPLE or SMIMP. (Derived, Medium; PMID: 40646641, PMID: 37932451, PMID: 39379206)

Study design

Using isogenic cancer cell models (e.g., Nalm-6 SF3B1-WT vs. K700E), the investigator will perform long-read ISO-seq to quantify transcript ratios, followed by TMT-based quantitative proteomics and Ribo-seq to confirm microprotein levels. Perturbations will include siRNA against SRSF2 and overexpression of start-codon mutant cDNAs to assess the necessity of the microprotein vs. the transcript, with readouts including complex assembly (co-IP) and cell fitness (CRISPR-dropout scores). (Derived, Low; PMID: 10.1182/blood-2024-208361, PMID: 39484585, PMID: 37371559)

Confounders & controls

  • Off-target effects of CRISPR gRNAs at gene promoters will be controlled by targeting non-coding regions and using appropriate controls. (Direct, High; PMID: 38176414)
  • Phenotypic rescue experiments will utilize start-codon mutants (ATG to ATT/AGG) to confirm that effects are mediated by the protein product rather than the RNA molecule. (Derived, Low; PMID: 32139545, PMID: 39447131, PMID: 39865075)

Risks/limitations

  • Current mass spectrometry technologies may under-detect low-abundance microproteins or miss protective isoforms that are rapidly processed into HLA antigens. (Derived, Low; PMID: 40245354, PMID: 38985879, PMID: 37275273)
  • Alternative transcript isoforms are inconsistently annotated across genomic resources (NCBI vs. GENCODE), potentially leading to mis-mapping of functional sORFs. (Derived, Low; PMID: 39484585, PMID: 38013207)

Falsification criteria

  • The hypothesis is falsified if SF3B1 mutations do not alter the ratio of microprotein-encoding transcripts, or if the essentiality phenotype persists upon expression of translation-deficient (start-codon mutant) alternative transcripts. (Derived, Medium; PMID: 39484585, PMID: 10.1182/blood-2024-208361)

Unverified Citations

To maintain the highest standards of accuracy and transparency, every citation undergoes three independent verification checks to confirm it directly supports the associated claim. The references below did not satisfy all verification stages. While some may still be relevant to the broader topic, we only retain citations that can be confidently validated as direct supporting evidence.

  • PMID: 39865075Once translated, these microproteins function as 'stoichiometric adaptors' or scaffolding molecules for critical oncogen...
    Failed: conclusion — The paper characterizes TPM3P9 and its role in RNA splicing rather than functioning as a stoichiometric adaptor for the MAPK, cohesin, or PAQosome complexes.
  • PMID: 33510483Off-target effects of CRISPR gRNAs at gene promoters will be controlled by targeting non-coding regions and using non-cu...
    Failed: entities — The paper does not mention 'LacZ' controls or specific strategies for controlling promoter effects via LacZ in the provided text.
Generate an empirically testable hypothesis
BioSkepsis

Hypothesis 2

Cancer-associated mutations in the SF3B1-SRSF2 splicing axis drive cell viability by establishing a stoichiometric 'microprotein-enforced' signaling hub, where the preferential production of alternative transcript isoforms enables the synthesis of essential microproteins like APPLE or DEDD2-SEP that competitively displace canonical regulatory subunits from protein complexes, thereby preventing signaling deactivation.

Mechanistic rationale

  • Alternative splicing and transcription start site usage are specifically required to exclude canonical upstream start codons that would otherwise repress the translation of internal, pro-survival microproteins. (Derived, Medium; PMID: 39484585)
  • Mutations in SF3B1 drive aberrant lncRNA biogenesis and the production of neopeptides that are essential for the survival of leukemia cells. (Direct, High; PMID: 10.1182/blood-2024-208361)
  • Once translated, these microproteins (such as APPLE or EMBOW) function by binding to critical interfaces on scaffolding proteins like WDR5 or ERK/PP2A, displacing canonical ligands to sustain oncogenic signaling. (Derived, Medium; PMID: 40646641, PMID: 37725512)

Predictions

  • Disruption of the alternative transcript isoform production (via restoration of canonical splicing) will lead to the rapid proteasomal degradation of the microprotein pool through the BAG6 mitigation pathway. (Indirect, Low; PMID: 39753408, PMID: 40245354)

Study design

Using isogenic Nalm-6 or HCC cell lines with SF3B1-K700E mutations, we will perform long-read ISO-seq to quantify microprotein-encoding transcripts, coupled with TMT-based quantitative proteomics to measure microprotein:target occupancy ratios. We will then perform co-immunoprecipitation and competition assays to determine if microprotein displacement by small-molecule WIN-site or MAPK inhibitors restores canonical dephosphorylation and apoptotic induction. Phenotypic dependency will be validated by knocking out the start codon of the alternative transcript while preserving the RNA structure. (Derived, Medium; PMID: 10.1182/blood-2024-208361, PMID: 40646641, PMID: 37725512)

Confounders & controls

  • Start-codon mutations (ATG to ATT/AGG) in alternative transcript expression vectors will serve as the primary control to ensure that pro-survival effects are protein-dependent and not mediated by lncRNA scaffolds. (Derived, Medium; PMID: 32139545, PMID: 39447131)

Falsification criteria

  • The hypothesis is falsified if microprotein-encoding isoforms are produced at equivalent levels in wild-type cells but are translationally silent, or if pharmacological activation of PP1/PP2A fails to induce cell death in tumors overexpressing APPLE. (Derived, Medium; PMID: 40646641)

Unverified Citations

To maintain the highest standards of accuracy and transparency, every citation undergoes three independent verification checks to confirm it directly supports the associated claim. The references below did not satisfy all verification stages. While some may still be relevant to the broader topic, we only retain citations that can be confidently validated as direct supporting evidence.

  • PMID: 38013207Alternative splicing and transcription start site usage are specifically required to exclude canonical upstream start co...
    Failed: conclusion — This paper is a review of ribosome profiling methodology and does not contain primary or derived evidence regarding the specific requirement of alternative splicing to bypass upstream start codons for microprotein expression.
  • PMID: 40646641Quantification of the stoichiometric ratio of microprotein to canonical complex subunits will reveal that higher micropr...
    Failed: conclusion — The paper does not quantify a stoichiometric ratio of microprotein to canonical complex subunits or demonstrate that this occupancy directly correlates with resistance to MEK or WIN-site inhibitors.
    Possible alternatives (unverified): PMID:39123386 (55% topic match); PMID:39135098 (55% topic match)
  • PMID: 37725512Quantification of the stoichiometric ratio of microprotein to canonical complex subunits will reveal that higher micropr...
    Failed: conclusion — The paper demonstrates that WIN-site inhibitors compete with EMBOW for WDR5 binding, but it does not establish that higher microprotein occupancy correlates with resistance to these inhibitors.
    Possible alternatives (unverified): PMID:39123386 (55% topic match); PMID:39135098 (55% topic match)
  • PMID: 41033461High-throughput mass spectrometry may fail to detect low-abundance microprotein-bound complexes if the turnover rate of ...
    Failed: conclusion — The paper discusses challenges in detecting microproteins and identifies cell-cycle regulation for EMBOW, but it does not suggest that turnover during checkpoints causes MS detection failure.
  • PMID: 40170020High-throughput mass spectrometry may fail to detect low-abundance microprotein-bound complexes if the turnover rate of ...
    Failed: conclusion — The paper summarizes MS and Ribo-seq identification methods but contains no data or discussion regarding accelerated turnover during cell-cycle checkpoints as a cause for detection failure.
  • PMID: 39484585The hypothesis is falsified if microprotein-encoding isoforms are produced at equivalent levels in wild-type cells but a...
    Failed: conclusion — This paper does not discuss the microprotein APPLE or the effects of pharmacological activation of PP1/PP2A on tumor cell death.
Generate an empirically testable hypothesis
BioSkepsis

Hypothesis 3

The BAG6-mediated degradation pathway functions as a selective stoichiometric filter that permits the accumulation of oncogenic scaffolding microproteins (e.g., APPLE, SMIMP, ASDURF) only when their synthesis is driven by alternative transcript isoforms that lack upstream inhibitory open reading frames (uORFs), thereby establishing a targetable 'recoding-dependent' threshold for the persistent activation of signaling hubs in cancer.

Mechanistic rationale

  • Ribosome profiling and proteogenomic analyses have identified thousands of functional unannotated open reading frames (ncORFs) in regions previously deemed non-coding, such as lncRNAs and 5'UTRs. (Derived, Medium; PMID: 32139545, PMID: 19213877, PMID: 23160002, PMID: 37275273)
  • Large-scale CRISPR loss-of-function screens demonstrate that approximately 10-30% of these translated non-canonical microproteins are essential for cancer cell viability and proliferation. (Derived, Medium; PMID: 33510483, PMID: 39379206, PMID: 38176414, PMID: 40124493, PMID: 39753408)
  • The expression of essential microproteins like APPLE, SMIMP, and ASDURF often relies on the production of alternative transcript isoforms that bypass upstream translational inhibitors. (Derived, Medium; PMID: 39484585, PMID: 40646641, PMID: 37932451, PMID: 38176414)
  • These microproteins function as stoichiometric scaffolding molecules that competitively saturate the regulatory binding sites of essential hubs, such as the WDR5 WIN-site or the PP1/PP2A phosphatase groove. (Derived, Medium; PMID: 37725512, PMID: 40646641, PMID: 39135098, PMID: 38176414)
  • The BAG6 mitigation pathway normally triages and degrades non-canonical translation products with hydrophobic C-terminal domains, but oncogenic recoding may override this filter. (Derived, Medium; PMID: 40245354, PMID: 39379206, PMID: 39753408)

Predictions

  • Pharmacological inhibition of BAG6 will lead to the stabilization of microproteins and paradoxically increase the occupancy of oncogenic signaling complexes in microprotein-dependent cell lines. (Indirect, Low; PMID: 40245354, PMID: 39379206)
  • Small-molecule disruption of specific WIN-site or regulatory groove interactions will sensitize microprotein-dependent tumors to standard-of-care MEK or HDAC inhibitors. (Derived, Medium; PMID: 37725512, PMID: 40646641, PMID: 37371559)

Study design

Utilize isogenic Nalm-6 (SF3B1-WT vs K700E) and medulloblastoma cell lines to perform long-read ISO-seq for transcript mapping and TMT-multiplexed proteomics for microprotein quantification. Execute co-immunoprecipitation (co-IP) assays to measure microprotein occupancy at the WDR5 WIN-site and PP1/PP2A grooves. Functional dependency will be validated by CRISPR/Cas9 start-codon knockout (ATG to ATT/AGG) while preserving the host transcript structure, followed by viability assessments (CCK-8, EdU) and rescue with synonymous-codon-recoded microprotein cDNAs. (Derived, Medium; PMID: 37725512, PMID: 38176414, PMID: 32139545, PMID: 37371559)

Confounders & controls

  • Targeting non-coding regions with CRISPR may induce genotoxic stress or promoter-associated toxicity; this will be controlled by using tiling gRNA libraries and non-targeting gRNAs. (Derived, Low; PMID: 38176414, PMID: 33510483)
  • Phenotypic effects attributed to microproteins could stem from the host lncRNA molecule; this will be controlled by overexpressing start-codon mutant transcripts that cannot translate the peptide. (Derived, Medium; PMID: 32139545, PMID: 39865075, PMID: 39447131)

Risks/limitations

  • Mass spectrometry often fails to detect low-abundance microproteins or identifies them only by single peptides, increasing the risk of false negatives. (Derived, Medium; PMID: 40245354, PMID: 33595896, PMID: 40170020, PMID: 38013207)
  • Many identified microproteins lack sequence conservation beyond primates, limiting the utility of mouse models for functional validation of certain candidates. (Derived, Medium; PMID: 36543139, PMID: 10.20935/acadmolbiogen7798, PMID: 38333695)

Falsification criteria

  • The hypothesis is falsified if microprotein accumulation occurs regardless of the presence of upstream uORFs, or if the loss of signaling complex occupancy does not result in a loss of cancer cell viability. (Derived, Medium; PMID: 39484585, PMID: 40646641, PMID: 37725512)

Unverified Citations

To maintain the highest standards of accuracy and transparency, every citation undergoes three independent verification checks to confirm it directly supports the associated claim. The references below did not satisfy all verification stages. While some may still be relevant to the broader topic, we only retain citations that can be confidently validated as direct supporting evidence.

  • PMID: 10.1182/blood-2024-208361Cancer cells expressing SF3B1-mutant splicing machinery will exhibit a higher ratio of microprotein-encoding alternative...
    Failed: conclusion — The paper demonstrates that SF3B1 mutations drive aberrant splicing and lncRNA biogenesis leading to neopeptides, but it does not compare ratios of alternative vs. canonical transcripts nor discuss the role of inhibitory uORFs.
    Possible alternatives (unverified): PMID:33510483 (93% topic match); PMID:37603371 (35% topic match)
  • PMID: 39484585Cancer cells expressing SF3B1-mutant splicing machinery will exhibit a higher ratio of microprotein-encoding alternative...
    Failed: mechanism,entities — The paper characterizes the role of alternative transcripts in expressing iORFs (specifically mentioning DEDD2 and SRSF2) but does not mention SF3B1 mutations or their specific impact on transcript ratios in cancer.
    Possible alternatives (unverified): PMID:33510483 (93% topic match); PMID:37603371 (35% topic match)
Want to take this research further?
Sign up free and the thread will land in your workspace so you can refine the question, ask follow-ups, or branch into related searches.