๐Ÿงช Genomics

Two Labs Just Cracked the Size Limit on Gene Editing. The Old Ceiling Was 800 Base Pairs. The New One Is 11,000.

Independent teams in Ohio and Beijing simultaneously solved the same bottleneck that has stalled gene therapy for five years, inserting up to 11,000 base pairs of healthy DNA without breaking both strands. An original regulatory math analysis reveals why this collapses thousands of separate FDA approvals into one and shifts the field from fixing mutations to replacing genes.

Abstract molecular visualization of DNA double helix with a large glowing segment of new genetic code being inserted, navy and teal palette with bioluminescent gold accents

Eleven thousand base pairs. Not kilobases. Not a loose estimate. Eleven thousand individual nucleotides of new DNA, inserted into a living human cell without making a single double-strand break, according to a paper published this week in Nature. A second team at the Chinese Academy of Sciences, publishing simultaneously in Nature Biomedical Engineering, achieved the same feat through a completely different mechanism, reaching 85 percent efficiency and integrating CAR constructs into 50 percent of primary human T cells without a virus. Five years ago, the ceiling for this kind of insertion was around 800 base pairs, and even that number required dividing cells and favorable conditions. Both groups blew past the old limit by an order of magnitude, working independently, on different continents, and arriving at the same conclusion within weeks of each other.

That convergence matters more than the numbers themselves, because when two unrelated labs solve the same problem simultaneously, it is usually because the enabling pieces fell into place at the same time, not because one group got lucky. And the problem they solved is not an incremental improvement but a category shift from gene editing to gene replacement, from correcting single typos in the genome to swapping out entire paragraphs of damaged code for healthy copies.

Why the Size Limit Mattered So Much

CRISPR was never the bottleneck. Not really. Since 2012, researchers have been able to cut DNA with surgical precision, snip out a defective sequence, and let the cell's repair machinery patch the gap. Base editing, introduced in 2016, could swap a single nucleotide without cutting both strands. Prime editing, published in 2019 by David Liu's lab at the Broad Institute, extended that to inserting or deleting roughly 40 to 80 base pairs with high precision. Subsequent improvements pushed the ceiling to around 800 base pairs in favorable conditions.

But here is the problem: the median human gene coding sequence is approximately 1,300 base pairs, and many disease-relevant genes are far larger. Dystrophin, the gene mutated in Duchenne muscular dystrophy, spans 2.4 million base pairs with a coding sequence of 11,058. You cannot fix a 14,000-nucleotide deletion by editing 800 base pairs at a time, which means you need to replace the whole segment in a single operation. And until May 2026, doing that required double-strand breaks, which trigger p53-mediated cell death, chromosomal translocations, and the kind of off-target chaos that makes any FDA reviewer reach for the rejection stamp.

Two Roads to the Same Destination

Bin Liu's group at Ohio State and UMass Chan took an approach they call prime assembly, and its elegance lies in borrowing a trick from synthetic biology. Twin prime editing guide RNAs create overlapping single-stranded flaps at the target site, essentially preparing two matching edges of a molecular jigsaw puzzle. Linear DNA donors, whose ends are designed to overlap with those flaps, are then inserted through a Gibson-like assembly reaction that happens inside the cell. No exogenous DNA polymerases are needed, no homology-directed repair pathway is required, and because the technique avoids double-strand breaks entirely, it works in non-dividing cells like neurons and cardiomyocytes, which is critical because most cells in an adult human body have stopped dividing. Adding an NHEJ inhibitor called AZD-7648 further enhances both efficiency and precision, bringing the error rate down to levels that might survive regulatory scrutiny. Liu's team demonstrated insertions ranging from 100 base pairs to 11,000, including a full dystrophin gene construct and a complete CAR cassette for cancer immunotherapy (Figures 3 and 4 of the paper).

In Beijing, a separate group at the Chinese Academy of Sciences developed a method called PRIME-In. Their strategy uses prime editing to create short stretches of microhomology between the genomic target and a plasmid donor, then deploys paired genomic nicks to drive DSB-free knockin. Cargo sizes ranged from 2 to 9.6 kilobases, with efficiency hitting 85 percent in cell lines and roughly 50 percent CAR construct integration in primary human T cells without any viral vector, a result that has immediate implications for manufacturing because current CAR-T production depends on lentiviral or retroviral transduction, a process that costs hundreds of thousands of dollars per patient and takes weeks to complete.

The Regulatory Math That Changes Everything

Here is the original calculation that, as far as I can find, nobody has run publicly. Duchenne muscular dystrophy has over 7,000 known pathogenic mutations scattered across the dystrophin gene. Under the current gene therapy paradigm, each distinct correction requires its own preclinical development, its own IND filing, and its own FDA review. As of May 2026, the FDA has approved a total of 12 gene therapies for any condition, which means that covering just 10 percent of DMD mutations at current approval rates would take centuries.

Prime assembly changes the arithmetic entirely, because instead of correcting mutation 4,271 in exon 47 with one bespoke edit, and mutation 6,893 in exon 64 with another bespoke edit, and repeating that process for each of the remaining 6,998 variants, you replace the entire mutated segment with a healthy copy. One insertion covers every mutation in that region. For DMD specifically, Liu's team demonstrated exactly this: a full dystrophin gene insertion, which means one therapeutic construct could theoretically treat all 7,000-plus DMD mutation carriers rather than requiring 7,000 separate products.

Scale that logic across rare diseases globally and the numbers become almost absurd. The WHO estimates that more than 7,000 rare diseases affect 300 to 400 million people worldwide. Approximately 72 percent are genetic, and about 80 percent of those are monogenic, caused by a single defective gene, yielding roughly 4,000 monogenic diseases. For any of them where the causal gene's coding sequence fits within 11 kilobases, and the median human coding sequence is 1.3 kilobases so the vast majority do, prime assembly is at least theoretically applicable. Current gene therapies serve a single-digit number of these conditions, so the gap between 4,000 addressable diseases and 12 approved therapies is the chasm these two papers begin to bridge.

What About CAR-T Manufacturing?

PRIME-In's 50 percent non-viral CAR integration efficiency deserves its own section because the economics are staggering. Potentially transformative. Exa-cel (marketed as Casgevy), the first CRISPR-based therapy approved by the FDA, costs $2.2 million per patient and requires myeloablative conditioning, which means destroying the patient's bone marrow before reinfusing edited cells, a process so brutal that many patients who qualify on genetic grounds are excluded on physical ones. Much of that cost comes from viral vector manufacturing and the cleanroom infrastructure needed to produce patient-specific viral transductions.

If you can integrate a CAR construct into half of a patient's T cells using a plasmid and a pair of guide RNAs, you have eliminated the most expensive and time-consuming step in the entire pipeline. Gone. Plasmid DNA costs a fraction of what GMP-grade lentivirus costs. You still need autologous cell processing, quality control, and clinical oversight, but the manufacturing bottleneck shifts from molecular biology to logistics, which is a problem the pharmaceutical industry actually knows how to solve at scale.

Limitations

Neither paper has tested these techniques in living animals, let alone humans. That is the caveat that should sit in bold at the top of every press release about these results, and it does not. Prime assembly's 11,000 base pair insertions were demonstrated in cell culture, and efficiency numbers at the upper end of the size range remain modest. PRIME-In's 85 percent efficiency was measured in cell lines; the 50 percent CAR-T figure comes from primary human T cells ex vivo, not from an infusion into a patient. Delivery remains a fundamental unsolved problem for in vivo gene therapy: getting the editing machinery into the right cells, in the right tissue, at therapeutic concentrations, without immune rejection. Liu's group plans to test lipid nanoparticle and AAV delivery vehicles next, but those experiments have not started. Regulatory timelines from first-in-cell to first-in-human typically span five to eight years at minimum, so nobody is getting this treatment in 2027.

Strongest Counterargument

Efficiency numbers in cell culture rarely survive the transition to therapeutic reality. Gene therapy has a graveyard of techniques that worked beautifully in a dish and failed in patients because of immune responses, off-target editing at rates too low to detect in small-scale experiments, or delivery losses that reduced effective doses by orders of magnitude. Jesse Gelsinger died in 1999 from an immune reaction to the adenoviral vector used to deliver a gene therapy that had shown spectacular preclinical results. He was eighteen. More recently, several AAV-based therapies have caused serious liver toxicity in clinical trials despite clean safety profiles in mice, reinforcing a pattern that has repeated so many times in this field that veteran gene therapy researchers have a sardonic shorthand for it: "mice lie." Claiming that a 50 percent ex vivo efficiency number translates directly to a viable commercial therapy is the same mistake this field has made before, and no paper, regardless of how elegant the mechanism, should be treated as proof of clinical utility until it survives a Phase I trial.

What You Can Do

If you or a family member has a monogenic rare disease, check whether the causal gene has been identified and whether its coding sequence is smaller than 11 kilobases. Resources like OMIM (Online Mendelian Inheritance in Man) and ClinVar at NCBI provide free lookup by disease name. If the gene fits, these techniques are now at least theoretically capable of addressing it, and you should track the labs involved: Bin Liu at Ohio State and the groups at the Chinese Academy of Sciences who published the PRIME-In work. Neither has announced clinical development timelines, but both have published enough preclinical data that licensing deals or startup spinoffs are likely within the next one to two years.

If you work in biotech or pharmaceutical manufacturing, the PRIME-In non-viral integration result is the number to watch. A plasmid-based CAR-T manufacturing process at 50 percent efficiency could cut per-patient costs by an order of magnitude and reduce turnaround time from weeks to days. Whether that number holds up under GMP conditions is the billion-dollar question, and the team that answers it first will own the manufacturing playbook for the next decade of cell therapy.

If you are an investor evaluating the CRISPR sector, which analysts project will grow from $3.65 billion in 2026 to $8.28 billion by 2030 at a 22.7 percent CAGR, these two papers shift the addressable market calculation. Gene replacement for monogenic diseases is a fundamentally larger opportunity than single-nucleotide correction, and the competitive field now has two independent, patent-distinct approaches racing toward the same applications.

The Bottom Line

For five years, gene editing could fix typos but not rewrite paragraphs. That era is over. Bin Liu's metaphor is apt: "If we think of the genome as a book, we can remove one paragraph and replace it with a new one, or even rewrite a chapter." Two independent teams have now demonstrated that paragraph-level rewriting works, that it does not require the dangerous double-strand breaks that have limited every prior attempt, and that it functions in the cell types that actually matter for human disease, from quiescent neurons in the brain to cardiomyocytes in the heart to the primary T cells that form the backbone of cancer immunotherapy. Clinical reality is still years away, and this field has earned its skeptics through decades of broken promises and tragedies that cost actual lives. But the math has changed. When 7,000 mutations collapse into one replacement, and when viral vectors become optional rather than mandatory, the bottleneck shifts from molecular biology to manufacturing and regulation. Those are solvable problems. That is what makes this week different.