Optimization of AAV expression cassettes to improve packaging capacity and transgene expression in neurons
- Jun-Hyeok Choi1Email author,
- Nam-Kyung Yu1Email author,
- Gi-Chul Baek1,
- Joseph Bakes1,
- Daekwan Seo1, 2,
- Hye Jin Nam1, 4,
- Sung Hee Baek1, 4,
- Chae-Seok Lim1,
- Yong-Seok Lee3 and
- Bong-Kiun Kaang1Email author
© Choi et al.; licensee BioMed Central Ltd. 2014
Received: 7 February 2014
Accepted: 27 February 2014
Published: 11 March 2014
Adeno-associated virus (AAV) vectors can deliver transgenes to diverse cell types and are therefore useful for basic research and gene therapy. Although AAV has many advantages over other viral vectors, its relatively small packaging capacity limits its use for delivering large genes. The available transgene size is further limited by the existence of additional elements in the expression cassette without which the gene expression level becomes much lower. By using alternative combinations of shorter elements, we generated a series of AAV expression cassettes and systematically evaluated their expression efficiency in neurons to maximize the transgene size available within the AAV packaging capacity while not compromising the transgene expression. We found that the newly developed smaller expression cassette shows comparable expression efficiency with an efficient vector generally used for strong gene expression. This new expression cassette will allow us to package larger transgenes without compromising expression efficiency.
KeywordsAdeno-associated virus WPRE SV40 late polyadenylation signal sequence Neuron
Viral vectors are efficient tools for expressing transgenes in diverse cell types in vitro and in vivo. The type of virus should be selected with care, because they possess advantages and disadvantages, depending on the purpose of the research . Adeno-associated virus (AAV) is known to be one of the most useful viral vectors for transgene delivery because it is not pathogenic, induces a minimal immune response, infects both dividing and non-dividing cells, and the transgene expression persists for a longer time in cells. The genomes of AAV vectors infected into the host cell exist primarily in an extrachromosomal state, not integrating into the host genome. They can be inserted into only a single specific integration site in the human genome, which may be an advantage for stable long-term expression without insertional mutagenesis. In addition, AAV can be readily produced in high titers and deliver transgenes with high efficiency. Adenovirus, herpes simplex virus (HSV), retrovirus, and lentivirus also have their advantages in transgene delivery. However, adenovirus and HSV are not suitable for the purpose of stable expression in non-dividing cells such as neurons, because they only allow transient expression. They also have the disadvantage of inducing high immune responses. Retroviruses only transduce dividing cells, and, in common with lentivirus, have a risk of insertional mutagenesis . For these reasons, AAV vectors are frequently used in basic researches and are attractive candidates for use in gene therapy. AAV has also been the vector of choice in recent clinical trials of neurological diseases, including Parkinson's and Alzheimer's disease .
However, the use of AAV vectors is limited by their relatively small maximal packaging capacity [3, 4]. Although there is a report describing specific serotypes with much larger packaging capacity , other studies [6–8] indicated that intact genomes were not, in fact, packaged in that case , and further suggested a strict size cutoff of 5.2 kb. Efforts have been made to expand the available AAV transgene size by trans-splicing and/or recombination to split the AAV expression cassette to two viral vectors . However, the expression efficiency is still much lower than that of a single viral vector.
Despite these efforts, an AAV expression cassette capable of delivering a large transgene with high expression level is still not available. As a strategy to expand the available size of transgenes in AAV, we decided to minimize the size of the expression cassette to allow more space for transgenes. Using an AAV expression cassette known to express transgenes with high efficiency as a template, we generated a variety of shorter cassettes. Then we systematically evaluated their expression efficiency in neurons to maximize packaging capacity while not compromising transgene expression.
Experimental strategy for modifying expression cassettes
Transgene expression driven by AAV expression cassettes with modified WPRE
WPRE is a 600 bp long tripartite element containing gamma, alpha, and beta elements, in the given order  and contributes to the strong expression of transgenes in AAV systems [11–13]. It also enhances the expression of a transgene lacking introns. In a previous study , a shortened WPRE sequence (WPRE2) containing a minimal gamma element and a partial alpha-beta element was used in an AAV vector, but its transgene expression efficiency was not quantitatively examined. We shortened this sequence further so that it contained only minimal gamma and alpha elements (WPRE3). Introns are known to possess a posttranscriptional regulatory element that efficiently induces transport of mRNA out of the nucleus and enhances mRNA stability . Therefore, we inserted a chimeric intron sequence instead of WPRE to examine whether an intron sequence, shorter than WPRE, could increase transgene expression comparable to WPRE. We compared the EGFP expression efficiency of expression cassettes containing WPRE (CWB), WPRE3 (CW3B), or a short chimeric intron (CIB), together with an expression cassette lacking any of these (CB) (Figure 1). CB that lacks WPRE from CWB exhibited reduced EGFP expression compared to CWB (28.6% of CWB), indicating that WPRE is critical for strong expression. The chimeric intron did not enhance the expression of EGFP. The EGFP expression level of CW3B was not statistically different compared with that of the control CWB cassette (83.4%), even though the size of WPRE had been significantly reduced (from 600 bp WPRE to 247 bp WPRE3, approximately 41.2% of original size).
Effects of modification in the polyadenylation signal on transgene expression
Using CW3B as a template, we next compared the transgene expression efficiency of various polyadenylation signal sequences (Figure 2A). Aside from the CWB component bGHpA, the SV40 early/late polyadenylation signal is also commonly used for efficient transgene expression in mammalian cells. Because the late polyadenylation signal sequence acts more efficiently for transgene expression than the early polyadenylation signal sequence due to the presence of a short upstream element preceding the mRNA cleavage site [19, 20], only the late polyadenylation signal sequence was tested in this study. We first designed an expression cassette containing the SV40 late polyadenylation signal sequence instead of bGHpA (CW3L). A previous study using lentiviral vectors showed that the upstream element in the late SV40 polyadenylation signal enhances the function of the polyadenylation signal to a greater extent in a tandem arrangement than when alone . Therefore, we tested this in our study as well, by adding an additional upstream sequence element prior to the full late polyadenylation signal (CW3SL). Because a short synthetic polyadenylation signal was successfully used in a previous study , we also generated AAV expression cassettes that have a synthetic polyadenylation signal alone (CW3A), or together with a single (CW3SA) or tandem upstream element of the SV40 late polyadenylation signal (CW3SSA). Compared with CW3B, the expression levels of EGFP were 106.5% by CW3L, 122.7% by CW3SL, 49.7% by CW3A, 70.3% by CW3SA, and 76.4% by CW3SSA (Figure 2B-D).
Effect of cassette size on transgene expression efficiency
Transgene expression by CW3SL in mice brain
CW3SL significantly increases AAV packaging limit of transgene
AAV offers many advantages as a gene delivery system over other viral vectors; however, its strict packaging capacity limits its use for delivering large transgenes or large cell-type specific promoters. Techniques to divide a gene into two AAV vectors and then combining each segment to reconstitute the full-length sequence after transduction by trans-splicing and recombination have been developed, but this technique is still in its infancy showing much lower expression efficiency compared to a non-divided single vector . Other groups have also developed shorter expression cassettes containing a functional, minimal-sized polyadenylation signal [15, 22]; however, we found that they yield only very low levels of transgene expression. To fully exploit the potential of AAV vectors, a short expression cassette capable of supporting proper expression levels of transgenes required for basic and clinical applications is highly required.
In this study, we systematically compared AAV expression cassettes comprising various short regulatory elements as substitutes for the WPRE and polyadenylation signal of a prototype cassette capable of expressing high levels of transgenes. We found that a shortened WPRE, named WPRE3, which contains two of the three regulatory elements of WPRE, affords efficient transgene expression (83.4%) even when reduced in size from 600 bp to 247 bp.
We then altered the polyadenylation signal of the modified expression cassette (CW3B) that contains WPRE3. Because synthetic, minimal consensus polyadenylation signal sequence was used successfully in conjunction with a strong promoter in a previous study , we used this 49 bp segment in AAV in conjunction with the CaMKIIα promoter (CW3A); however, the resulting transgene expression driven by CW3A was only 40.4% of the original expression cassette CWB. The SV40 early/late polyadenylation signal is often used for expressing the transgene in mammalian cells. It is well known that SV40 late polyadenylation signal is more efficient due to a short upstream element sequence [19, 20]. This sequence is also known to increase the efficiency of a weak polyadenylation signal and even more so when used in tandem . In the present study, we found that the 135 bp SV40 late polyadenylation signal (CW3L) showed similar efficiency to that of bGHpA, which is 223 bp (CW3B). An additional upstream element sequence (CW3SL) further improved the efficiency and restored the reduced level of expression from WPRE3. The upstream element also improved the efficiency of the synthetic polyadenylation signal (comparing CW3A, CW3SA, and CW3SSA). However, these expression cassettes were not as efficient in terms of size as the ones containing the SV40 late polyadenylation signal (CW3L and CW3SL).
The new CW3SL cassette provided 399 bp additional cloning capacity compared with the original template vector CWB and produced comparable expression to CWB (103.4%). The maximal packaging capacity of CWB and CW3SL is approximately 3.6 kb and 4 kb respectively. Among the transcripts expressed in the mouse hippocampus (defined by RNA sequencing), 1.82% (488 genes) have coding sequence length within the range of 3.6 to 4 kb (Figure 5C). Furthermore, note that CWB represents an optimized, compact expression cassette from which sequences not required for expression have been eliminated in our study. Conventionally used AAV backbones which include a commonly used neuronal expression cassette composed of 1.3 kb CaMKIIα promoter (which shows comparable expression efficiency with 0.4 kb CaMKIIα promoter  used in CWB), WPRE, and human growth hormone polyadenylation signal [23–25] has a capacity of 2.4 kb. Therefore, when compared with these conventionally used AAV cassettes, the CW3SL construct would obtain even more cloning capacity than 399 bp. 12.40% (3324 genes) of the transcripts expressed in the mouse hippocampus have coding sequence length within the range of 2.4 to 4 kb. Therefore, CW3SL now makes it possible to express larger transgenes at high efficiency compared with the prototype strong expression cassette CWB and other conventional expression cassettes. In some cases, CW3SL may provide an opportunity to package larger transgenes in AAV vectors that were previously impossible to package, while in other cases it may enable fusing a reporter such as EGFP or other fluorescent proteins without losing expression efficiency as we demonstrated. The family of new expression cassettes tested in this study could also provide a variety of cloning options. Other cassettes shorter than CW3SL would allow packaging of even longer transgenes and still would be useful in certain experimental conditions although their expression level would be lower than that of CW3SL. The new expression cassettes described here would also be useful for designing other derivatives. For example, the SV40 late polyadenylation signal supported the expression of transgenes as efficiently as vectors containing bGHpA in the presence of WPRE3 (Figure 2). Thus, the SV40 late polyadenylation signal alone may support expression at a level similar to that of the CB expression cassette while saving 88 bp. In addition, an SV40 late polyadenylation signal upstream element may further enhance the efficiency of transgene expression. In this study, we used the CaMKIIα promoter to drive transgene expression in neurons, but it would also be possible to substitute this promoter with other ubiquitous promoters, cell-type–specific promoters, or even conditional promoters for different applications.
One of the most critical limitations for application of AAV is the strict packaging capacity. To overcome this issue we systematically developed and compared various expression cassettes. We found that the newly developed smaller expression cassette shows comparable transgene expression efficiency with the efficient vector generally used for strong transgene expression in AAV system. This new expression cassette will allow us to package larger transgenes without compromising expression efficiency.
The expression cassettes tested in this study are shown in Figures 1 and 2. Each element was PCR amplified and inserted into the AAV vector pAAV-Ef1a-DIO-hChR2(H134R)-EYFP-WPRE-pA, replacing the DNA sequences between the inverted terminal repeats (ITRs). The backbone plasmid and a CaMKIIα promoter (0.4 kb) were gifts from K. Deisseroth and T. Kaneko , respectively. WPRE was amplified from pLentilox3.1, and WPRE3 was produced by using recombinant PCR, in which the minimal sequences of γ and α elements were fused . bGHpA was amplified from pcDNA3.1. A chimeric intron (5´-donor site from a human β-globin intron and 3´-acceptor site from an immunoglobulin heavy chain variable region intron) and the SV40 late polyadenylation signal were amplified from pZac2.1. The complete plasmid sequences are deposited in GenBank (GenBank accession number KJ411911-KJ411919).
AAVs (serotype 1) were purified from HEK293T cells that were transfected with plasmids containing each expression cassette, p5E18-RXC1, pAd-ΔF6  (20 ml medium [DMEM/10% FBS] in a 150-mm culture dish). Three days after transfection, the medium containing AAV particles was collected and treated with Benzonase® (50 U/ml for 30 min at 37°C). After 1,500 rpm, 10 min centrifugation, the supernatant was loaded onto iodixanol gradients (54% 4 ml, 40% 5 ml, 25% 5 ml, and 15% 6 ml). Four milliliters of the 40% layer was collected after ultracentrifugation (69000 rpm, 70Ti rotor). The solution was exchanged with PBS and concentrated using an Amicon Ultra-15 centrifugal filter unit. The titer was measured using quantitative RT-PCR.
Viral infection and western blot analysis
AAVs expressing EGFP and tdTomato were co-infected to DIV9 cultured hippocampal neurons derived from E18 rat embryos (5 × 108 GC/well for a 12-well plate) and incubated at 37°C in a CO2 incubator for 9 days. Cells were then collected after brief washing with 1× PBS and lysed with RIPA buffer (50 mM Tris-Cl, pH 7.6, 150 mM NaCl, 1 mM EDTA, 1% NP-40, 0.25% SDS, 0.5% sodium deoxycholate) containing PIC (Protease Inhibitor Cocktail, Roche). The cell lysate was incubated on ice for 10 min, and SDS-gel loading buffer (50 mM Tris-Cl, pH 6.8, 100 mM dithiothreitol, 2% SDS, 0.1% bromophenol blue, 10% glycerol) was added to each sample. For electrophoresis, equal amount of proteins were loaded onto 12% polyacrylamide gels. Separated proteins were then transferred onto a nitrocellulose membrane at 4°C for 15–20 h. After blocking for 1 h with 5% skim milk in Tris-buffered saline containing 0.1% Tween-20 (TBST) at room temperature, the membranes were incubated with antibodies against either EGFP (1:5,000 NeuroMab), red fluorescent protein (RFP; 1:10,000, Rockland) or tubulin (1:100,000), at 4°C overnight. HRP-conjugated secondary antibodies were applied for 1 h at room temperature after washing with TBST. Enhanced chemiluminescence (Millipore) was used to detect antigen-antibody complexes. Images were acquired using a ChemiDoc XRS + System (Bio-Rad), and the densities of the bands were measured using the Image Lab program (Bio-Rad). For western blot analysis in Figures 1 and 2, the EGFP expression level of each expression cassette was normalized with tdTomato expression level. For western blot analysis in Figure 5, EGFP-p110γ levels were first normalized with tubulin levels, then normalized with the average of the level for CWB.
Stereotactic viral injection and imaging
All surgical procedures were conducted under sterile conditions and approved by the Institutional Animal Care and Use Committee of Seoul National University. Wild-type C57BL/6 male mice (8 weeks of age) were anesthetized by i.p. injection of Zoletile and arranged in a stereotactic frame (Stoelting Co.). The hippocampal CA1 regions (AP: −1.8 mm, ML: ± 1.5 mm, DV: −1.7 mm) were targeted, and an equivalent amount of AAVs (EGFP: 8 × 108 GC; tdTomato: 4 × 108 GC) were delivered by using a 10-μL syringe pump (Hamilton Co.) at 6.0 μL/h. After an additional 10 min of diffusion, a needle was withdrawn and the animal’s scalp was sutured with black silk. After 3 weeks, the brain was removed and fixed in 4% paraformaldehyde at 4°C overnight, followed by dehydration in 30% sucrose at 4°C for 2 days. Sections (50 μm) were prepared by using a cryostat and mounted on glass slides with Vectashield (Vector Lab). Fluorescence images from hippocampal CA1 region were collected using a confocal microscope and analyzed with ImageJ program. To measure the EGFP expression efficiency from the virus-infected area, tdTomato-expressed CA1 pyramidal cell body layer was selected from each image, and the fluorescence intensities of EGFP and tdTomato of the region were measured. EGFP intensity was divided by tdTomato intensity to normalize the virus infusion efficiency.
This study was supported by the National Honor Scientist Program of Korea.
- Vannucci L, Lai M, Chiuppesi F, Ceccherini-Nelli L, Pistello M: Viral vectors: a look back and ahead on gene transfer technology. New Microbiol 2013, 36:1–22.PubMed
- Weinberg MS, Samulski RJ, McCown TJ: Adeno-associated virus (AAV) gene therapy for neurological disease. Neuropharmacology 2013, 69:82–88.PubMedView Article
- Dong JY, Fan PD, Frizzell RA: Quantitative analysis of the packaging capacity of recombinant adeno-associated virus. Hum Gene Ther 1996, 7:2101–2112.PubMedView Article
- Hermonat PL, Quirk JG, Bishop BM, Han L: The packaging capacity of adeno-associated virus (AAV) and the potential for wild-type-plus AAV gene therapy vectors. FEBS Lett 1997, 407:78–84.PubMedView Article
- Allocca M, Doria M, Petrillo M, Colella P, Garcia-Hoyos M, Gibbs D, Kim SR, Maguire A, Rex TS, Di Vicino U, Cutillo L, Sparrow JR, Williams DS, Bennett J, Auricchio A: Serotype-dependent packaging of large genes in adeno-associated viral vectors results in effective gene delivery in mice. J Clin Invest 1955–1964, 2008:118.
- Dong B, Nakai H, Xiao W: Characterization of genome integrity for oversized recombinant AAV vector. Mol Ther J Am Soc Gene Ther 2010, 18:87–92.View Article
- Wu Z, Yang H, Colosi P: Effect of genome size on AAV vector packaging. Mol Ther J Am Soc Gene Ther 2010, 18:80–86.View Article
- Lai Y, Yue Y, Duan D: Evidence for the failure of adeno-associated virus serotype 5 to package a viral genome > or = 8.2 kb. Mol. Ther J Am Soc Gene Ther 2010, 18:75–79.View Article
- Ghosh A, Yue Y, Duan D: Efficient transgene reconstitution with hybrid dual AAV vectors carrying the minimized bridging sequences. Hum Gene Ther 2011, 22:77–83.PubMed CentralPubMedView Article
- Dittgen T, Nimmerjahn A, Komai S, Licznerski P, Waters J, Margrie TW, Helmchen F, Denk W, Brecht M, Osten P: Lentivirus-based genetic manipulations of cortical neurons and their optical and electrophysiological monitoring in vivo. Proc Natl Acad Sci 2004, 101:18206–18211.PubMed CentralPubMedView Article
- Loeb JE, Cordier WS, Harris ME, Weitzman MD, Hope TJ: Enhanced expression of transgenes from adeno-associated virus vectors with the woodchuck hepatitis virus posttranscriptional regulatory element: implications for gene therapy. Hum Gene Ther 1999, 10:2295–2305.PubMedView Article
- Paterna JC, Moccetti T, Mura A, Feldon J, Büeler H: Influence of promoter and WHV post-transcriptional regulatory element on AAV-mediated transgene expression in the rat brain. Gene Ther 2000, 7:1304–1311.PubMedView Article
- Xu R, Janson CG, Mastakov M, Lawlor P, Young D, Mouravlev A, Fitzsimons H, Choi KL, Ma H, Dragunow M, Leone P, Chen Q, Dicker B, During MJ: Quantitative comparison of expression with adeno-associated virus (AAV-2) brain-specific gene cassettes. Gene Ther 2001, 8:1323–1332.PubMedView Article
- Goodwin EC, Rottman FM: The 3’-flanking sequence of the bovine growth hormone gene contains novel elements required for efficient and accurate polyadenylation. J Biol Chem 1992, 267:16330–16334.PubMed
- Wang D, Fischer H, Zhang L, Fan P, Ding RX, Dong J: Efficient CFTR expression from AAV vectors packaged with promoters–the second generation. Gene Ther 1999, 6:667–675.PubMedView Article
- Donello JE, Loeb JE, Hope TJ: Woodchuck hepatitis virus contains a tripartite posttranscriptional regulatory element. J Virol 1998, 72:5085–5092.PubMed CentralPubMed
- Kalev-Zylinska ML, During MJ: Paradoxical facilitatory effect of low-dose alcohol consumption on memory mediated by NMDA receptors. J Neurosci 2007, 27:10456–10467.PubMedView Article
- Huang MT, Gorman CM: Intervening sequences increase efficiency of RNA 3’ processing and accumulation of cytoplasmic RNA. Nucleic Acids Res 1990, 18:937–947.PubMed CentralPubMedView Article
- Carswell S, Alwine JC: Efficiency of utilization of the simian virus 40 late polyadenylation site: effects of upstream sequences. Mol Cell Biol 1989, 9:4248–4258.PubMed CentralPubMed
- Schek N, Cooke C, Alwine JC: Definition of the upstream efficiency element of the simian virus 40 late polyadenylation signal by using in vitro analyses. Mol Cell Biol 1992, 12:5386–5393.PubMed CentralPubMed
- Schambach A, Galla M, Maetzig T, Loew R, Baum C: Improving transcriptional termination of self-inactivating gamma-retroviral and lentiviral vectors. Mol Ther 2007, 15:1167–1173.PubMed
- Ostedgaard LS, Rokhlina T, Karp PH, Lashmit P, Afione S, Schmidt M, Zabner J, Stinski MF, Chiorini JA, Welsh MJ: A shortened adeno - associated virus expression cassette for CFTR gene transfer to cystic fibrosis airway epithelia . Proc Natl Acad Sci U S A 2005, 102:2952–2957.PubMed CentralPubMedView Article
- Tye KM, Prakash R, Kim S-Y, Fenno LE, Grosenick L, Zarabi H, Thompson KR, Gradinaru V, Ramakrishnan C, Deisseroth K: Amygdala circuitry mediating reversible and bidirectional control of anxiety. Nature 2011, 471:358–362.PubMed CentralPubMedView Article
- Yizhar O, Fenno LE, Prigge M, Schneider F, Davidson TJ, O’Shea DJ, Sohal VS, Goshen I, Finkelstein J, Paz JT, Stehfest K, Fudim R, Ramakrishnan C, Huguenard JR, Hegemann P, Deisseroth K: Neocortical excitation/inhibition balance in information processing and social dysfunction. Nature 2011, 477:171–178.PubMedView Article
- Mattis J, Tye KM, Ferenczi EA, Ramakrishnan C, O’Shea DJ, Prakash R, Gunaydin LA, Hyun M, Fenno LE, Gradinaru V, Yizhar O, Deisseroth K: Principles for applying optogenetic tools derived from direct comparative analysis of microbial opsins. Nat Methods 2012, 9:159–172.View Article
- Hioki H, Kameda H, Nakamura H, Okunomiya T, Ohira K, Nakamura K, Kuroda M, Furuta T, Kaneko T: Efficient gene transduction of neurons by lentivirus with enhanced neuron-specific promoters. Gene Ther 2007, 14:872–882.PubMedView Article
- Vandenberghe LH, Xiao R, Lock M, Lin J, Korn M, Wilson JM: Efficient serotype-dependent release of functional vector into the culture medium during adeno-associated virus manufacturing. Hum Gene Ther 2010, 21:1251–1257.PubMed CentralPubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.