Introduction

Terpene synthases catalyze the cleavage and formation of highly complex multiple bonds via a series of carbocation intermediates to create a significant diversity of terpene molecular skeletons.1 Recently, a large number of putative terpene cyclase genes obtained by genomic sequencing and transcriptome analysis were made available through genetic databases of bacteria, fungi and plants. The catalytic mechanism of terpene synthases has been extensively studied using structural biology and examined by quantum mechanics/molecular mechanics calculations.2, 3 For a rational engineering of terpene cyclases, a detailed elucidation of the reaction mechanism of individual cyclases is essential. Conventionally, whole cell-incorporation experiments of simple isotopically labeled precursors, such as doubly 13C- and 2H-labeled sodium acetate and mevalonolactone, are used to study the cyclization pathways.4 In vitro analysis of terpene cyclases with specifically isotope-labeled precursors is a more straightforward method for elucidating their mechanisms. In most cases, NMR analysis is used for the correct assignment of the labeling positions. However, owing to the analytical detection limit (usually>100 μg) and difficulties in the preparation of site-specifically labeled precursors, an alternative method may improve this situation.

EI-MS is one of the most widely used techniques to analyze apolar, volatile, thermostable and low-MW natural products including terpenes, and has been employed for the discovery and characterization of various terpenoids.5 More importantly, this analysis requires only small amounts of samples (ca. 10 ng per analysis), thus it is >100 times more sensitive than the NMR analysis. In addition, it gives reproducible fragmentations that include structural information on cyclic terpenes, suggesting a potential application of EI-MS in the investigation of the cyclization mechanisms. However, the application of this technique has been hampered by the difficult interpretation of the fragmentation mechanisms. One of the solutions to overcome this issue is the systematic analysis of site-specifically deuterium-labeled samples. Herein, we report the development of a versatile chemoenzymatic method for the synthesis of deuterium-labeled prenyl diphosphates and their study by MS analysis to elucidate the cyclization mechanism of a diterpene synthase, namely phomopsene synthase (PaPS).

Bifunctional terpene synthases, which possess a prenyltransferase domain at the N-terminal and a terpene cyclase domain at the C-terminal, represent a new family of terpene synthases. In contrast to other terpene synthases, this type of enzymes can synthesize diterpenes or sesterterpenes owing to their ability to convert cellularly abundant farnesyl diphosphate (FPP) to geranylgeranyl diphosphate (GGPP) or geranylfarnesyl diphosphate as well as cyclize the resultant prenyl diphosphate. In 2007, Sassa and co-workers identified the first bifunctional terpene synthase PaFS, which gives fusicoccadiene possessing a tricyclic 5-8-5 hydrocarbon skeleton.6 Following this pioneering work, a homology-based cloning allowed us to identify homologous enzymes such as PaPS, ophiobolin F synthase and sesterfisherol synthase.7, 8, 9 The cyclization products of the growing family of terpene synthases are unique structures featuring polycyclic skeletons such as tricyclic fusicoccadiene/ophiobolin F,6, 8 tetracyclic phomopsene (1)/sesterfisherol7, 9 and pentacyclic quiannulatene (Scheme 1 and Supplementary Figure S1).10 The cyclization can occur under specific control of the terpene synthase, thus it is of particular interest for the researchers in the fields of organic and natural product chemistry. In this study, we applied an MS-based analysis of isotopically labeled products to elucidate the cyclization mechanism of PaPS.

Results and discussion

Chemoenzymatic synthesis of deuterium-labeled prenyl diphosphates

Isopentenyl diphosphate (IPP) is a fundamental building block in the enzymatic synthesis of prenyl diphosphates such as GPP, FPP and GGPP.1 Its isomerization and chain elongation involve four enzymes, that is, IPP isomerase, GPP synthase, FPP synthase and GGPP synthase, and proceed in a highly regio- and stereo-selective manner (Scheme 1a). The function of each of these enzymes has been firmly established by in vitro enzymatic reactions. Therefore, an enzymatic synthesis allows us to prepare a series of site-specifically deuterated prenyl diphosphates from a limited set of site-specifically labeled IPPs (2a2d) (Scheme 1a).

Initially, 2-CD2-IPP 2b was synthesized as shown in Scheme 1b. After the base-catalyzed deuterium exchange of protected ester 3 in CH3OD, the resultant ester D3-3 was converted into a TBDPS ether via the corresponding alcohol. Ketal deprotection and subsequent Peterson olefination 11 yielded silyl ether 4, whose TBDPS protecting group was then converted into a tosyl group 5 before the following phosphorylation step. 4-CD2-IPP 2c was synthesized starting from deuterium-labeled alkyne 6. After deprotection, the regioselective carboalumination 12 of the resultant alcohol with Me3Al and Cp2ZrCl2 and subsequent quenching with D2O gave the desired alcohol, which was converted into tosylate 7 (Scheme 1c). For the synthesis of 5′-CD3-IPP 2d, the vinyl anion prepared from THP-protected vinyl bromide 8 upon treatment with t-BuLi followed by CuCN was coupled with CD3I to give THP ether, which was converted into the corresponding tosylate 9 (Scheme 1d). Each tosylate was subjected to phosphorylation using a standard protocol.13 Together with 1-CD2-IPP 2a, which was synthesized according to a literature procedure,13, 14 we prepared a total of four specifically deuterium-labeled IPP 2a2d.

To facilitate the discussion on the labeling positions in both GGPP precursor and phomopsene 1, the prenyl chain of GGPP was divided into four isoprene units I–IV, and the corresponding positions were highlighted with the same color (Scheme 1a). In order to perform the chemoenzymatic synthesis of various [2H]-GGPPs, we prepared IPP isomerase (IDI),15 GPP synthase (IspA-S81F),16 FPP synthase (IspA)17 and PaPS 7 according to literature procedures. Although the PT domain of PaPS was able to convert dimethylallyl diphosphate (DMAPP) and GPP into GGPP, the reaction was too sluggish.6, 7 Thus, IspA was added to achieve a rapid conversion of the prenyl diphosphate precursors.

The incubation of 2a and non-labeled DMAPP with IspA-S81F gave specifically labeled [2H]-GPP, which was then incubated with IspA and PaPS to give site-specifically labeled [2H]-1 via III-[1-CD2]-GGPP (Supplementary Scheme S1). Similarly, using the three remaining [2H]-IPPs 2b2d as extender units, we prepared three specifically labeled [2H]-GPPs which were converted into isoprene unit III-specifically labeled [2H]-1 via the corresponding III-[2H]-GGPPs. For the preparation of the starter unit DMAPP, [2H]-2a2d were isomerized with IPP isomerase. After dilution with non-labeled IPP to suppress the incorporation of labeled IPP, the resultant DMAPP was converted into isoprene unit IV-specifically labeled 1 via IV-[2H]-GGPPs with IspA and PaPS (Supplementary Scheme S1). Thus, we developed an efficient method to synthesize all combinations of specifically deuterium-labeled GGPP and the corresponding cyclization product 1 by conversion of simple starting units 2a2d with prenyltransferases and/or IPP isomerase.

MS-based analysis of site-specifically labeled terpenes

In the EI-MS spectrum of 1, we observed the following major fragment ions at m/z 257 (F257), 202 (F202), 160 (F160) and 145 (F145) (Supplementary Figure S2a),7 as well as two other fragment ions (F243 and F216). The MS/MS analysis of F202 and F160 showed common product ions at m/z 145, 117 and 105 (Supplementary Figures S2b and c), indicating the fragmentation sequence M-F202-F160-F145-F105. All these fragments most likely originated from the elimination of a methyl radical and neutral molecules A–E, which have methyl groups. On the basis of these results, we could speculate which structure corresponded to each fragment, as shown Figure 1a. Later, the proposed structures were confirmed by extensive MS analysis of site-specifically labeled [2H]-1. During the MS analysis, we found that fragment F257 consisted of two isomeric fragments, which could be separated during the fragmentation of specifically labeled [2H]-1 (Figures 1d and Supplementary Figure S3). These fragments may provide further confirmation of the labeling pattern upon detailed HR-MS and MS/MS analyses. However, these experiments were not performed since sufficient data were obtained from other fragment ions.

Figure 1
figure 1

(ae) Fragmentation sequences of 1. (f) Fragmentations observed in 18, 19 20-CD3-1, 19, 20-CD3-1 and 20-CD3-1. A full colour version of this figure is available at the Journal of Antibiotics journal online.

In the cyclization with various terpene synthases, most of methyl groups on the isoprene unit of a prenyl diphosphate remain intact in the cyclization product. Thus, we speculated that, in the MS analysis of 1, methyl groups could be useful fragmentation tracers to identify fragments derived from the specific isoprene units I–VI in GGPP. To obtain information on the origin of the methyl groups in the fragments, we analyzed the MS spectra from the enzymatic reactions with multiply and specifically [C2H3]-labeled GGPP. The results showed that the labels in the isoprene units I, II, III, IV were located at C20, C19, C18 and C16/C17, respectively (Figure 1f and Supplementary Figure S3).

Considering the proposed structures of the key fragments, we initially focused on the formation of an A-ring that could most likely originate from isoprene units III and IV based on the structure of 1. Four fragments, F243, F216, F202 and F105, were useful to examine this mechanism (Figure 2). Mass shifts between non-labeled and labeled-1 enabled us to speculate the labeling pattern of 1. Isobutene unit D (M-F216) from IV-[1′-CD2]- and IV-[5′-CD3]-GGPP retained the labels (Figures 2a and b), whereas C5 unit A (M-F202) possessed additional labels from III-[4′-CD2]-GGPP (Figure 3c). The neutral C3 unit C (F202-F160-F145-F105) from IV-[2′-CD] and III-[5′-CD3]-GGPP (Figures 2d and e) and ethylene unit E (M-F243) from IV-[1′-CD2]- and III-[4′-CD2]-GGPP (Figures 2a and c) were eliminated with the labels. Consequently, the origin of all the hydrogen atoms on the A-ring was confirmed, as shown in Figure 2.

Figure 2
figure 2

MS spectra of deuterium labeled 1; (a) IV-[1′-CD2]-GGPP, (b) IV-[5′-CD3]-GGPP, (c) III-[4′-CD2]-GGPP, (d) IV-[2′-CD1]-GGPP, (e) III-[5′-CD3]-GGPP, (f) I-[2′-CD1]-GGPP, (g) I-[4′-CD2]-GGPP, (h) I-[5′-CD3]-GGPP and (i) II-[2′-CD1]-GGPP. The observed mass shifts are shown in the parenthesis. A full colour version of this figure is available at the Journal of Antibiotics journal online.

Figure 3
figure 3

Classification of di/sesterterpenes based on the initial cyclization mode.

Next, we examined the formation of a C/D-ring that originated from isoprene units I and II. Propene unit B (F202-F160) possessed the labels from I-[2′-CD], I-[4′-CD2] and I-[5′-CD3]-GGPP (Figures 2f–h). By contract, no deuterium label (M272) from II-[2′-CD]-GGPP was observed (Figure 2i), while other labels from II-[1′-CD2], II-[4′-CD2] and II-[5′-CD3]-GGPP were all retained (Supplementary Figure S3). Most of the remaining labels of units I-III were found in F160. An exceptional labeling pattern was observed in the analysis of 1 from III-[2′-CD] and I-[1′-CD2]-GGPP. Although 1 retained the labels, the expected MS shift of F202 was not observed in these cases. We speculated that an unexpected hydrogen scrambling occurred at these positions. However, this does not influence our proposal for the cyclization mechanism.

For the A-ring formation, labels from unit IV in GGPP located at C13 and C14-C15(C17)-C16 in 1, indicated that a C13–C14 bond cleavage occurred during the cyclization. Isoprene unit III C9–C10 and C11(C18)-C12 in 1 was separated by the insertion of C14 between C10 and C11. The labeling pattern revealed that two rounds of 1,2-C-shifts are involved in the A-ring construction. In the D-ring construction, the labels from I-[2′-CD], I-[4′-CD2] and I-[5′-CD3]-GGPP were located on the D-ring at C3-C4-C20, respectively, suggesting that the C2-olefinic proton of unit I in GGPP was shifted on C3 in 1. The loss of the label from II-[2′-CD]-GGPP in 1 suggested that the final quenching of the C7-carbocation occurred during the deprotonation at C6 to afford 1. Overall, the cyclization mechanism of PaPS is summarized in Scheme 2. The cyclization begins with the elimination of the pyrophosphate group of GGPP followed by a cyclization to afford dolabelladien-15-yl cation (10+). Two sequential 1, 2-alkyl shifts are the key processes accounting for the carbon rearrangement of 1. The resultant 12-membered ring cation 11+ then undergoes a C2–C10 cyclization to afford 5-5-9 tricyclic cation 12+. Subsequent 1, 2-hydride shift and C2–C6 ring closure followed by C6-deprotonation furnish 1.

Sesterterpene mangicol A, derived from a marine fungus, possesses a unique tetracyclic scaffold, which is closely related to 1. Its biosynthetic pathway resembles that of 1 and was proposed by feeding experiments of various 13C-labeled acetates.18 A branch point between the metabolic pathways of 1 and mangicol is the cyclization of 11+/11a+. Instead of the C2–C10 cyclization in the pathway of 1, an alternative C6–C10 cyclization leads to the core structure of mangicol A via a key 5-9-5 tricyclic intermediate (Supplementary Scheme S2).

In 2015, we proposed that the amino acid sequence of the terpene cyclase domain of bifunctional terpene synthases might reflect the first cyclization mode (Figure 3).9 A phylogenetic analysis showed that PaPS could be classified into clade B, which catalyzes the formation of a 5–11 ring system via a C1-III-IV cyclization (type B cyclization). The elucidated cyclization mechanism of PaPS is consistent with our proposal. Furthermore, this has been extended to two more clade B-bifunctional terpene synthases, that is, astellifadiene synthase and stellatatriene synthase, which have recently been identified.19, 20

In this study, we developed an efficient method to synthesize site-specifically isotope-labeled oligoprenyl diphosphate precursors, such as DMAPP, GPP, FPP and GGPP, starting from four chemically synthesized [2H]-IPPs by enzymatic transformation. By using these precursors, we demonstrated that the MS-based analysis of site-specifically isotope-labeled terpenes provides sufficient information to elucidate the cyclization mechanism catalyzed by PaPS. During this study, we proposed a general strategy for the assignment of individual fragment ions: (1) information on the location of each isoprene unit (outline of fragmentation) could be obtained by analysis of the samples from the [C2H3]-labeled precursors; (2) detailed MS analysis of terpenes from isoprene unit-specifically labeled precursors provided useful information to assign the structure of the eliminated fragments; (3) fragmentation process of terpene could be proposed on the basis of chemical structure of cyclic terpene and labeling pattern of the eliminated fragments. In addition, overlapped ions with the same molecular m.u. could be separated by using a [C2H3]-labeled precursor and further analysis of 3 m.u.-shifted peaks provided additional information.

This highly sensitive method can be applied to a number of structurally complex terpenes. It was pointed out that a careful interpretation is required to perform the analysis due to the non-specific hydrogen migrations during the fragmentation processes and the kinetic isotope effects.21 These may be overcome by further confirmation of the labeling pattern by detailed HR-MS and MS/MS analyses, which were not used in this study. Recently, a systematic investigation into the EI-MS fragmentation mechanism of sesquiterpenes using all possible 13C-labeled isotopomers of FPP was also reported.21, 22 Thus, several methods for investigating the cyclization mechanism of terpene synthases are now available. These mechanisms will contribute to the rational engineering of terpene synthases for the generation of diverse terpene skeletons.

Methods

General

Oligonucleotides for polymerase chain reaction (PCR) were purchased from Hokkaido System Science Co., Ltd. Sequence analysis of the PCR fragment was performed by an automatic DNA sequencer (Applied Biosystems, Foster, CA, USA, ABI PRISM 310 Genetic Analyzer). Cell disruption was dealt with a ultrasonic disrupter UR-200P (TOMY SEIKO, Tokyo, Japan). Analysis of the samples during protein purification was performed using SDS-polyacrylamide gel electrophoresis, and the proteins were visualized by using coomassie brilliant blue staining. Protein concentration was determined by the bradford method with bovine serum albumin as a standard.

Strain and culture conditions

Escherichia coli DH5α were used for cloning and the following standard recombinant DNA techniques. E. coli BL21(DE3) was used for protein expression.

Cloning and protein expression

To generate the overexpression construct for idi and ispA, each gene was amplified using genomic DNA of E. coli as template and following primer sets; IDI-F (5′-GGGATTCCATATGCAAACGGAACACGTCATTTTAT-3′, NdeI site underlined), IDI-R (5′-CCGGAATTCTTATTTAAGCTGGGTAAATGCAGATA-3′, EcoRI site underlined) for idi overexpression, IspA-F (5′-AGGATCCATGGACTTTCCGCAGC-3′, BamHI site underlined) and IspA-R (5′-CCCGAATTCTTATTTATTACGCTGGATG-3′, EcoRI site underlined) for ispA overexpression. The PCR products were digested with appropriate restriction enzymes and cloned to the corresponding sites of pET28a to generate pET28a-idi and pET28a-ispA.

Mutation of ispA was introduced into pET28a-ispA by PCR using IspA-S81F-F (5′-AGTAAGCGTGGATACACTCAACGG-3′) and IspA-S81F-R (5′-TTTTAATTCATGATGATTTACCGGCAA-3′, mutation site underlined). The PCR product was then dealt with T4 Polynucleotide Kinase for phosphorylation of 5′-OH termini of the polynucleotides and re-ligated using Ligation high Ver.2 to generate pET28a-ispA-S81F.

pET28a-idi, pET28a-ispA and pET28a-ispA-S81F were separately introduced into E. coli BL21(DE3) for overexpression. Protein expression and purification was performed according to the literature procedure.15, 16, 17

Enzyme synthesis of deuterium labeled-2

Labeling of the first unit

Typical conditions are as follows; a reaction mixture (100 μl of Tris-HCl buffer (pH 7.4)) containing 1–2 μg of labeled-IPP, 5 μM of MgCl2, 0.5 mM of EDTA, 2 mM of DTT and 10–50 ng of IDI was incubated at 30 °C. After 3 h, the reaction mixture was filtrated by utilizing the Amicon Ultra centrifugal filter (Merck Millipore, Billerica, MA, USA). To the filtrate was added 20–40 μg of non-labeled IPP, 1–5 ng of IspA and 0.6–3 ng of PaPS was added and the reaction mixture was incubated at 30 °C. The reaction mixture was extracted with hexane and the crude extract was directly used for the analysis with MS-2010 (Shimadzu, Kyoto, Japan), using a DB-1 MS capillary column (0.32 mm × 30 m, 0.25 μm film thickness; J&W Scientific, Folsom, CA, USA).

Analytical conditions: the sample was injected into the column at 80 °C in the splitless mode. After a 3 min isothermal hold at 80 °C, the column temperature was increased at 30 °C min−1 to 290 °C, with a 4 min isothermal hold at 290 °C. The flow rate of helium carrier gas was 0.66 ml min−1.

Labeling of the second or third unit

Typical conditions are as follows; a reaction mixture (100 μl of Tris-HCl buffer (pH 7.4)) containing 1–2 μg of non-labeled DMAPP, 2–4 μg of labeled-IPP, 5 μM of MgCl2, 0.5 mM of EDTA, 2 mM of DTT and 1.3–13 ng of IspA-S81F was incubated at 30 °C. After 3 h, the reaction mixture was filtrated by utilizing the Amicon Ultra 0.5 ml centrifugal filter. To the filtrate was added 20–40 μg of non-labeled IPP, 1–5 ng of IspA and 0.6–3 ng of PaPS was added and the reaction mixture was incubated at 30 °C. The reaction mixture was extracted with hexane and the crude extract was directly analyzed by GC-MS as in the similar conditions described above. The similar reaction conditions with non-labeled GPP and IspA instead of DMAPP and IspA-S81F was applied for labeling of FPP on the third unit. The labeled FPP was used for the PaPS reaction.

Labeling of the fourth unit

Typical conditions are as follows; a reaction mixture (100 μl of Tris-HCl buffer (pH 7.4)) containing 1–3 μg of non-labeled FPP, 4 μg of the labeled-IPP, 5 μM of MgCl2, 0.5 mM of EDTA, 2 mM of DTT and 0.6–3 ng of PaPS was incubated at 30 °C. The reaction mixture was extracted with hexane and the crude extract was directly analyzed by GC-MS as in the similar conditions described above.

scheme 1

(a) Outline of the chemoenzymatic synthesis of deuterium labeled 1. Synthetic schemes of (b) 2-CD2-IPP 2b, (c) 4-CD2-IPP 2c and (d) 5-CD3-IPP 2d.

scheme 2

Proposed cyclization mechanism of PaPS.