An Integrated Genomic and Transcriptomic Analysis Reveals Candidates of Susceptibility Genes for Crohn’s Disease in Japanese Populations

Kakuta, Yoichi; Ichikawa, Ryo; Fuyuno, Yuta; Hirano, Atsushi; Umeno, Junji; Torisu, Takehiro; Watanabe, Kazuhiro; Asakura, Akihiro; Nakano, Takeru; Izumiyama, Yasuhiro; Okamoto, Daisuke; Naito, Takeo; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Naito, Takeshi; Esaki, Motohiro; Kawai, Yosuke; Tokunaga, Katsushi; Nakamura, Minoru; Matsumoto, Takayuki; Nagasaki, Masao; Kinouchi, Yoshitaka; Unno, Michiaki; Masamune, Atsushi

doi:10.1038/s41598-020-66951-5

Download PDF

Article
Open access
Published: 24 June 2020

An Integrated Genomic and Transcriptomic Analysis Reveals Candidates of Susceptibility Genes for Crohn’s Disease in Japanese Populations

Yoichi Kakuta¹^na1,
Ryo Ichikawa¹^na1,
Yuta Fuyuno²,
Atsushi Hirano²,
Junji Umeno²,
Takehiro Torisu²,
Kazuhiro Watanabe³,
Akihiro Asakura⁴,
Takeru Nakano¹,
Yasuhiro Izumiyama¹,
Daisuke Okamoto¹,
Takeo Naito¹,
Rintaro Moroi¹,
Masatake Kuroha¹,
Yoshitake Kanazawa¹,
Tomoya Kimura¹,
Hisashi Shiga¹,
Takeshi Naito³,
Motohiro Esaki^2,5,
Yosuke Kawai⁶,
Katsushi Tokunaga⁶,
Minoru Nakamura⁷,
Takayuki Matsumoto^2,8,
Masao Nagasaki^4,9,
Yoshitaka Kinouchi¹⁰,
Michiaki Unno³ &
…
Atsushi Masamune¹

Scientific Reports volume 10, Article number: 10236 (2020) Cite this article

2608 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Expression quantitative trait locus (eQTL) analyses have enabled us to predict the function of disease susceptibility SNPs. However, eQTL for the effector memory T cells (TEM) located in the lamina propria mononuclear cells (LPMCs), which play an important role in Crohn’s disease (CD), are not yet available. Thus, we conducted RNA sequencing and eQTL analyses of TEM cells located in the LPMCs from IBD patients (n = 20). Genome-wide association study (GWAS) was performed using genotyping data of 713 Japanese CD patients and 2,063 controls. We compared the results of GWAS and eQTL of TEM, and also performed a transcriptome-wide association study using eQTL from Genotype Tissue Expression project. By eQTL analyses of TEM, correlations of possible candidates were confirmed in 22,632 pairs and 2,463 genes. Among these candidates, 19 SNPs which showed significant correlation with tenascin-XA (TNXA) expression were significantly associated with CD in GWAS. By TWAS, TNFSF15 (FDR = 1.35e-13) in whole blood, ERV3-1 (FDR = 2.18e-2) in lymphocytes, and ZNF713 (FDR = 3.04e-2) in the sigmoid colon was significantly associated with CD. By conducting integration analyses using GWAS and eQTL data, we confirmed multiple gene transcripts are involved in the development of CD.

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Genomic data in the All of Us Research Program

Article Open access 19 February 2024

Introduction

Inflammatory bowel disease (IBD) is a term for two conditions: Crohn’s disease (CD) and ulcerative colitis. IBD is a multifactorial disease where development of the disease involves both hereditary factors and environmental factors. A genome-wide association study (GWAS) was conducted by various institutions to identify hereditary factors, which revealed that more than 200 regions in the human genome confer susceptibility to IBD^1,2,3. However, many of the polymorphisms that show correlation with the disease are located in non-transcribed regions. Regions that show disease susceptibility due to functional mutation caused by amino acid substitution were limited to regions such as the nucleotide binding oligomerization domain-containing 2 (NOD2), interleukin 23 receptor (IL23R), and autophagy related 16-like 1 (ATG16L1). The GWAS of Japanese CD patients, reported by us in 2019, indicated that the only polymorphism with amino acid substitution among 11 identified disease susceptibility regions was the p.G149R polymorphism located in IL23R⁴. In cases of polymorphisms with amino acid substitution, it is highly likely that genes with such polymorphisms are involved in the development of IBD. However, the mechanism of IBD development caused by other polymorphisms is unknown, and susceptibility genes remain to be confirmed. It is predicted that the genomic mutations located in these disease susceptibility regions impact the expression of nearby genes and are involved in the development of IBD.

In recent years, many expression quantitative trait locus (eQTL) analyses have been performed with the aim of examining the relationship between comprehensive gene expression in various cell types and the genetic background. These findings have been used to create a database. Within this database, the Genotype Tissue Expression (GTEx) project examined gene polymorphism expression of every human tissue⁵. Using the eQTL database, it is possible to predict which tissues are affected by gene polymorphisms, which genes are involved, and what is the degree of expression of these genes. Furthermore, it is now possible to examine the relationship between polymorphism and changes in expression, to predict changes in gene expression levels caused by polymorphisms, and to perform a transcriptome-wide association study (TWAS) based on the data⁶.

By utilizing GWAS and eQTL analysis, polymorphisms that correlate to the development of IBD can be identified and the expression of genes impacted by these polymorphisms can be predicted. Moreover, TWAS enabled us to predict disease susceptibility genes and changes in expression that cause IBD by analyzing each gene unit.

However, each eQTL database is an analysis performed under specific conditions in specific cells. Moreover, racial variations need to be considered. To determine the causes of the development of CD, it is important to consider gene expression and the relationship of gene expression to single-nucleotide polymorphisms (SNPs) in cells that play a role in immunity in the sites of inflammation of the disease (i.e., the intestinal tissues). Although data regarding samples such as the small intestine, large intestine, and whole blood are available from previously described GTEx, data for the immunocompetent cells located in the intestinal sites are not yet available. Thus, the genes involved in IBD and how the expression of such genes is impacted by susceptibility gene polymorphism in Japanese IBD patients remain unknown.

Based on the above, we performed eQTL analyses by collecting CD4+ effector memory T cells (TEM cells) from lamina propria mononuclear cells (LPMCs), the cell type considered to be involved in disease state of Japanese CD patients. Using our results and the eQTL data from previously constructed database for other tissues, disease susceptibility genes involved in the development of CD in the Japanese population were identified.

Results

In LPMC-derived TEM cells, eQTL of 2,463 genes at 22,632 regions were identified

The analysis flow chart is shown in Fig. 1. RNA sequencing performed on TEM cells of 20 IBD patients (15 CD patients, 5 UC patients), which advanced to expression analysis, confirmed expression of 32,363 genes. According to eQTL analyses, 22,632 pairs in 2,463 genes were confirmed to be candidates (p < 1e-04) which showed correlation between gene polymorphism and expression. Among these pairs, 2,000 pairs in 220 genes showed significant (p < 1e-06) correlation (Supplementary Tables S1 and S2).

Twenty-five sites were confirmed as candidates correlated with Japanese CD by GWAS

Manhattan plots were constructed based on the GWAS of CD patients performed using a linear mixed model (Supplementary Fig. S1). Significant correlation was found in 370 SNPs (p < 5e-08). These SNPs were found to be located in two regions, the human leukocyte antigen (HLA) region on chromosome 6 (rs184950714, p = 1.07e-17) and upstream of tumor necrosis factor superfamily member 15 (TNFSF15) (rs55951892, p = 1.76e-23) on chromosome 9. Moreover, 301 SNPs that showed a candidate level of correlation (p < 1e-05) were found in an additional 23 regions (Table 1). Among the SNPs that showed more than a candidate level of correlation, only three polymorphisms, IL23R p.Gly149Arg (p = 4.22e-07), IL27 p.Leu119Pro (p = 3.28e-05), and SULT1A2 p.Asn235Thr (p = 4.38e-05), showed amino acid substitutions (Supplementary Table S3).

Table 1 Summary of the CD-GWAS results in Japanese patients.

Full size table

Correlation between Japanese CD and TNXA based on GWAS and eQTL results was assessed

Among the candidate polymorphisms identified by the GWAS, 19 SNPs of chromosome 6 showed significant correlation with expression of the tenascin-XA (TNXA) in intestinal TEM cells (rs117433623, P_GWAS = 6.34e-09, P_eQTL = 3.49e-05) (Fig. 2, Supplementary Figure S2, Supplementary Table S4). Only one SNP showed a genotype of GG; therefore, further analyses were conducted using two groups—CC and G carrier—in which a correlation tendency was also observed (p = 1.60e-03, Wilcoxon rank-sum test).

Six novel genes were identified by TWAS in addition to the previously reported TNFSF15 and RAP1A

Analyses of HLA regions by TWAS were performed separately from other regions. The relationship of gene expression of multiple genes such as HLA-DQ and HLA-DR with CD was confirmed in all analyzed cell types. Almost all of the correlations were found to be related to re9271170 in the GWAS (Supplementary Table S5). Excluding the HLA region, TNFSF15 (TWAS. p = 2.28e-17, FDR = 1.35e-13) in whole blood, endogenous retrovirus group 3 member 1 (ERV3-1) (TWAS. p = 4.79e-05, FDR = 2.20e-02) in EBV-immortalized lymphocytes, and zinc finger protein 713 (ZNF713) (TWAS. p = 4.41e-05, FDR = 3.03e-02) in the sigmoidal colon showed significant correlation (Table 2). Additionally, apolipoprotein B MRNA editing enzyme catalytic subunit 3 A (APOBEC3A) in whole blood, ras-related protein Rap-1A (RAP1A) in EBV-immortalized lymphocytes, nuclear pore complex interacting protein family member B9 (NPIPB9) and immunoglobulin lambda variable 3-29 (IGLV3-29) in the transverse colon, and WD repeat domain 31 (WDR31) in the sigmoidal colon showed possible associations (FDR < 0.10) as candidate genes (Table 2). Some of these genes showed possible associations in other tissues (Supplementary Table S6). Among these genes, correlation of SNPs within the regions of genes such as ERV3-1, RAP1A, ZNF713 was lost when a correction was made using the predicted expression levels, however, multiple SNPs continued to show a strong correlation in TNFSF15 after correction using the predicted expression levels (Fig. 3, Supplementary Figure S3).

Table 2 Summary of TWAS with the susceptibility genes for CD in Japanese patients (non-HLA genes).

Full size table

Discussion

The novel outcomes of this study were as follows: (1) even though on a small scale, eQTL data of intestinal LPMCs derived from the TEM cells of Japanese IBD patients were constructed for the first time, (2) polymorphisms that showed correlation by GWAS of Japanese CD patients indicated correlation with expression of TNXA in intestinal LPMC-derived TEM cells, (3) TNFSF15 in whole blood and RAP1A in lymphocytes were confirmed to be disease susceptibility genes when using TWAS for the first time in Japanese CD patients, (4) six genes (including 4 candidates) were newly identified to be correlative.

The eQTL constructed in this study, albeit at a very small scale, was limited to intestinal LPMC-derived TEM cells of Japanese IBD patients and has not previously been reported. The reason why we analyzed eQTL in TEM cells was TEM is considered to be strongly associated with IBD pathogenesis. For example, colitis can be induced in immunodeficient mice by transferring naïve T cells⁷, strategies blocking T-cell function are useful for attenuating mucosal inflammation in mice with experimental colitis⁸, and IBD is frequently associated with other T-cell mediated diseases (i.e., psoriasis and multiple sclerosis)^9,10. Based on the integration analysis of this eQTL data and the GWAS, new polymorphisms involved in the development of CD in the Japanese population that correlated to the expression of TNXA were identified. TNXA is considered a pseudogene which is not capable of producing functional protein. Therefore, it is unclear whether the gene is involved in the disease state, and if so, how it is involved. However, a report has suggested that TNXA is a serum protein characteristic of stricturing CD¹¹. Thus, combined with this report, it is possible that TNXA actually codes for a protein with unknown function. In addition, it may be involved in the development of the specific disease phenotype of CD. However, there is currently insufficient data to conclude that an increased level of TNXA in the serum of CD patients is involved in the development of CD. Polymorphisms that showed correlation with CD may have two functions: one may involve the development of CD via other functions and the other may involve the expression of TNXA that does not code for functional protein. Hence, the expression of TNXA may function as a marker of polymorphism in the gene. Future studies should consider the function of TNXA using models (i.e., mice) in addition to the conformation of a TNXA expression level in intestinal sites in Japanese CD patients. Additionally, the association of TNXA gene with CD causality was shown indirectly by connecting the results of GWAS and eQTL. To confirm this association, additional analysis such as Mendelian randomization analysis with a larger eQTL data set of Tem from LPMC in the Japanese population should be performed.

In this study, TWAS was first conducted on Japanese CD patients with the use of previously reported eQTL data. A verified correlation around the periphery of TNFSF15 may indicate disease susceptibility via TNFSF15 expression in whole blood according to TWAS. In recent years, many statistical correlations of polymorphisms with unknown function have been identified because genome-wide studies has become available due to low-cost genome analysis technology. However, it is difficult to analyze the expression of genes of various tissue samples in terms of sample collection cost. On the other hand, eQTL databases of various cell types have been constructed and such databases have become freely available. TWAS is one approach that can be used to solve the limitations of GWAS by analyzing such databases integrally and is an analytical method that can be used to identify new disease susceptibility genes. Those regions sometimes contain multiple genes; however, correlation with each gene can be identified by TWAS due to the analysis of a gene unit. The correlation of the TNFSF15 periphery identified by GWAS was found to exist in the region stretching from TNFSF15 to TNFSF8; however, the whole region was indicated to be involved in TNFSF15 expression and to correlate with CD, according to TWAS.

TNFSF15 is a cytokine gene belonging to the TNF family (also called TNF-like ligand 1 A (TL1A)) and is known to show increased expression at intestinal CD sites¹². TNFSF15 is mainly secreted from monocytic cells, such as macrophage and dendric cells, and is thought to promote Th1 and Th17 cell activities, leading to CD development¹³. Multiple studies have reported that TNFSF15 polymorphisms involve gene expression^14,15. TWAS results in this study agree with these reports. Therefore, the usefulness of TWAS is supported by analyses using independent databases such as TWAS.

The TWAS method used in this study confirmed multiple novel candidate genes in addition to TNFSF15 and RAP1A^4,16,17. APOBEC3A (cytidine deaminase) targets single-stranded DNA and functions as a restriction factor in retrovirus replication. It has been previously reported that this gene is involved in cell cycle arrest caused by DNA damage and oxidative stress¹⁸. Polymorphisms located relatively close to the gene are reported to correlate to IBD in the Western population; however, involvement of the genes in IBD has not been indicated. Therefore, this study showed such a correlation for the first time.

ERV3-1 is a gene found in endogenous retroviruses; however, the relationship of ERV3-1 with IBD has not been reported previously. The function of both NPIPB9 and IGLV3-29 is also unknown. One study has reported that changes in the expression level of ZNF713 due to mutation in the gene are involved in autism spectrum disorder¹⁹; however, the function of ZNF713 and its relationship with IBD are unknown.

WDR31 is a member of the family of WD40 repeat proteins. WD40 repeat proteins belong to a large family observed among all eukaryotes and are involved in various functions, including signal transduction, regulation of transcription, regulation of cell cycle, autophagy, and apoptosis. It is plausible that changes in the expression of members belonging to this family of genes would relate to disease. In fact, WDR30 is also known as ATG16L1, which is a disease susceptibility gene in Western CD patients and is involved in autophagy²⁰. However, the function of WDR31, which showed correlation in this study, is currently unknown, and no relationship with IBD has been reported. Many of these novel candidate genes have unknown functions and unknown relationships with IBD; however, future functional analyses may provide this information. And these associations were only observed in colon, it will be interesting to see associations of these genes with each clinical sub-phenotype (i.e. disease locations) of CD. Further analyses using additional sample set will be needed.

This study showed that multiple correlations could be confirmed with the use of TWAS. Correlation shown by GWAS at the regions of some genes such as ERV3-1 can be lost when a predicted expression level of ERV3-1 was taken into consideration. Therefore, it was indicated that correlation in the region is due to changes in the expression level of ERV3-1. However, correlation of some SNPs in genes such as TNFSF15 does not diminish when a predicted expression level of the genes is taken into consideration; thus, it has been confirmed that some SNPs have correlation regardless of predicted gene expression levels. In fact, it has been demonstrated previously that there are two independent correlations in this region²¹, the result of which are consistent with those found this study. However, how the polymorphisms that showed independent correlations are involved in the disease is unknown. The referenced eQTL data are from the Western population, and there may be vastly distinctive Asian-specific eQTL data. Further research is necessary.

Limitations in this study regarding eQTL are as follows: (1) the sample size was small, (2) only IBD patients who required surgery were studied, and mild IBD patients who did not require surgery were not included in this study, (3) there were differences in inflammation sites and degree of inflammation in surgical specimens, and (4) there were difference in drugs administered before surgery (individual results may be affected by such drugs). Limitations of TWAS are that (6) referenced gene expression data are from a different ethnic group and (7) evaluation of genes induced under specific conditions was not possible. To increase the number of subjects and reduce the effect of medications or severity issue, analyses of biopsy samples at the initial endoscopy will be informative. However, we aimed to establish eQTL dataset of specific cell population in this study, we analyzed surgical specimens. The most serious limitation of our study was we could only see eQTLs of TEM cells in Japanese patients with IBD, because the number of LPMCs, which could be isolated from surgical specimens, was still too few to analyze several immunocompetent cells. The increasing number of samples and cell species of immunocompetent cells and/or adopting new technologies (i.e. single cell analysis) may show more certainly eQTL, although this is a subject for future analysis. However, this study included a functional approach utilizing data regarding function of polymorphisms in addition to existing GWAS, which simply examines whether SNPs are involved in the development of the disease. Factors related to the development of the disease at a gene level in a specific tissue could be predicted. Moreover, the results obtained in this study included genes (TNFS15 and RAP1A) that have shown correlation by functional analyses as candidate genes and thus the usefulness of this approach was shown. Integration analyses using GWAS and eQTL data are considered useful not only for the analysis of disease susceptibility genes but also for analyzing disease-modifying genes that determine the disease state and pharmacogenomics, which involves analysis of drug efficacy and adverse effects. Future analyses are anticipated.

In conclusion, by conducting integration analyses using information regarding polymorphism and transcriptome-related analysis data, we confirmed multiple gene transcripts involved in the development of CD in the Japanese population. The study also indicated that expression of TNFSF15 in blood cells was likely to be involved in the development of CD in the Japanese population.

Materials and Methods

In this study, analyses were processed using the following two approaches to accomplish our objective. First, eQTL analyses were conducted on intestinal TEM cells of Japanese CD patients and disease susceptibility genes were predicted by projecting the function of disease susceptibility polymorphisms in these patients. Second, a TWAS was conducted using data from the existing eQTL database and the GWAS results to analyze the susceptibility genes of Japanese CD patients.

Subjects

For TEM transcriptome analyses, cells were isolated from 18 patients who were in an active phase of CD and nine patients who were in an active phase of UC from a cohort of IBD patients hospitalized in Tohoku University Hospital between July 2015 and July 2018. The studied cohort underwent surgery that involved intestinal resection and consented to research including genetic analysis. The subjects for GWAS were 713 Japanese CD patients who regularly visited either Tohoku University Hospital (379 patients) or Kyushu University Hospital (334 patients) and could be analyzed by previous GWAS of Crohn’s disease⁴. A total of 2,063 healthy individuals who resided in Tohoku (1,621 individuals) or Kyushu (462 individuals) were also studied as controls²². Diagnosis was performed according to the diagnostic criteria proposed by the Japanese Ministry of Health, Labor and Welfare²³, based on clinical symptoms and endoscopic, X-ray, and tissue findings. All subjects were Japanese.

This study was conducted after receiving written consent from subjects and approval from the ethics committee of the School of Medicine at Tohoku University (2017-1-253, 2019-1-161). All methods in this study were performed in accordance with ethical guidelines for medical and health research involving human subjects established by the Ministry of Health, Labour and Welfare in Japan. The demographic profiles of the subjects are shown in Table 3.

Table 3 Patient characteristics.

Full size table

Isolation of LPMCs

LPMCs were isolated from inflammation sites surgically resected from the small intestine or the large intestine according to the method described by Fiocchi et al.^24,25. In brief, a resected specimen was cut lengthwise, and feces were removed by washing the intestine in Hank’s balanced salt solution (HBSS) (Wako, Osaka, Japan). The specimen was then cut into 2–3 cm × 10 cm sections. The sections were then washed in HBSS containing 0.15% dithiothreitol (Wako) for 30 minutes with shaking. The specimens were then washed in HBSS containing 1 mM ethylenediaminetetraacetic acid (Wako) for 90 minutes with shaking. This wash was repeated until the epithelial layer was completely removed. After removing the epithelial layer completely, the specimens were washed again in HBSS with shaking and the washed specimens were finely divided into 5-mm sections. The specimens were then digested in HBSS containing 1 mg/ml collagenase-3 (Worthington Biochemical Corporation, Lakewood, USA) and DNase I (Roche, Basel, Switzerland) at 37 °C for 8–10 hours. The digested specimens were then passed through a 100 μm cell strainer (BD Biosciences, Franklin Lake, USA) and the cell suspension was recovered. The suspension was centrifuged at 700 × g and the cell pellet was resuspended in HBSS. The suspension was overlaid on Ficoll–Hypaque (GE Healthcare, Little Chalfont, UK) and centrifuged for 20 minutes at 1,000 × g. LPMC cells located at the interface between HBSS and Ficoll–Hypaque were recovered.

Isolation of TEM cells and extraction of DNA/RNA

CD4+ T cells were isolated by negative selection from isolated LPMCs using an Easy Sep Magnet (STEMCELL Technology, Vancouver, Canada) and an Easy Sep Human CD4+ T cell Enrichment kit (STEMCELL Technology). Furthermore, the isolated CD4 positive T cells were stained with anti-CD3-FITC, anti-CD4-PE, anti- CD45RO-APC, anti-CD197 (CCR7) -BV421, and 7ADD-Cell Viability Solution (BD Biosciences), followed by isolation of TEM cells using a FACS Aria II cell sorter (BD Biosciences). Sorting efficiency was consistently over 98%. These TEM cells may include a few regulatory T Cells. However, to keep the number of cells to perform RNA sequencing, we used these samples as TEM cells. DNA and total RNA were extracted from isolated TEM cells using an AllPrep DNA/RNA mini kit (QIAGEN, Hilden, Germany).

Genotyping

Transcriptome analysis of subjects by Japonica array V1 (Thermo-Fisher Scientific Inc., Waltham, MA) was contracted to Toshiba Inc. (Tokyo, Japan)²⁶. Affymetrix Power Tools software (Thermo-Fisher Scientific Inc.) was used for genotyping. For genotyping of SNPs that could not be typed by the array, IMPUTE2 (Version 2.3.2) (Center for Statistical Genetics, University of Michigan, USA) was used for performing imputation with the genome reference panel of people from the Tohoku region (2KJPN)^27,28. For genotyping data for the GWAS, data which had undergone analyses by Japonica array VI, imputation by the 1KJPN panel, and quality control (QC) by previous studies were used⁴.

Transcriptome and eQTL analyses

For the total RNA collected from the intestines of 27 IBD patients (18 CD patients, 9 UC patients), QC, library construction, and transcriptome analysis by RNA sequencing were contracted to Macrogen Inc, Japan. QC was performed using TapeStation HighSensitivity RNA ScreenTape (Agilent Technologies, Santa Clara, USA), where the standard was set as RNA integrity number >7. RNA amplification, was performed using SMART Seq V4 Ultra Low Input RNA Kit (Takara Bio, Kusatsu, Japan), following the manufacturer’s protocol. TruSeq Stranded mRNA Library Prep (Illumina, San Diego, USA) was used for library construction. NovaSeq. 6000 (Illumina) was used for RNA sequencing. Processes from alignment to post-treatment of FASTQ data obtained by RNA sequencing was performed using STAR²⁹ and Picard software (http://broadinstitute.github.io/picard/) according to TOPMed RNA-seq pipeline guidelines using the supercomputer system at Tohoku University’ medical-megabank institute. The consistency of RNA/DNA samples was confirmed by comparing RNA sequence data and genotype of genomic DNA. Samples with insufficient data or a low number of reads were excluded, which resulted in 15 active-phase CD patients and five active-phase ulcerative colitis patients for expression analysis. The number of reads of each transcript was calculated using the featureCounts (Ver 1.6.4)³⁰ and were standardized against entire transcripts using edgeR (Ver 3.20.9)³¹. eQTL analysis and standardization at the gene level were performed using FastQTL (Ver 2.184) with–normal option³².

GWAS

The GWAS data were analyzed with a linear mixed model. Genome-wide Complex Trait Analysis software (Ver 1.91.7b1) was used for the analysis³³, where 7,424,691 polymorphisms with minor allele frequencies of over 0.5% were analyzed among the 16,919,636 polymorphisms input.

TWAS

FUSION software was used for the TWAS⁶. The data for analysis consisted of RNA sequence data from whole blood, Epstein–Barr virus (EBV)-immortalized B cells, the transverse colon, the sigmoidal colon, and the small intestine (the ileum terminal), as these tissues are considered, among GTEx V7 data released in GTEx⁵, to be highly related to IBD.

Statistical analysis

In eQTL analysis, samples showing p values less than 1e-06 were considered significant correlation. p values under 1e-04 were considered as candidates and were used for further analyses. In GWAS, polymorphisms showing p < 5e-08 in the linear mixed model were considered to be significant and those with p values under 1e-05 were considered to be candidates. Polymorphisms showing correlation within 500 kbps upstream and downstream of the polymorphism were considered cases of correlation at the same region. For TWAS, the genes with false discovery rate (FDR) of <0.05 were considered to be susceptibility genes, and genes with FDR < 0.10 were considered to be candidates. Data obtained from each analysis was further analyzed using R software (Ver 3.4.4). Supplementary Table S7 shows the eQTL data set analyzed in this study.

References

Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124, https://doi.org/10.1038/nature11582 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979–986, https://doi.org/10.1038/ng.3359 (2015).
Article CAS PubMed PubMed Central Google Scholar
Huang, H. et al. Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature 547, 173–178, https://doi.org/10.1038/nature22969 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Kakuta, Y. et al. A Genome-wide Association Study Identifying RAP1A as a Novel Susceptibility Gene for Crohn’s Disease in Japanese Individuals. J. Crohns Colitis 13, 648–658, https://doi.org/10.1093/ecco-jcc/jjy197 (2019).
Article PubMed Google Scholar
Carithers, L. J. et al. A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project. Biopreserv Biobank 13, 311–319, https://doi.org/10.1089/bio.2015.0032 (2015).
Article PubMed PubMed Central Google Scholar
Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet 48, 245–252, https://doi.org/10.1038/ng.3506 (2016).
Article CAS PubMed PubMed Central Google Scholar
Powrie, F., Coffman, R. L. & Correa-Oliveira, R. Transfer of CD4+ T cells to C.B-17 SCID mice: a model to study Th1 and Th2 cell differentiation and regulation in vivo. Res Immunol 145, 347–353 (1994).
Article CAS PubMed Google Scholar
Monteleone, G. & Caprioli, F. T-cell-directed therapies in inflammatory bowel diseases. Clin Sci (Lond) 118, 707–715, https://doi.org/10.1042/CS20100027 (2010).
Article CAS Google Scholar
Fiorino, G. & Omodei, P. D. Psoriasis and Inflammatory Bowel Disease: Two Sides of the Same Coin? J Crohns Colitis 9, 697–698, https://doi.org/10.1093/ecco-jcc/jjv110 (2015).
Article PubMed Google Scholar
Sonnenberg, A. & Ajdacic-Gross, V. Similar birth-cohort patterns in Crohn’s disease and multiple sclerosis. Mult Scler 24, 140–149, https://doi.org/10.1177/1352458517691620 (2018).
Article PubMed Google Scholar
Townsend, P. et al. Serum Proteome Profiles in Stricturing Crohn’s Disease: A Pilot Study. Inflamm Bowel Dis 21, 1935–1941, https://doi.org/10.1097/MIB.0000000000000445 (2015).
Article PubMed Google Scholar
Bamias, G. et al. Expression, localization, and functional activity of TL1A, a novel Th1-polarizing cytokine in inflammatory bowel disease. J Immunol 171, 4868–4874, https://doi.org/10.4049/jimmunol.171.9.4868 (2003).
Article CAS PubMed Google Scholar
Takedatsu, H. et al. TL1A (TNFSF15) regulates the development of chronic colitis by modulating both T-helper 1 and T-helper 17 activation. Gastroenterology 135, 552–567, https://doi.org/10.1053/j.gastro.2008.04.037 (2008).
Article CAS PubMed Google Scholar
Kakuta, Y. et al. TNFSF15 transcripts from risk haplotype for Crohn’s disease are overexpressed in stimulated T cells. Hum Mol Genet 18, 1089–1098, https://doi.org/10.1093/hmg/ddp005 (2009).
Article CAS PubMed Google Scholar
Michelsen, K. S. et al. IBD-associated TL1A gene (TNFSF15) haplotypes determine increased expression of TL1A protein. PLoS One 4, e4719, https://doi.org/10.1371/journal.pone.0004719 (2009).
Article CAS PubMed PubMed Central ADS Google Scholar
Kakuta, Y., Kinouchi, Y., Negoro, K., Takahashi, S. & Shimosegawa, T. Association study of TNFSF15 polymorphisms in Japanese patients with inflammatory bowel disease. Gut 55, 1527–1528, https://doi.org/10.1136/gut.2006.100297 (2006).
Article CAS PubMed PubMed Central Google Scholar
Yamazaki, K. et al. Single nucleotide polymorphisms in TNFSF15 confer susceptibility to Crohn’s disease. Hum Mol Genet 14, 3499–3506, https://doi.org/10.1093/hmg/ddi379 (2005).
Article CAS PubMed Google Scholar
Niocel, M., Appourchaux, R., Nguyen, X. N., Delpeuch, M. & Cimarelli, A. The DNA damage induced by the Cytosine Deaminase APOBEC3A Leads to the production of ROS. Sci Rep 9, 4714, https://doi.org/10.1038/s41598-019-40941-8 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Metsu, S. et al. A CGG-repeat expansion mutation in ZNF713 causes FRA7A: association with autistic spectrum disorder in two families. Human mutation 35, 1295–1300, https://doi.org/10.1002/humu.22683 (2014).
Article CAS PubMed Google Scholar
Hampe, J. et al. A genome-wide association scan of nonsynonymous SNPs identifies a susceptibility variant for Crohn disease in ATG16L1. Nat Genet 39, 207–211, https://doi.org/10.1038/ng1954 (2007).
Article CAS PubMed Google Scholar
Kakuta, Y. et al. Rare Variants of <em>TNFSF15</em> Are Significantly Associated With Crohn’s Disease in Non-Jewish Caucasian Independent of the Known Common Susceptibility SNPs. Gastroenterology 144, S–466, https://doi.org/10.1016/S0016-5085(13)61723-0 (2013).
Kuriyama, S. et al. The Tohoku Medical Megabank Project: Design and Mission. J Epidemiol 26, 493–511, https://doi.org/10.2188/jea.JE20150268 (2016).
Article PubMed Google Scholar
Matsui T.H. F., Hisabe T. Proposed diagnostic criteria for Crohn’s disease. Annual reports of the research group of intractable inflammatory bowel disease granted by the Minitry of Health, Labour, and Welfare of Japan., 52–54 (2011).
Fiocchi, A. et al. World Allergy Organization-McMaster University Guidelines for Allergic Disease Prevention (GLAD-P): Probiotics. World Allergy Organ J 8, 4, https://doi.org/10.1186/s40413-015-0055-2 (2015).
Article PubMed PubMed Central Google Scholar
Fiocchi, C. & Youngman, K. R. Isolation of human intestinal mucosal mononuclear cells. Curr Protoc Immunol Chapter 7, Unit 7 30, https://doi.org/10.1002/0471142735.im0730s19 (2001).
Kawai, Y. et al. Japonica array: improved genotype imputation by designing a population-specific SNP array with 1070 Japanese individuals. J Hum Genet 60, 581–587, https://doi.org/10.1038/jhg.2015.68 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nagasaki, M. et al. Rare variant discovery by deep whole-genome sequencing of 1,070 Japanese individuals. Nat Commun 6, 8018, https://doi.org/10.1038/ncomms9018 (2015).
Article CAS PubMed Google Scholar
Yamaguchi-Kabata, Y. et al. iJGVD: an integrative Japanese genome variation database based on whole-genome sequencing. Human genome variation 2, 15050, https://doi.org/10.1038/hgv.2015.50 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21, https://doi.org/10.1093/bioinformatics/bts635 (2013).
Article CAS PubMed Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930, https://doi.org/10.1093/bioinformatics/btt656 (2014).
Article CAS PubMed Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140, https://doi.org/10.1093/bioinformatics/btp616 (2010).
Article CAS PubMed Google Scholar
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485, https://doi.org/10.1093/bioinformatics/btv722 (2016).
Article CAS PubMed Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88, 76–82, https://doi.org/10.1016/j.ajhg.2010.11.011 (2011).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by JSPS KAKENHI Grant Numbers JP15H04805, 18K07929 and in part by the Tohoku Medical Megabank Project (Special Account for reconstruction from the Great East Japan Earthquake). SNP genotyping was supported in part by the Center of Innovation Program from the Japan Science and Technology Agency, JST. Computational resources were provided in part by the ToMMo supercomputer system.

Author information

These authors contributed equally: Yoichi Kakuta and Ryo Ichikawa.

Authors and Affiliations

Division of Gastroenterology, Tohoku University Graduate School of Medicine, Sendai, Japan
Yoichi Kakuta, Ryo Ichikawa, Takeru Nakano, Yasuhiro Izumiyama, Daisuke Okamoto, Takeo Naito, Rintaro Moroi, Masatake Kuroha, Yoshitake Kanazawa, Tomoya Kimura, Hisashi Shiga & Atsushi Masamune
Department of Medicine and Clinical Science, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
Yuta Fuyuno, Atsushi Hirano, Junji Umeno, Takehiro Torisu, Motohiro Esaki & Takayuki Matsumoto
Department of Surgery, Tohoku University Graduate School of Medicine, Sendai, Japan
Kazuhiro Watanabe, Takeshi Naito & Michiaki Unno
Human Biosciences Unit for the Top Global Course Center for the Promotion of Interdisciplinary Education and Research (CPIER), Kyoto University, Kyoto, Japan
Akihiro Asakura & Masao Nagasaki
Department of Endoscopic Diagnostics and Therapeutics, Saga University Hospital, Saga, Japan
Motohiro Esaki
Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
Yosuke Kawai & Katsushi Tokunaga
Clinical Research Center, National Hospital Organisation (NHO) Nagasaki Medical Center, Omura, Japan
Minoru Nakamura
Division of Gastroenterology, Department of Internal Medicine, Faculty of Medicine, Iwate Medical University, Morioka, Japan
Takayuki Matsumoto
Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Masao Nagasaki
Student Health Care Center, Institute for Excellence in Higher Education, Tohoku University, Sendai, Japan
Yoshitaka Kinouchi

Authors

Yoichi Kakuta
View author publications
You can also search for this author in PubMed Google Scholar
Ryo Ichikawa
View author publications
You can also search for this author in PubMed Google Scholar
Yuta Fuyuno
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Hirano
View author publications
You can also search for this author in PubMed Google Scholar
Junji Umeno
View author publications
You can also search for this author in PubMed Google Scholar
Takehiro Torisu
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Watanabe
View author publications
You can also search for this author in PubMed Google Scholar
Akihiro Asakura
View author publications
You can also search for this author in PubMed Google Scholar
Takeru Nakano
View author publications
You can also search for this author in PubMed Google Scholar
Yasuhiro Izumiyama
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Okamoto
View author publications
You can also search for this author in PubMed Google Scholar
Takeo Naito
View author publications
You can also search for this author in PubMed Google Scholar
Rintaro Moroi
View author publications
You can also search for this author in PubMed Google Scholar
Masatake Kuroha
View author publications
You can also search for this author in PubMed Google Scholar
Yoshitake Kanazawa
View author publications
You can also search for this author in PubMed Google Scholar
Tomoya Kimura
View author publications
You can also search for this author in PubMed Google Scholar
Hisashi Shiga
View author publications
You can also search for this author in PubMed Google Scholar
Takeshi Naito
View author publications
You can also search for this author in PubMed Google Scholar
Motohiro Esaki
View author publications
You can also search for this author in PubMed Google Scholar
Yosuke Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Katsushi Tokunaga
View author publications
You can also search for this author in PubMed Google Scholar
Minoru Nakamura
View author publications
You can also search for this author in PubMed Google Scholar
Takayuki Matsumoto
View author publications
You can also search for this author in PubMed Google Scholar
Masao Nagasaki
View author publications
You can also search for this author in PubMed Google Scholar
Yoshitaka Kinouchi
View author publications
You can also search for this author in PubMed Google Scholar
Michiaki Unno
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Masamune
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y. Kakuta, R.I., M. Nagasaki and Y. Kinouchi designed the study. R.I., Takeo Naito, Y. Kawai, K.T., A.A. and Y. Kakuta acquired data. A.H., J.U., Y.F., T.T., T. Nakano, Y.I., R.I., D.O., R.M., M.K., H.S., Y. Kanazawa, T.K., M. Nakamura, K.W., Takeshi Naito, M.U., T.M. and M.E. recruited patients. R.I., Y. Kakuta, Y. Kawai and M. Nagasaki analysed data. Y. Kakuta, R.I., Y. Kinouchi, A.M., drafted the manuscript.

Corresponding author

Correspondence to Yoichi Kakuta.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information.

Supplementary information2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kakuta, Y., Ichikawa, R., Fuyuno, Y. et al. An Integrated Genomic and Transcriptomic Analysis Reveals Candidates of Susceptibility Genes for Crohn’s Disease in Japanese Populations. Sci Rep 10, 10236 (2020). https://doi.org/10.1038/s41598-020-66951-5

Download citation

Received: 12 September 2019
Accepted: 01 June 2020
Published: 24 June 2020
DOI: https://doi.org/10.1038/s41598-020-66951-5

This article is cited by

Transcriptome-wide association studies associated with Crohn’s disease: challenges and perspectives
- Keyu Jia
- Jun Shen
Cell & Bioscience (2024)
Long non-coding RNA RP11-342L8.2, derived from RNA sequencing and validated via RT-qPCR, is upregulated and correlates with disease severity in psoriasis patients
- Yuanting Zhi
- Jiru Du
- Ningjing Song
Irish Journal of Medical Science (1971 -) (2022)
A joint analysis using exome and transcriptome data identifies candidate polymorphisms and genes involved with umbilical hernia in pigs
- Igor Ricardo Savoldi
- Adriana Mércia Guaratini Ibelli
- Mônica Corrêa Ledur
BMC Genomics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

In LPMC-derived TEM cells, eQTL of 2,463 genes at 22,632 regions were identified

Twenty-five sites were confirmed as candidates correlated with Japanese CD by GWAS

Correlation between Japanese CD and TNXA based on GWAS and eQTL results was assessed

Six novel genes were identified by TWAS in addition to the previously reported TNFSF15 and RAP1A

Discussion

Materials and Methods

Subjects

Isolation of LPMCs

Isolation of TEM cells and extraction of DNA/RNA

Genotyping

Transcriptome and eQTL analyses

GWAS

TWAS

Statistical analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links