Association between fruit intake and non-small cell lung cancer: a Mendelian randomization study
Highlight box
Key findings
• Our Mendelian randomization (MR) study shows a significant protective effect of dried fruit intake against non-small cell lung cancer, with an odds ratio of 0.32 (P=0.003). In contrast, fresh fruit intake did not demonstrate a statistically significant association.
What is known and what is new?
• Previous studies have established a connection between fruit consumption and various health outcomes, indicating potential benefits in reducing cancer risks.
• This manuscript employs a MR approach, providing genetic-level evidence that highlights a significant association between dried fruit consumption and reduced lung cancer risk, while fresh fruit intake showed no significant impact. Clinically, these findings suggest that promoting dried fruit consumption may be an effective strategy for lung cancer prevention, warranting further exploration in dietary guidelines.
What is the implication, and what should change now?
• The findings suggest that incorporating dried fruits into dietary recommendations may have significant implications for lung cancer prevention. Future research should further investigate the mechanisms through which dried fruits confer these benefits and consider expanding studies to diverse populations.
Introduction
Lung cancer is known as one of the important reasons of cancer death. According to the latest cancer statistics in the United State, the incidence of lung cancer among adults under the age of 50 years old is increasing by 2% per year (1). That means that lung cancer is occurring at an increasingly younger age. This concerning trend accentuates the urgency of early preventative measures, including lifestyle modifications and dietary interventions, particularly targeting the younger population to mitigate the susceptibility to lung cancer.
Some studies have shed light on the potential anticancer properties of certain constituents found in fruits, which have spurred interest in their role in cancer risk reduction (2,3). Notably, a diet rich in fruits has been associated with lung cancer prevention owing to the abundance of antioxidants present in these natural sources. These antioxidants, encompassing compounds like retinol, ellagic acid, and carotenoids, exhibit protective attributes against the development of lung cancer (4). Furthermore, the bioactive compounds present in fruits have demonstrated significant regulatory capabilities over intracellular enzyme reactions, thus influencing the survival pathways of cancer cells (5). Empirical evidence from foundational studies and case-control trials has also underscored the potential of heightened fruit consumption in diminishing the risk of lung cancer (5,6).
Although several studies have reported a correlation between fruit consumption and the incidence of lung cancer, it is essential to acknowledge the inherent methodological limitations. These limitations include confounding factors that may distort the association between fruit intake and disease development, as well as the potential for reverse causation (7). On the other side, these investigations did not provide any evidence suggesting a divergence in the effects of consuming dried fruit versus fresh fruit. These limitations highlight the need for alternative approaches, such as the utilization of genetic methodologies, which furnish a more robust avenue for causal analysis. In this regard, the utilization of Mendelian randomization (MR) emerges as a promising solution.
Therefore, we opted to employ the MR approach, which harnesses genetic variants as instrumental variables (IVs) for the exposure variable—in this case, dried fruit consumption—to establish causal inferences (8). Given the absence of randomized controlled trials (RCTs) or prospective studies, the MR framework assumes a pivotal role in facilitating causal inference. This innovative strategy capitalizes on the random assortment of genetic variants during meiosis, thereby mimicking the core principles of an RCT (9). Through this exploitation of inherent genetic randomness, the MR design provides valuable insights into causal relationships.
In our study, we performed MR analyses to analyze the causal association between fruit intake and the risk of non-small cell lung cancer (NSCLC). In addition, we categorized fruits and analyzed the effect of fresh and dried fruit intake on NSCLC risk separately. By employing this approach, we were able to evaluate the genetic evidence supporting a decrease in NSCLC risk with higher daily fruit intake. We present this article in accordance with the STROBE-MR reporting checklist (available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-24-276/rc).
Methods
Study overview
MR studies rely on the following three fundamental assumptions: (I) the genetic variants utilized as IVs should exhibit a robust association with fruit intake; (II) the genetic variants employed as IVs should not demonstrate any correlation with confounding factors that are associated with the link between fruit intake and NSCLC; and (III) the genetic variants should solely influence the risk of NSCLC through their impact on fruit intake (Figure 1) (10). This study was conceived using the summary-level data sourced from publicly accessible genome-wide association studies (GWAS). The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Data source
We obtained data on fruit intake from analyzed version of Medical Research Council Integrative Epidemiology Unit (MRC-IEU) based on UK Biobank. The dataset of fresh fruit intake was ukb-b-3881 (sample size for fresh fruit intake =446,462) while the dried fruit intake was ukb-b-16576 (sample size for dried fruit intake =421,764). The actual intake of fresh fruit was assessed by asking, “About how many pieces of FRESH fruit do you eat per DAY? (Count one apple, one banana, 10 grapes, etc., as one piece; put ‘0’ if you do not eat any)”. The intake of dried fruit was measured with the question, “About how many pieces of DRIED fruit do you eat per DAY? (Count one prune, one dried apricot, 10 raisins, etc., as one piece; put ‘0’ if you do not eat any)”. Quality control identified responses reporting consumption of more than 50 pieces of fresh fruit or more than 100 pieces of dried fruit as invalid.
GWAS summary statistics for NSCLC were obtained from the FinnGen consortium R8 release (11). The dataset named “non-small cell lung cancer” was used in this study. This GWAS included 263,448 (3,865 cases versus 259,583 controls) European adult male and female subjects.
Instrument selection criteria
In a GWAS with 446,462 participants of European origin, we set genome-wide significance thresholds (P<5×10−8), while excluding single nucleotide polymorphisms (SNPs) with the threshold for linkage disequilibrium (LD) set r2>0.001 and clump distance <10,000 kb (12,13). As a result, we identified 49 SNPs as IVs for fresh fruit intake. Additionally, we selected 36 SNPs from a GWAS involving 421,764 participants of European descent as IVs for dried fruit intake. The use of genome-wide significance thresholds served as a rigorous criterion to establish the association of SNPs with their respective exposures, ensuring a high degree of statistical confidence. The information of the filtered SNPs can be found in tables available at https://cdn.amegroups.cn/static/public/tlcr-24-276-1.zip.
Statistical analysis
In this study, various MR methods were employed, included inverse variance weighted (IVW), MR-Egger regression, weighted median, to investigate the potential genetic causal relationship between fruit intake and NSCLC. The IVW method used a meta-analysis approach, combining Wald estimates for each SNP, to derive an overall estimation of the impact of fruit intake on NSCLC. If horizontal pleiotropy did not exist, the result of IVW would be unbiased. Analyzing Wald estimates for each SNP and combining them, to derive an overall estimation of the effect of fruit intake on NSCLC (14). The MR-Egger method, basing on the assumption of InSIDE (Instrument Strength Independent of Direct Effect), represents an approach for estimating consistent causal effects. It can also have directed multidirectional effects. However, MR-Egger method may cause the bias of results and increase Type 1 error rate. That is because this method was easily effected by violations of the InSIDE assumption (15). While the MR-Egger approach is acknowledged for its relative resilience against pleiotropy, it is important to acknowledge that it may be influenced by reduced statistical power, thereby warranting careful interpretation.
We evaluated the deviation of the MR-Egger intercept from zero to assess the existence of horizontal pleiotropy, allowing us to compare the IVW results with MR-Egger regression results. The MR-PRESSO (Mendelian Randomization Pleiotropy RESidual Sum and Outlier) analysis, a sophisticated technique utilized in this study, adeptly identifies and endeavors to mitigate the impact of horizontal pleiotropy through the identification and removal of noteworthy outliers. This meticulous approach enhances the robustness and reliability of the findings by effectively addressing potential confounding factors (16).
To evaluate potential heterogeneity within our study, we performed a Cochran’s Q test using the inverse-variance weighted model and MR-Egger model. Additionally, to verify causality and ensure the correct directionality, we employed the MR Steiger directionality test.
The robustness of IVs was evaluated through the calculation of the F-statistic, employing the formula F = R2 × (N−2)/(1−R2), wherein R2 signifies the proportion of variance in the exposure variable explained by the genetic variants and N denotes the sample size. A F-statistic greater than 10 (F-statistic >10) was deemed indicative of the absence of significant weak instrumental bias, thereby bolstering the validity of the IV analysis (17).
In addition to conducting the aforementioned analyses, we employed various visualization techniques to present our findings. Forest plots, scatter plots, leave-one-out plots, and funnel plots were utilized to visualize the results of our analyses. The forest plot allows for the visualization of the effects of each SNP on the outcomes. Additionally, we used MR-Steiger method to infer the causal direction between two phenotypes.
All intricate statistical analyses were conducted utilizing the R version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria), a powerful platform renowned for its robust capabilities. For the MR analyses, the tools of two-sample MR (version 0.5.6) and MR-PRESSO (version 1.0) were employed.
Results
The outcomes of our MR study, as presented in Figure 2, delineate notable insights. Specifically, the odds ratios (ORs) were 0.74 [95% confidence interval (CI): 0.31, 1.79; P=0.50] for fresh fruit intake, 0.32 (95% CI: 0.15, 0.67; P=0.003) for dried fruit intake. This discernible disparity underscored a distinctive pattern—while dried fruit consumption appeared to confer a protective effect against NSCLC, fresh fruit intake failed to manifest a statistically significant influence.
Importantly, we did not found evidence of heterogeneity in the dried fruit intake. Although a visual examination of scatter plots (Figure 3) did identify certain IVs associated with dried fruit consumption as potential outliers, it is noteworthy that the preponderance of data points closely adheres to the fitted linear regression line.
Additionally, the comprehensive MR-PRESSO analysis revealed no compelling evidence of significant outliers (global test P>0.05) within the association between dried fruit intake and NSCLC. Consequently, these findings collectively suggest a paucity of substantial support for the existence of horizontal pleiotropy in this specific association.
We did not found evidence of heterogeneity in the dried fruit intakes (PCochran’s Q>0.05), indicating that the estimation of fixed-effects IVW is convincing (Tables S1-S6). Furthermore, the outcomes of the F-statistic showed the statistical robustness inherent in the entirety of the selected SNPs, as shown in table available at https://cdn.amegroups.cn/static/public/tlcr-24-276-2.pdf.
In the MR Steiger directionality test, we discovered strong directional causality in both fruit analyses. The result of MR Steiger showed there might be a low risk of reverse causation, thus reinforcing the integrity of our findings in a causal framework.
The results of leave-one-out plots and funnel plots were shown in Figure S1 and Figure S2.
Discussion
In summary, our investigation employed a two-sample MR approach, leveraging data from the UK Biobank and a comprehensive GWAS encompassing over 218,000 individuals, to explore the potential relationship between fruit consumption and lung cancer risk. Our findings reveal compelling evidence suggesting a significant and noteworthy reduction in lung cancer risk associated with dried fruit intake. Intriguingly, this effect was not observed with fresh fruit intake, where the association did not reach statistical significance. These results emphasize the potential health benefits of dried fruit consumption in reducing lung cancer risk, shedding light on a specific aspect of fruit consumption that warrants further attention.
Dried fruits, abundant in diverse bioactive compounds endowed with potent antioxidant attributes, emerge as potential sources of substantial health benefits (4). Certain types of dried fruits have the potential to modulate the composition of the human gut microbiota in a beneficial manner (18). This presents a potentially significant pathway through which dried fruits may impact the development of NSCLC.
However, it is noteworthy that there are only a paucity of clinical investigations exclusively focusing on dried fruit consumption and its association with lung cancer. An observational study involving 34,192 participants from the California Seventh-day Adventist population revealed a significant inverse association between increased dried fruit consumption and the risk of developing lung cancer (7). Two studies conducted by Jin et al. and Yan et al. using MR further support the protective effect of dried fruits against lung cancer development (19,20). It is important to note that these prior studies analyzed data encompassing multiple subtypes of lung cancer, which introduces a potential bias in the classification of these subtypes. In our analysis, we included a larger cohort of 218,792 subjects, thereby surpassing the sample sizes of the aforementioned studies.
In a prospective trial involving 92,327 participants, the Experimenter Gateway conducted an analysis to investigate the potential relationship between 92 dietary factors and the risk of developing lung cancer. The results of that study suggest that there may be a negative association between the consumption of fruit and vitamin C and the incidence of squamous cell lung cancer (21). This study may also shed light on potential distinctions in the impact of fresh fruit consumption compared to dried fruit consumption on NSCLC.
Our study has several methodological strengths. Firstly, the MR approach minimizes the presence of reverse causality and residual confounding, which are inherent issues in observational studies. Moreover, by solely considering individuals of European population structure and adjusting for it in the GWASs, population stratification bias was effectively reduced in our study. The selection of SNPs and the utilization of summary data from NSCLC were specifically based on this population group. Additionally, the consistent results obtained from various sensitivity analyses, along with the absence of evidence indicating the presence of horizontal pleiotropy, further support the robustness of our study’s findings. These findings specifically pertain to the association between fruit consumption and NSCLC, suggesting that the observed results are unlikely to be influenced by horizontal pleiotropy.
However, certain limitations warrant consideration. Firstly, despite we used some manners to control, there is still a possibility of unmeasured confounding due to IVs. Secondly, the validity of many IVs relies on the assumption of monotonicity. In our case, the IV is a genetic variant identified from the UK Biobank. While we have information regarding the size of this cohort, specific stratifications and characteristics of this subgroup remain unknown. The study population was restricted to the European, thus caution should be exercised when generalizing the results to other regions.
Conclusions
In summary, our study offers compelling and causally suggestive evidence that genetically predicted dried fruit consumption is associated with a lowered risk of lung cancer. However, our results do not support the relationship between fresh fruit intake and lung cancer. The findings highlight the potential significance of increasing dried fruit intake as an important dietary modification in the prevention of lung cancer in the future.
Acknowledgments
Funding: This study was supported by
Footnote
Reporting Checklist: The authors have completed the STROBE-MR reporting checklist. Available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-24-276/rc
Peer Review File: Available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-24-276/prf
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-24-276/coif). The authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Siegel RL, Miller KD, Wagle NS, et al. Cancer statistics, 2023. CA Cancer J Clin 2023;73:17-48. [Crossref] [PubMed]
- Hseu YC, Huang YC, Thiyagarajan V, et al. Anticancer activities of chalcone flavokawain B from Alpinia pricei Hayata in human lung adenocarcinoma (A549) cells via induction of reactive oxygen species-mediated apoptotic and autophagic cell death. J Cell Physiol 2019;234:17514-26. [Crossref] [PubMed]
- Wang X, Parvathaneni V, Shukla SK, et al. Inhalable resveratrol-cyclodextrin complex loaded biodegradable nanoparticles for enhanced efficacy against non-small cell lung cancer. Int J Biol Macromol 2020;164:638-50. [Crossref] [PubMed]
- Abar L, Vieira AR, Aune D, et al. Blood concentrations of carotenoids and retinol and lung cancer risk: an update of the WCRF-AICR systematic review of published prospective studies. Cancer Med 2016;5:2069-83. [Crossref] [PubMed]
- Amararathna M, Johnston MR, Rupasinghe HP. Plant Polyphenols as Chemopreventive Agents for Lung Cancer. Int J Mol Sci 2016;17:1352. [Crossref] [PubMed]
- Yin X, Lai GGY, Seow A, et al. Dietary factors and the risk of lung cancer by epidermal growth factor receptor mutation status and histological subtypes. Front Public Health 2022;10:1079543. [Crossref] [PubMed]
- Fraser GE. Associations between diet and cancer, ischemic heart disease, and all-cause mortality in non-Hispanic white California Seventh-day Adventists. Am J Clin Nutr 1999;70:532S-8S. [Crossref] [PubMed]
- Larsson SC. Mendelian randomization as a tool for causal inference in human nutrition and metabolism. Curr Opin Lipidol 2021;32:1-8. [Crossref] [PubMed]
- Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet 2014;23:R89-98. [Crossref] [PubMed]
- Lawlor DA. Commentary: Two-sample Mendelian randomization: opportunities and challenges. Int J Epidemiol 2016;45:908-15. [Crossref] [PubMed]
- Kurki MI, Karjalainen J, Palta P, et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 2023;613:508-18. [Crossref] [PubMed]
- Turley P, Walters RK, Maghzian O, et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet 2018;50:229-37. [Crossref] [PubMed]
- Hemani G, Zheng J, Elsworth B, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 2018;7:e34408. [Crossref] [PubMed]
- Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol 2013;37:658-65. [Crossref] [PubMed]
- Burgess S, Thompson SG. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol 2017;32:377-89. [Crossref] [PubMed]
- Verbanck M, Chen CY, Neale B, et al. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet 2018;50:693-8. [Crossref] [PubMed]
- Burgess S, Thompson SG. Avoiding bias from weak instruments in Mendelian randomization studies. Int J Epidemiol 2011;40:755-64. [Crossref] [PubMed]
- Alasalvar C, Chang SK, Kris-Etherton PM, et al. Dried Fruits: Bioactives, Effects on Gut Microbiota, and Possible Health Benefits-An Update. Nutrients 2023;15:1611. [Crossref] [PubMed]
- Jin C, Li R, Deng T, et al. Association between dried fruit intake and pan-cancers incidence risk: A two-sample Mendelian randomization study. Front Nutr 2022;9:899137. [Crossref] [PubMed]
- Yan H, Jin X, Zhang C, et al. Associations between diet and incidence risk of lung cancer: A Mendelian randomization study. Front Nutr 2023;10:1149317. [Crossref] [PubMed]
- Heath AK, Muller DC, van den Brandt PA, et al. Diet-wide association study of 92 foods and nutrients and lung cancer risk in the European Prospective Investigation into Cancer and Nutrition study and the Netherlands Cohort Study. Int J Cancer 2022;151:1935-46. [Crossref] [PubMed]