Basigin

Introduction

Basigin, also known as CD147 or EMMPRIN (Extracellular Matrix Metalloproteinase Inducer), is a highly glycosylated transmembrane protein belonging to the immunoglobulin superfamily. It is widely expressed on the surface of various cell types, including immune cells, epithelial cells, and tumor cells.

Biological Basis

Biologically, basigin plays a crucial role in regulating cell-cell interactions, cell adhesion, and signal transduction. Its most notable function is the induction of matrix metalloproteinases (MMPs) in surrounding stromal cells, which are enzymes involved in the degradation of the extracellular matrix. This activity facilitates tissue remodeling and cell migration. Basigin is also involved in various physiological processes such as brain development, spermatogenesis, and immune cell activation.

Clinical Relevance

The widespread expression and diverse functions of basigin contribute to its significant clinical relevance. It is implicated in the progression of several pathological conditions, particularly various types of cancer. In oncology, basigin promotes tumor growth, invasion, metastasis, and angiogenesis by inducing MMPs, thus facilitating the breakdown of tissue barriers and the formation of new blood vessels to support tumor development. Beyond cancer, basigin is also associated with inflammatory processes and has been identified as a receptor for certain viral infections, highlighting its role in host-pathogen interactions.

Given its involvement in critical physiological processes and its contribution to major diseases like cancer and inflammatory conditions, basigin holds considerable social importance. Research into basigin's functions and pathways offers potential avenues for the development of novel diagnostic markers and therapeutic strategies. Targeting basigin could lead to new treatments for aggressive cancers, reduce inflammation in chronic diseases, and potentially offer antiviral interventions, thereby improving human health and quality of life.

Methodological and Statistical Constraints

Genetic studies of various human traits are subject to several methodological and statistical limitations that can influence the interpretation and reliability of findings. Moderate cohort sizes in some studies limited the statistical power to detect modest genetic associations, increasing the risk of false negative findings for variants with smaller effect sizes. ^{[1], [2]} Conversely, the extensive multiple testing inherent in genome-wide association studies (GWAS) necessitates stringent statistical thresholds to control for false positives, which can further reduce the ability to identify weaker, yet potentially significant, genetic signals. ^{[1], [2]} To manage the multiple testing burden, some analyses were performed in a sex-pooled manner, potentially overlooking sex-specific genetic associations that may be unique to males or females. ^[3]

The reliance on imputation to infer genotypes for unassayed single nucleotide polymorphisms (SNPs), particularly when different marker sets were used across studies, introduced a potential for error in genotype calls, typically estimated to be between 1.46% and 2.14% per allele. ^[4] Furthermore, the estimation of effect sizes and the proportion of variance explained by genetic variants could be influenced by the methods used for phenotype assessment, such as averaging repeated observations per individual or using observations from monozygotic twins. These varied approaches may not directly reflect population-level effects without careful adjustment, potentially leading to discrepancies in reported effect sizes across different studies. ^[5] The ultimate validation of genetic findings often relies on replication in independent cohorts, a process that faced limitations due to partial genetic variation coverage in some initial studies, hindering the ability to confirm previously reported associations. ^{[1], [2]}

Generalizability and Genetic Coverage

A significant limitation in the generalizability of genetic findings stems from the predominant focus on populations of European ancestry, including Caucasian individuals and those from HapMap CEU reference panels. ^{[4], [6], [7]} This demographic specificity means that genetic associations identified may not be directly transferable or have the same effect sizes and allele frequencies in populations with different ancestral backgrounds. This limitation restricts the broader applicability of the discoveries and underscores the need for more diverse cohorts to ensure equitable understanding of genetic influences across global populations.

Many initial GWAS were conducted using genotyping arrays that covered only a subset of all known SNPs from resources like HapMap, leading to incomplete genetic variation coverage across the genome. ^{[2], [3]} This limited coverage implies that some causal genes or variants may have been missed entirely, and the available GWAS data were often insufficient for comprehensive investigation of candidate genes. ^[3] Additionally, the assays were primarily designed for SNPs, meaning that other types of genetic variants, such as non-SNP variants or structural variations, were not routinely captured, potentially leaving important genetic contributions to traits undiscovered. ^[1]

Environmental and Context-Specific Influences

A notable gap in understanding the full impact of genetic variants on various traits is the limited investigation into gene-environment (GxE) interactions. Genetic effects on phenotypes are often modulated by environmental factors, such as dietary intake, and the absence of such analyses means that the context-specific influences on genetic associations remain largely unexplored. ^[2] This leads to an incomplete picture of disease etiology and trait variability, as the penetrance and expressivity of genetic predispositions can be significantly altered by an individual's environment.

Beyond GxE interactions, several studies acknowledged specific environmental and physiological factors that could confound phenotype measurements, such as the time of day blood samples were collected or an individual's menopausal status. These factors are known to influence various biomarker levels, including serum markers for iron status. ^[5] While some studies attempted to adjust for such variables in their statistical models, their pervasive influence highlights the inherent complexity of isolating purely genetic effects and underscores the need for meticulous phenotypic characterization and comprehensive covariate adjustment to minimize bias. ^[5]

Variants

_BSG_ (Basigin), also known as CD147, is a crucial transmembrane glycoprotein involved in numerous cellular functions, including cell adhesion, migration, and the induction of matrix metalloproteinases essential for tissue remodeling. It also facilitates glucose transport as part of the monocarboxylate transporter complex, highlighting its broad impact on cell physiology. Genetic variants in genes influencing membrane protein function and signaling, such as _HCN2_ and _DLG4_, can have downstream effects relevant to basigin's activity. For example, rs56101188 and rs11882870 in _HCN2_, which encodes a hyperpolarization-activated cyclic nucleotide-gated channel, may modulate ion channel activity and cellular excitability, a mechanism often explored in genome-wide association studies investigating protein quantitative trait loci. ^[8] Similarly, rs200489612 in _DLG4_ (PSD-95), a key scaffolding protein at neuronal synapses, could influence the precise localization and interaction of membrane proteins, indirectly affecting the functional environment of basigin and related cell surface molecules. ^[8]

Metabolic pathways and enzymatic processes are closely intertwined with cellular health and can influence the broader physiological context in which basigin operates. The _GCKR_ gene, encoding Glucokinase Regulatory Protein, plays a pivotal role in glucose metabolism by regulating glucokinase activity, and its common variant rs1260326 is widely associated with various metabolic traits. Given basigin's involvement in glucose transport, alterations in _GCKR_ function could indirectly impact cellular glucose dynamics and metabolic stress, areas often investigated in studies identifying protein quantitative trait loci. ^[8] Furthermore, _ADAMTS13_ (ADAM Metallopeptidase With Thrombospondin Type 1 Motif 13) is an enzyme critical for cleaving von Willebrand factor, thereby regulating blood coagulation, and variant rs41296094 may affect its activity. Changes in coagulation or vascular integrity, where basigin can also play a role through inflammation or cell adhesion, could be influenced by such variants. The _NIBAN1_ gene, with variant rs115276619, contributes to cell proliferation and apoptosis regulation, and its dysregulation could affect tissue homeostasis and repair, processes where basigin's role in cell-cell communication and matrix remodeling is significant. ^[8]

The _ABO_ blood group system, determined by glycosyltransferases, is fundamental to cell surface antigen presentation, a process directly relevant to the glycosylation of many membrane proteins, including basigin. The specific variant rs2519093 in _ABO_, along with rs78590974 associated with _ABO - Y_RNA_, could modify glycosylation patterns or gene expression, thereby impacting the structure and function of highly glycosylated proteins like basigin, which is explicitly recognized as part of the _ABO_ gene region in genetic mapping efforts. ^[8] Proper protein folding is also essential, and the _PDILT_ gene, with variant rs77924615, encodes a protein disulfide isomerase involved in endoplasmic reticulum protein maturation. This variant could influence the correct folding and trafficking of complex transmembrane proteins such as basigin, affecting its presentation and activity at the cell surface. Moreover, _NPHS2_, encoding podocin, is vital for the integrity of the kidney's glomerular filtration barrier, and rs61747728 is associated with kidney pathologies, where basigin's involvement in inflammation and tissue damage could be significant. ^[8] Finally, the intergenic variant rs700753, located near _HMGN1P19 - EPS15P1_, may influence regulatory elements affecting gene expression in the region, with potential indirect effects on cellular processes relevant to basigin.

Key Variants

RS ID	Gene	Related Traits
rs56101188 rs11882870	BSG - HCN2	basigin measurement educational attainment
rs2519093	ABO	coronary artery disease venous thromboembolism hemoglobin measurement hematocrit erythrocyte count
rs115276619	NIBAN1	glomerular filtration rate basigin measurement level of thioredoxin domain-containing protein 15 in blood B-cell antigen receptor complex-associated protein beta chain measurement interleukin-15 receptor subunit alpha measurement
rs61747728	NPHS2	gout thrombomodulin measurement tumor necrosis factor receptor superfamily member 1B amount basigin measurement CD27 antigen measurement
rs1260326	GCKR	urate measurement total blood protein measurement serum albumin amount coronary artery calcification lipid measurement
rs77924615	PDILT	glomerular filtration rate chronic kidney disease blood urea nitrogen amount serum creatinine amount protein measurement
rs200489612	DLG4	alkaline phosphatase measurement cholesteryl esters:total lipids ratio, intermediate density lipoprotein measurement cholesteryl ester measurement, intermediate density lipoprotein measurement lipid measurement, intermediate density lipoprotein measurement free cholesterol measurement, low density lipoprotein cholesterol measurement
rs700753	HMGN1P19 - EPS15P1	glomerular filtration rate blood urea nitrogen amount basigin measurement IGF-1 measurement IGFBP-3 measurement
rs78590974	ABO - Y_RNA	angiotensin-converting enzyme measurement basigin measurement level of carcinoembryonic antigen-related cell adhesion molecule 20 in blood tgf-beta receptor type-2 measurement vascular endothelial growth factor receptor 3 amount
rs41296094	ADAMTS13	alkaline phosphatase measurement basigin measurement interleukin-10 receptor subunit beta measurement level of interleukin-6 receptor subunit beta in blood serum level of MHC class I polypeptide-related sequence A in blood, level of MHC class I polypeptide-related sequence B in blood

Urate Transport and Metabolic Homeostasis

SLC2A9, also known as GLUT9, plays a central role in the active biological transport of urate, significantly influencing serum urate concentrations and its excretion from the body. ^[9] This protein functions as a renal urate anion exchanger, critically regulating blood urate levels and contributing to the overall metabolic balance of uric acid. ^[10] Beyond its primary role in urate transport, SLC2A9 is also implicated in fructose metabolism, suggesting a broader involvement in carbohydrate metabolic pathways. ^[9] The efficient flux of urate through SLC2A9 is essential for maintaining purine homeostasis, preventing the accumulation of uric acid that can lead to pathological conditions.

Genetic and Post-Translational Regulation of SLC2A9 Function

The function of SLC2A9 is modulated by both genetic and post-translational mechanisms. Common single nucleotide polymorphisms (SNPs) within the SLC2A9 gene are strongly associated with variations in serum uric acid levels, with some variants exhibiting pronounced sex-specific effects. ^[11] These genetic variations can impact the expression or activity of the transporter, thereby altering its efficiency in urate handling. Furthermore, alternative splicing of the GLUT9 gene has been shown to alter the trafficking of the SLC2A9 protein within the cell, which can influence its cellular localization and, consequently, its ability to transport substrates. ^[12] The protein's substrate selectivity, including its preference for urate, is determined by a highly conserved hydrophobic motif located in its exofacial vestibule. ^[12]

Systems-Level Metabolic Integration

SLC2A9 is not an isolated component but is integrated into a complex network of metabolic pathways, demonstrating significant pathway crosstalk. Genome-wide association studies have linked variants in SLC2A9 to not only uric acid levels but also to other metabolic traits, including type 2 diabetes and triglyceride levels . ^{[13], [14]} This suggests its involvement in the broader regulation of energy metabolism and lipid homeostasis. The associations with multiple metabolic parameters highlight SLC2A9's role in the emergent properties of metabolic networks, where its function can have cascading effects on various physiological systems. Its influence on metabolite profiles in human serum further underscores its systemic metabolic impact. ^[14]

Clinical Implications and Disease Pathomechanisms

Dysregulation of SLC2A9 pathways contributes directly to several disease-relevant mechanisms. Imbalances in SLC2A9-mediated urate transport are a primary driver of hyperuricemia, a key risk factor for gout. ^[9] Genetic variants that impair SLC2A9 function or expression can lead to elevated serum uric acid, predisposing individuals to this inflammatory arthropathy. ^[9] Moreover, the association of SLC2A9 with type 2 diabetes, triglyceride levels, and the broader metabolic syndrome indicates its involvement in the pathogenesis of these widespread conditions. ^[13] Understanding these pathway dysregulations makes SLC2A9 a potential therapeutic target for managing hyperuricemia, gout, and potentially other metabolic disorders.

References

[1] Benjamin, E. J., et al. "Genome-wide association with select biomarker traits in the Framingham Heart Study." BMC Med Genet, vol. 8 Suppl 1, 2007, p. S11.

[2] Vasan, R. S., et al. "Genome-wide association of echocardiographic dimensions, brachial artery endothelial function and treadmill exercise responses in the Framingham Heart Study." BMC Med Genet, vol. 8 Suppl 1, 2007, p. S2.

[3] Yang, Q., et al. "Genome-wide association and linkage analyses of hemostatic factors and hematological phenotypes in the Framingham Heart Study." BMC Med Genet, vol. 8 Suppl 1, 2007, p. S9.

[4] Willer, C. J., et al. "Newly identified loci that influence lipid concentrations and risk of coronary artery disease." Nat Genet, vol. 40, no. 2, 2008, pp. 161-169.

[5] Benyamin, B., et al. "Variants in TF and HFE explain approximately 40% of genetic variation in serum-transferrin levels." Am J Hum Genet, vol. 84, no. 1, 2009, pp. 60-65.

[6] Dehghan, A., et al. "Association of three genetic loci with uric acid concentration and risk of gout: a genome-wide association study." Lancet, vol. 372, no. 9654, 2008, pp. 1953-1961.

[7] Pare, G., et al. "Novel association of ABO histo-blood group antigen with soluble ICAM-1: results of a genome-wide association study of 6,578 women." PLoS Genet, vol. 4, no. 7, 2008, e1000118.

[8] Melzer, D., et al. "A genome-wide association study identifies protein quantitative trait loci (pQTLs)." PLoS Genet, 2008.

[9] Vitart, V. et al. "SLC2A9 is a newly identified urate transporter influencing serum urate concentration, urate excretion and gout." Nat Genet. 2008; 40(4):432–7.

[10] Enomoto, A. et al. "Molecular identiﬁcation of a renal urate anion exchanger that regulates blood urate levels." Nature. 2002; 417:447–452.

[11] McArdle, P.F. et al. "Association of a common nonsynonymous variant in GLUT9 with serum uric acid levels in old order amish." Arthritis Rheum. 2008; 58(9):2894–901.

[12] Augustin, R. et al. "Identification and characterization of human glucose transporter-like protein-9 (GLUT9): alternative splicing alters trafficking." J Biol Chem. 2004; 279(16):16229–36.

[13] Saxena, R. et al. "Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels." Science. 2007; 316(5829):1331–6.

[14] Gieger, C. et al. "Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum." PLoS Genet. 2008; 4(11):e1000282.