Skip to content

Endoglin

Background

endoglin (gene symbol: ENG) is a transmembrane glycoprotein that plays a critical role in cellular signaling, particularly within the transforming growth factor-beta (TGF-β) superfamily pathway. Its involvement in various physiological processes, especially those related to the vascular system, makes it a subject of significant research.

Biological Basis

endoglin is expressed in several cell types, including vascular smooth muscle cells, renal mesangial cells, and platelets. [1] It is also prominently expressed on endothelial cells, where it acts as a co-receptor for TGF-β ligands. This interaction is crucial for regulating angiogenesis—the formation of new blood vessels—as well as maintaining vascular integrity and cardiovascular development. Research suggests endoglin is likely involved in platelet biology, with specific genetic variations being associated with platelet aggregation phenotypes. For instance, a single nucleotide polymorphism (SNP) in the endoglin gene has been found associated with epinephrine-induced platelet aggregation and nominally with ADP-induced and collagen-induced platelet aggregation. [1]

Clinical Relevance

The biological functions of endoglin have direct clinical implications. Genetic mutations in the ENG gene are the primary cause of Hereditary Hemorrhagic Telangiectasia type 1 (HHT1), a dominant genetic disorder characterized by abnormal blood vessel development, leading to recurrent nosebleeds, gastrointestinal bleeding, and arteriovenous malformations in organs like the lungs, liver, and brain. Beyond HHT1, altered endoglin expression and function are implicated in a range of other conditions, including preeclampsia, certain cancers, and various cardiovascular diseases. The observed associations between endoglin SNPs and platelet aggregation suggest a potential role in thrombotic disorders and warrant further investigation. [1]

Social Importance

Understanding endoglin's role is vital for developing targeted therapies and diagnostic tools for diseases affecting the vascular system. Its involvement in conditions like HHT1 highlights the importance of genetic research in identifying underlying causes of rare diseases and informing patient care. Furthermore, its associations with common traits like platelet aggregation contribute to a broader understanding of cardiovascular health and disease risk, potentially aiding in personalized medicine approaches.

Methodological and Statistical Constraints

The interpretability of genetic associations, including those potentially involving endoglin, is subject to several methodological and statistical constraints inherent in genome-wide association studies (GWAS). Many studies, for instance, operate with limited statistical power to detect genetic effects that explain a small proportion of phenotypic variation, often missing numerous common variants with subtle, yet cumulatively significant, influences. [2] Furthermore, the practice of estimating effect sizes primarily from replication stages can lead to an overestimation of true genetic effects, a phenomenon known as Winner's Curse, which necessitates cautious interpretation of reported magnitudes. [3]

A significant limitation in earlier GWAS was the incomplete coverage of genetic variation, as studies often utilized a subset of available SNPs, potentially missing causal variants or entire genes like endoglin due to assay limitations. [1] This partial coverage also hinders the comprehensive study of candidate genes and the replication of previously reported findings. [1] Additionally, reliance on imputation to infer missing genotypes, while beneficial, introduces a small percentage of error, which can slightly reduce statistical power or introduce noise into the analyses. [3] The common assumption of an additive mode of inheritance in genetic analyses may also overlook complex non-additive genetic effects that contribute to phenotype variability. [4]

Phenotypic Measurement and Generalizability

Challenges in phenotype measurement and the generalizability of findings pose further limitations for understanding genetic influences. Phenotypes, such as echocardiographic traits, when averaged across multiple examinations spanning extended periods (e.g., twenty years) and involving different equipment, can introduce misclassification and mask age-dependent genetic effects. [2] This variability in measurement can obscure the true genetic architecture and its dynamic interplay with development. Moreover, study designs that perform only sex-pooled analyses may fail to detect genetic associations that are specific to either males or females, leading to an incomplete understanding of sex-dependent genetic mechanisms. [1]

A critical limitation for the broader applicability of research findings is the restricted ancestry of study populations. Many large-scale genetic studies have historically focused on individuals of European descent, which limits the generalizability of their discoveries to other ethnic groups. [2] Although efforts are made to correct for population stratification through methods like principal component analysis or genomic control, this underlying demographic bias means that genetic variants and their effects may not be directly transferable across diverse populations. [5] Furthermore, the exclusion of rare variants (e.g., those with minor allele frequency <1%) in many GWAS means that the contribution of these potentially impactful variants to phenotypes remains largely unexplored. [6]

Unaddressed Confounders and Knowledge Gaps

Significant knowledge gaps persist due to unaddressed confounders and the complex nature of genetic influences. A major limitation is the typical absence of investigations into gene-environment (GxE) interactions, which are crucial for understanding how genetic variants manifest their effects in specific environmental contexts. [2] For instance, the impact of a gene on a trait might be modulated by factors like dietary intake, and without such analyses, a comprehensive picture of genetic etiology remains elusive. [2]

Additionally, the adjustment for various covariates in statistical models, while necessary, can potentially mask genetic effects that are mediated through these very covariates, leading to an underestimation or misinterpretation of genetic influences. [1] Consequently, many associations identified in GWAS, including those that might relate to endoglin, should be regarded as hypotheses requiring further rigorous testing and replication in independent cohorts. [1] The cumulative effect of these limitations, including insufficient power for small effects, incomplete variant coverage, and unexamined GxE interactions, contributes to the challenge of explaining the "missing heritability" for many complex traits.

Variants

The genetic variants discussed here encompass a range of functional implications, from blood group determination and metabolic regulation to cellular signaling and vascular health, often intersecting with pathways relevant to endoglin, a key protein in endothelial function and angiogenesis. Endoglin, encoded by the ENG gene, acts as a co-receptor for the transforming growth factor-beta (TGF-β) superfamily, critical for blood vessel development, remodeling, and maintaining vascular integrity. Variants within or near genes involved in these processes can subtly or significantly alter their functions, contributing to individual differences in disease susceptibility and physiological traits.

Variants within genes directly involved in vascular and metabolic regulation, such as ENG, PLAUR, and MME, play significant roles in maintaining physiological balance. Endoglin, encoded by the ENG gene, is a crucial component of the TGF-β signaling pathway, which is fundamental for angiogenesis and vascular homeostasis. Single nucleotide polymorphisms (SNPs) like rs1800956, rs41356253, and rs76001241 within the ENG gene may influence its expression or the protein's ability to modulate TGF-β signaling, thereby affecting vascular development and repair, as well as the progression of diseases like hereditary hemorrhagic telangiectasia. The PLAUR gene, which codes for the urokinase plasminogen activator receptor, is involved in cell surface proteolysis, a process essential for cell migration, tissue remodeling, and fibrinolysis, making variants like rs4760 relevant to vascular inflammation and repair mechanisms. Similarly, the MME gene encodes neprilysin, an enzyme that degrades various peptides, including natriuretic peptides, thereby influencing blood pressure regulation and vascular tone, where a variant like rs61762319 could alter its enzymatic activity. Such genetic variations are routinely investigated in genome-wide association studies seeking to identify loci associated with a wide array of biomarker traits that reflect cardiovascular and metabolic health. [7] These studies aim to uncover the genetic underpinnings of complex traits, including those related to lipid concentrations and risk of coronary artery disease. [3]

The ABO gene, which dictates human blood groups, is another critical locus with broad implications for health, including cardiovascular outcomes. Variants such as rs779860630, rs8176741, and rs8176722 profoundly affect the glycosyltransferase enzymes that determine A, B, and O blood antigens, which are present not only on red blood cells but also on endothelial surfaces and other tissues. These antigenic differences can influence susceptibility to various diseases, including venous thromboembolism and certain cancers, highlighting a broader impact on systemic biology relevant to endoglin's role in vascular health. The identification of specific ABO gene regions and associated polymorphisms, like those influencing the O blood group, underscores its importance in human genetic variation. [8] Additionally, intergenic variants such as rs138683771, located between LCN1P1 and ABO, or rs34434834 near ST3GAL4 and KIRREL3, can influence the expression or regulation of nearby genes. ST3GAL4 encodes a sialyltransferase involved in glycosylation, a process that can impact cell-cell recognition and signaling, potentially interacting with ABO pathways and influencing endothelial cell behavior, while KIRREL3 plays a role in cell adhesion, a fundamental aspect of vascular integrity. These genetic variations contribute to the polygenic nature of dyslipidemia and other related conditions. [4]

Further variants span genes and intergenic regions with diverse cellular functions that indirectly connect to overall physiological health and potentially, vascular biology. The SURF1 gene, with variants like rs183853102, is essential for the assembly of cytochrome c oxidase, a key enzyme in the mitochondrial electron transport chain. Impairment of mitochondrial function can lead to increased oxidative stress and energy deficits, which negatively impact endothelial cells and their ability to maintain vascular health, thereby indirectly affecting endoglin-mediated processes. Variants in STXBP1, such as rs76142582, rs72769820, and rs7036307, affect a gene critical for synaptic vesicle exocytosis and neurotransmitter release, primarily in the nervous system. While its direct link to endoglin is less clear, fundamental cellular secretion mechanisms can have broader systemic impacts. Intergenic variants, such as rs186021206 between RPL7AP64 and ASGR1, or rs140728646 between OBP2B and LCN1P1, often reside in regulatory regions that can influence the expression of adjacent genes. For instance, ASGR1 (asialoglycoprotein receptor 1) is involved in the hepatic clearance of glycoproteins, a process vital for liver function and systemic metabolism, which are intrinsically linked to cardiovascular health. Such broad genetic analyses, including those focusing on hemostatic factors and hematological phenotypes, highlight the complex genetic architecture underlying human traits. [1]

Key Variants

RS ID Gene Related Traits
rs4760 PLAUR granulocyte percentage of myeloid white cells
monocyte percentage of leukocytes
leukocyte quantity
neutrophil count, eosinophil count
granulocyte count
rs183853102 SURF1 GDNF family receptor alpha-like measurement
level of pancreatic secretory granule membrane major glycoprotein GP2 in blood
level of mucin-2 in blood
level of cell surface glycoprotein MUC18 in blood
kin of IRRE-like protein 2 measurement
rs779860630
rs8176741
rs8176722
ABO interleukin-6 receptor subunit beta amount
insulin-like growth factor 1 receptor amount
kremen protein 1 measurement
ephrin-B2 measurement
hepatocyte growth factor receptor amount
rs34434834 ST3GAL4, KIRREL3 angiotensin-converting enzyme measurement
intercellular adhesion molecule 1 measurement
angiopoietin-2 measurement
level of tyrosine-protein kinase receptor Tie-1 in blood
von Willebrand factor quality, coronary artery disease
rs186021206 RPL7AP64 - ASGR1 ST2 protein measurement
alkaline phosphatase measurement
low density lipoprotein cholesterol measurement, lipid measurement
low density lipoprotein cholesterol measurement
low density lipoprotein cholesterol measurement, phospholipid amount
rs140728646 OBP2B - LCN1P1 angiopoietin-1 receptor measurement
protein HEG homolog 1 measurement
level of pancreatic secretory granule membrane major glycoprotein GP2 in blood
kin of IRRE-like protein 2 measurement
chymotrypsin-C measurement
rs138683771 LCN1P1 - ABO level of pancreatic secretory granule membrane major glycoprotein GP2 in blood
level of mucin-2 in blood
level of cadherin-17 in blood serum
level of ephrin type-A receptor 4 in blood serum
endoglin measurement
rs61762319 MME systolic blood pressure
testosterone measurement
level of neprilysin in blood
blood protein amount
Alzheimer disease
rs76142582
rs72769820
rs7036307
STXBP1 endoglin measurement
rs1800956
rs41356253
rs76001241
ENG endoglin measurement

Clinical Manifestations and Associated Cell Types

Endoglin (ENG), also known as CD105, is expressed in various cell types, including vascular smooth muscle cells, renal mesangial cells, and platelets. [1] Its involvement in platelet biology suggests a role in hemostatic processes, where alterations related to ENG may manifest as changes in platelet function. [1] Specifically, this can impact their aggregation capabilities, which are crucial for blood clot formation. These presentations represent intermediate phenotypes that can provide detailed insights into potentially affected biological pathways. [9]

Assessment of Platelet Function and Biomarkers

The primary method for assessing phenotypes related to endoglin involves measuring platelet aggregation. [1] This can be quantified using various inducers, such as epinephrine (Epi), adenosine diphosphate (ADP), and collagen. [1] Studies have shown significant associations with Epi-induced platelet aggregation, alongside nominal or borderline nominal significance for ADP-induced and collagen-induced aggregation, respectively. [1] These aggregation phenotypes are typically analyzed as multivariable adjusted residuals from measurements taken at specific examination cycles, such as cycle 5, to account for confounding factors. [1] Diagnostic tools involve statistical analyses like additive family-based association tests (FBAT) and linear regression models with general estimating equations (GEE) to identify associations between genetic variants and these continuous-scale phenotypes. [2] P-values derived from these tests quantify the strength of association, with lower p-values indicating greater statistical significance. [1] These quantitative trait loci (pQTLs) for protein levels offer objective measures for understanding disease mechanisms. [8]

Variability in Presentation and Diagnostic Implications

The clinical presentation and measurement values related to endoglin may exhibit variability influenced by factors such as age and sex, as phenotypes are often adjusted for these covariates in analyses. [1] This adjustment helps to discern genetic effects independent of these demographic factors, suggesting potential inter-individual and age- or sex-related differences in platelet function or ENG expression. [1] Such phenotypic diversity highlights the complexity in understanding the full spectrum of endoglin's influence. From a diagnostic perspective, associations identified with endoglin-related variants and platelet aggregation are currently considered hypotheses that require further validation in independent cohorts. [1] While these findings suggest potential biological pathways for further investigation, their direct diagnostic or prognostic value for clinical decision-making is still emerging and not yet established. [1] Further research is warranted to clarify their clinical correlations and red flags for specific conditions.

References

[1] Yang Q, et al. "Genome-wide association and linkage analyses of hemostatic factors and hematological phenotypes in the Framingham Heart Study." BMC Medical Genetics, vol. 8, no. 1, 2007, p. 79.

[2] Vasan, R. S., et al. "Genome-wide association of echocardiographic dimensions, brachial artery endothelial function and treadmill exercise responses in the Framingham Heart Study." BMC Med Genet, vol. 8, suppl. 1, 2007, p. S2.

[3] Willer, C. J., et al. "Newly identified loci that influence lipid concentrations and risk of coronary artery disease." Nat Genet, vol. 40, no. 2, 2008, pp. 161-169.

[4] Kathiresan, Sekar, et al. "Common variants at 30 loci contribute to polygenic dyslipidemia." Nature Genetics, vol. 40, 2008, pp. 189–197.

[5] Pare, G., et al. "Novel association of ABO histo-blood group antigen with soluble ICAM-1: results of a genome-wide association study of 6,578 women." PLoS Genet, vol. 4, no. 7, 2008, e1000118.

[6] Dehghan, A., et al. "Association of three genetic loci with uric acid concentration and risk of gout: a genome-wide association study." Lancet, vol. 372, no. 9654, 2008, pp. 1858-1864.

[7] Benjamin, Emelia J., et al. "Genome-wide association with select biomarker traits in the Framingham Heart Study." BMC Medical Genetics, vol. 8, suppl. 1, 2007, p. S11.

[8] Melzer, David, et al. "A genome-wide association study identifies protein quantitative trait loci (pQTLs)." PLoS Genetics, vol. 4, no. 4, 2008, p. e1000034.

[9] Gieger, Christian, et al. "Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum." PLoS Genetics, vol. 4, no. 11, 2008, e1000282.