Detection of single nucleotide polymorphisms (SNPs) in HER2, MUC1, ESR1, and BRCA1 genes associated with canine mammary cancer
J. D. Carvajal-Agudelo, L. Giraldo-Chalarca, D. M. Cortes-Mera, P. A. Ossa-López, E. D. Morales-Álvarez and F. A. Rivera-Páez*
Abstract
Worldwide, canine mammary cancer (CMC) is the most frequent type of neoplasia in female dogs, and it is three times more frequent in dogs than in humans. In Colombia, CMC is the second most frequent type of cancer, after skin neoplasia.
Genetics is one of the most important factors involved in any type of cancer, and the genetic basis of this disease is reflected through line breeding due to changes in allelic frequencies that are traceable using molecular markers. This study aimed to detect single nucleotide polymorphisms (SNPs) associated with CMC in blood samples collected from collected from healthy and CMC female dogs at Diego Villegas Toro Veterinary Hospital of Universidad de Caldas (Manizales, Colombia). We designed primers using Primer-BLAST and Primer3, and gene fragments from HER2, MUC1, ESR1, and BRCA1 were amplified to identify SNPs through genome mapping using the UCSC Genome Institute genome browser.
We used the genome of Canis lupus familaris Boxer breed [GCF_000002285.3, (CanFam 3.1)] as a reference to compare the gene fragments and SNPs.
We associated SNPs with the CMC and control groups by testing odds ratios (OR) through Fisher’s exact tests to determine an association or risk for CMC. We detected two SNPs for ESR1, three for MUC1, six for HER2, and one for BRCA1.
MUC1 was the only gene to display an SNP in an exonic region that resulted in an amino acid substitution (Pro>Thr). No significant differences based on the OR were found, though the majority of SNPs, with the exception of four, were found in females with CMC. We report a novel molecular marker for HER2 that amplifies exons 25–26 and introns 24–25, and highlight the importance of conducting further studies on MUC1 and elucidating the role of introns and splicing in candidate genes associated with CMC.Key words: genetic mapping; genetic variation; intron; mammary neoplasia; SNP association
Introduction
Canine mammary cancer (CMC) is the most frequent group of neoplasia in female dogs, with a 50% to 70% incidence compared to other types of cancer; it is also three times more prevalent in dogs than in humans (Beselga, 2013; do Carmo Silva et al., 2019). In Colombia, CMC shows a 21.5% incidence and is second only to skin neoplasia (Caicedo and Acosta, 2012). In Manizales, 25% of all diagnosed cases of mammary neoplasia are malignant (Pedraza-Ordόñez et al., 2008).
The risk factors, harmful environments, and types and subtypes of CMC have been reported (Araújo et al., 2018). Genetics is one of the most important factors involved in any type of cancer, and is mainly manifested through linebreeding or in dog breeds that are highly susceptible to this disease (Borge et al., 2011). Line breeding causes changes in the allelic frequencies of regulatory genes involved in cancer, which can be studied through genomic markers. They are related to specific changes, such as single nucleotide polymorphisms (SNPs) (Wang and Moult, 2001). SNPs have been associated with changes in hormonal, epidermal, growth, cell differentiation, and cell cycle factors (Ressel et al., 2013; Campos et al., 2015; Hermo et al., 2017).
Previous studies on dog breeds, genotype frequency and susceptibility to similar diseases provide efficient models to study the effects of different genes (Borge et al., 2011; Borge et al., 2013). These models imply using direct lines and their progeny, which can be easily studied by non-invasive genotyping through skin or fur scraping and blood samples (Siravegna and Bardelli, 2014).
CMC is poorly characterized at the genomic level, unlike human breast cancer. This is evidenced by the low number of SNPs reported for different genes of interest showing genetic mutations and variations of medical importance (Liu et al., 2014). The main molecular markers and genes reported to date for CMC include estrogen receptors (ESR1), progesterone (PR), epidermal growth (HER2), hormone metabolism [cytochrome P450 (CYP1A1 and CYP1B1)], cellular proliferation [proliferating cell nuclear antigen (PCNA)], cellular regulation [breast cancer 1 and 2 (BRCA1 and BRCA2)], apoptosis [ADP-ribose polymerase (PARP), caspases], and basal [mucin 1 (MUC1) ], among others (Vinothini et al., 2009; Dawood et al., 2011). Many of these genes have important functions in the sustainment of cellular and molecular mechanisms (Borge et al., 2011). Medical diagnosis and the search for molecular markers directly associated with diseases and susceptible genotypes have greatly improved, given the advances in genomic studies (Ginsburg and Willard, 2009; Santillán-Garzón et al., 2015).
The human epidermal growth factor receptor 2 (HER2), also known as erb-b2 (receptor tyrosine kinase 2), is widely used as a molecular marker for mammary cancer prognosis in dogs and humans (Muhammadnejad et al., 2012).
Furthermore, SNPs of HER2 related to CMC have been reported (Hsu et al., 2009; Canadas-Sousa et al., 2019a). Likewise, BRCA1 is used as a diagnostic molecular marker and is directly associated with tumour suppression and cell cycle regulation (Tsuchida Complete et al., 2001; Rivera et al., 2009; Enginler et al., 2014; Sun et al., 2015). Other molecular markers, such as estrogen receptor alpha (ESR1), are mediators of hormonal processes and can promote carcinogenic responses (Russo and Russo, 2008).
Therefore, the direct relationship of these genes with regulatory mechanisms in mammary tissue has motivated the study of SNPs associated with CMC (Borge et al., 2013; Akis et al., 2019).
MUC1 is involved in apoptotic responses in tumour cells (Yin et al., 2003) and is considered a carcinogenic gene related to breast cancer in humans (De Cremoux et al., 2000; Kufe, 2013).
Several studies on SNP identification report mutations that can cause significant alterations in cellular function of candidate genes and, thus, could be used as molecular markers for cancer prognosis or diagnosis. This study aimed to detect SNPs of HER2, MUC1, ESR1, and BRCA1 genes in patients with CMC in Manizales (Caldas, Colombia).
Materials and methods
Sample collection
We studied 31 cases of female dogs at Diego Villegas Toro Veterinary Hospital of Universidad de Caldas (Manizales, Colombia) between June 2018 and January 2019. Nineteen patients showed alterations in mammary glands and 12 were healthy patients (control group). We collected 4 mL of peripheral blood from each patient in 5 mL heparin tubes through aseptic venipuncture and immediately conserved the samples at -4 °C.
Amplification of HER2, MUC1, ESR1, and BRCA1
Genomic DNA was isolated from blood samples using the Dneasy Blood and Tissue Kit (Qiagen®) following the manufacturer’s instructions. We assessed DNA quality by 1% agarose gel electrophoresis and estimated the amount of DNA through fluorometric quantification on a Quantus FluorometerTM (Promega®) using the QuantiFluor®dsDNA System (Promega®).
We searched the literature for primers used to analyse polymorphic sites in MUC1, ESR1, and BRCA1 (Table 1).
For HER2, we designed primers using Primer-BLAST and Primer3 (Untergasser et al., 2012; Ye et al., 2012) and analysed the primers using Oligoanalyzer (IDT Integrated DNA Technologies). Primers were evaluated with conventional PCR on a Techne TC-PLUS thermocycler. We used a final volume per reaction of 40 μL, containing 20.6 μL ultrapure water, 8 μL 5X buffer, 2.4 μL MgCl 2 (25 mM), 3.2 μL dNTPs mix (10 mM), 0.6 μL each primer (25 μM), 2 units GoTaq®Flexi DNA Polymerase (Promega®), and 4 μL DNA.
The thermal cycling conditions were initial denaturation at 95 °C for 5 min, followed by 35 cycles at 95 °C for 20s, 55 °C for 15s, 72 °C for 30s, and a final extension at 72 °C for 7 min. PCR products were separated by horizontal electrophoresis on 1% agarose gels using TBE 1X pH 8.0 running buffer at 100v/40mA. Gels were stained with SYBR®Safe and visualized on a Gel Doc-It2 310 photodocumenter (UVP). Amplicons were purified using the Wizard® SV Gel and PCR Clean-Up System (Promega®), according to the manufacturer’s instructions. Sanger sequencing of the purified amplicons was provided by Macrogen (South Korea).
Sequence quality was assessed using Geneious Trial v8.14 (Drummond et al., 2009 and GeneStudio V.2.2 (GeneStudio Inc, 2011).
Identity, specificity, and genetic mapping of SNPs
The reference gene sequences for HER2, MUC1, ESR1, and BRCA1 were obtained from the genome of Canis lupus familaris Boxer breed [GCF_000002285.3, (CanFam 3.1)]. We mapped each gene to the genome using the BLAT tool (Kent, 2002) of the UCSC Genome Institute genome browser (https://genome.ucsc.
edu/) (Karolchik et al., 2003) to compare SNPs and confirm their location and identity. Genomic locations and reference sequences for each gene are shown in Table 2.
The sequences generated in this study were aligned to the reference sequences using ClustalW in MEGA X (Kumar et al., 2018).
Data analysis
To determine significant differences between the SNPs and tumour presence, we associated SNPs with the CMC and control groups by testing odds ratios (OR) using Fisher’s exact tests. We used a statistical significance level of 0.05. The analyses were conducted using IBM SPSS (Arbuckle, 2011).
Results
Of the 31 patients included in the study, 51.6% were purebred (e.g., Boston Terrier, English Bulldog, Border Collie, French Poodle, Labrador Retriever, Pinscher, Pitbull, Samoyed, Schnauzer, and Shih Tzu) and 48.4% were mixed-breed. The DNA sequences obtained for each patient were compared to ESR1, MUC1, HER2, and BRCA1 reference gene sequences from the genome of Canis lupus familiaris Boxer breed [GCF_000002285.3, (CanFam 3.1)]. The alignments showed identities equal to or greater than 99%.
Furthermore, we detected two SNPs for ESR1, three of MUC1, six of HER2, and one for BRCA1, as well as two deletions for MUC1 and three deletions of ESR1 (Table 3).
The GenBank accessions for the nucleotide sequences obtained in this study are [MT552115-MT552145] for HER2, [MT550815-MT550843] for ESR1, [MT561299-MT561329] for BRCA1, and [MT571366-MT571396] for MUC1.
The association analysis of SNPs with CMC and control patients did not show significant differences for any SNP. The OR did not indicate a direct significant association with CMC since only three SNPs (4451 C>A MUC1, 23079 T>G HER2, 67395 ESR1 T>C) were found within the risk interval (>1) (Table 4).
However, the majority of SNPs, with the exception of four, were detected in female dogs with CMC.
Discussion
This study involves SNPs in exons 25 and 26 of HER2 and identifies unique SNPs in introns 24 and 25. Previous studies have identified SNPs of HER2 associated with CMC in exons 14-18 (Hsu et al., 2009) and 9, 13, 14, 16, 23, and 27 (Borge et al., 2011). These results suggest that most SNPs of HER2 related to CMC are located along the gene and not in certain regions. For ESR1, we found two novel SNPs in intron 6, while prior studies have only assessed exons 2, 4, and 8 in relation to CMC (Borge et al., 2011).
Borge et al. (2013) and Canadas-Sousa et al. (2019b) genotyped and haplotyped HER2, finding extensive and strong relationships along the entire gene.
For BRCA1, we report one SNP in intron 21. BRCA1 is one of the most studied genes in CMC. Previous studies analysing exon 9, intron 8, 5′ UTR (Enginler et al., 2014), exon 22 (Borge et al., 2011), 5’ flanking region (Qiu and Lin 2016), and 5’/ 3’ UTRs (Sun et al., 2015), and array genotyping studies on BRCA1 (Rivera et al., 2009) found strong associations with the incidence and development of CMC. Consequently, research on this gene requires studying a wide region for association analyses. For MUC1, there are no SNP association studies in CMC; therefore, this is the first study to characterize SNPs of MUC1 in exon 7 and intron 6 in CMC research. We report three SNPs for MUC1, specifically, one resulting in an amino acid substitution (4451 G/C, Pro/Thr). Other studies have found a relationship between the expression of MUC1 and CMC (Campos et al., 2015), and associated SNPs in humans (Zeillinger et al., 2000; Yang et al., 2007). These previous findings suggest that this gene strongly influences molecular mechanisms in CMC, which must be further studied in canine models. Finally, we report two deletions of MUC1 and three of ESR1 in intronic regions, which could be used to curate the sequences of the corresponding genes in the reference genome of Canis lupus familaris Boxer breed [GCF_000002285.3, (CanFam 3.1)].
The main challenges of studying specific candidate genes involved in diseases include the information obtained from these genes and coverage; the latter is limited by the design of molecular markers and, therefore, restricts the information and relationships that can be found (Thomas et al., 2009). SNP association studies in CMC have focused on exonic regions since these show direct changes in amino acid composition (Borge et al., 2011; Goebel and Merner, 2017). However, intronic SNPs can induce excisions and alternative splicing, promote changes in RNA stability or structure that directly affect gene expression, interfere with the speed and accuracy of transcription and/ or translation, and alter protein folding (Canadas-Sousa et al., 2019a). Moreover, the use of any gene involved in the etiopathogenesis of a disease should be supported by a significant association at the population level and/or experimental demonstration (Wang and Moult, 2001).
In this context, all SNPs detected in this study are novel, since the majority are located in intronic regions and these SNPs are not generally studied. However, studies on SNPs in intronic regions are relevant since induced changes in excisions, splicing, and gene expression can significantly alter the phenotype and influence cancer development (Hiratani et al., 2005; Law et al., 2007). Moreover, we report a new marker of HER2 that amplifies exons 25–26 and introns 24–25, which can be promising for future studies on CMC.
Acknowledgments
To the Vice-rectory of Research and Post-graduate Studies of Universidad de Caldas for funding the project titled “Expression of ER-α, MUC1, HER-2 genes and their signficance in the early diagnosis of canine mammary cancer in Manizales, Caldas” (Expresión de los genes ER-α, MUC1, HER-2 y su significancia en el diagnóstico temprano en cáncer mamario canino en Manizales, Caldas) code 0298218. To the research group GEBIOME and Diego Villegas Toro Veterinary Hospital of Universidad de Caldas for their contributions to the project.
References [… show]
Detekcija polimorfizama jednog nukleotida (SNP-a) u HER2, MUC1, ESR1 i BRCA1 gena povezanih s tumorom mliječne žlijezde kuja
Juan D. CARVAJAL-AGUDELO, Laura GIRALDO-CHALARCA, Diana M. CORTES-MERA, Paula A. OSSA-LÓPEZ, Grupo de Investigación GEBIOME, Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas y Naturales, Universidad de Caldas, Calle 65 No. 26-10 Apartado Aéreo 275 Manizales, Caldas, Colombia; Edwin D. MORALES-ÁLVAREZ, Grupo de Investigación GEBIOME, Departamento de Química, Facultad de Ciencias Exactas y Naturales, Universidad de Caldas, Calle 65 No. 26-10 Apartado Aéreo 275 Manizales, Caldas, Colombia; Fredy A. RIVERA-PÁEZ, Grupo de Investigación GEBIOME, Departamento de Ciencias Biológicas, Facultad de Ciencias Exactasay Naturales, Universidad de Caldas, Calle 65 No. 26-10 Apartado Aéreo 275 Manizales, Caldas, Colombia
Diljem svijeta, tumor mliječne žlijezde kuja (CMC) najčešća je skupina neoplazija kuja te je tri puta učestalija u pasa nego u ljudi. U Kolumbiji, CMC je bio druga najčešća vrsta tumora, nakon neoplazija na koži.
Nadalje, genetika je jedan od najvažnijih čimbenika uključenih u bilo koju vrstu tumora, a genetska baza ove bolesti održava se kroz linijski uzgoj uslijed promjena alelnih frekvencija, koje se mogu pratiti preko molekularnih markera. Ova studija imala je za cilj detektirati polimorfizme jednog nukleotida (SNP-e) povezane s CMC-om u uzorcima krvi prikupljenih od kuja s CMC-om i zdravih kuja u veterinarskoj bolnici Diego Villegas Toro Sveučilišta Caldas (Manizales, Kolumbija).
Dizajnirali smo početnice uporabom Primer-BLAST i Primer3 te su fragmenti gena HER2, MUC1, ESR1 i BRCA1 pojačani za identifikaciju SNP-a preko mapiranja genoma uporabom preglednika genoma Instituta za genom Sveučilišta Santa Cruz u Kaliforniji (UCSC). Rabili smo genom Canis lupus familaris pasmine bokser [GCF_000002285.3, (CanFam 3.1)] kao referencu za usporedbu fragmenata gena i SNP-a. Povezali smo SNP-e s CMC-om i kontrolnim skupinama testiranjem omjera izgleda (OR) pomoću Fisherovih egzaktnih testova za određivanje povezanosti ili rizika od CMC-a. Detektirali smo dva SNP-a za ESR1, tri za MUC1, šest za HER2 i jedan za BRCA1.
MUC1 je bio jedini gen koji je pokazao SNP u regiji egzona što je rezultiralo supstitucijom aminokiseline (Pro>Thr). Nismo pronašli značajne razlike na temelju OR-a, premda je većina SNP-a, izuzev četiri, pronađena u kuja s CMC-om. Prijavljujemo novi molekularni marker za HER2 koji pojačava egzone 25 – 26 i introne 24 – 25 te naglašavamo važnost provođenja dodatnih studija na MUC1, kao i pojašnjenja uloge introna i izrezivanja u gena kandidata poveznih s CMC-om.Ključne riječi: genetsko mapiranje, genetska varijacija, intron, neoplazija mliječnih žlijezda, povezanost s SNP-om