Skip to main content

Prediction of SARS-CoV-2 hosts among Brazilian mammals and new coronavirus transmission chain using evolutionary bioinformatics


Severe acute respiratory syndrome coronavirus (SARS-CoV) and SARS-CoV-2 are thought to transmit to humans via wild mammals, especially bats. However, evidence for direct bat-to-human transmission is lacking. Involvement of intermediate hosts is considered a reason for SARS-CoV-2 transmission to humans and emergence of outbreak. Large biodiversity is found in tropical territories, such as Brazil. On the similar line, this study aimed to predict potential coronavirus hosts among Brazilian wild mammals based on angiotensin-converting enzyme 2 (ACE2) sequences using evolutionary bioinformatics. Cougar, maned wolf, and bush dogs were predicted as potential hosts for coronavirus. These indigenous carnivores are philogenetically closer to the known SARS-CoV/SARS-CoV-2 hosts and presented low ACE2 divergence. A new coronavirus transmission chain was developed in which white-tailed deer, a susceptible SARS-CoV-2 host, have the central position. Cougar play an important role because of its low divergent ACE2 level in deer and humans. The discovery of these potential coronavirus hosts will be useful for epidemiological surveillance and discovery of interventions that can contribute to break the transmission chain.


Severe acute respiratory syndrome coronavirus (SARS-CoV) and SARS-CoV-2 enter the host cell using surface (S) protein as a key component and via angiotensin-converting enzyme 2 (ACE2) receptor binding (Guruprasad 2020; Li et al. 2006). Changes in the structure of S protein can reduce its affinity for ACE2 receptor and thus can impair coronavirus infectivity (Li et al. 2005). Therefore, ACE2 could be a species-specific barrier that interferes with coronavirus cross-species transmission. However, mutations in genes coding S protein might be responsible for crossing the species-specific barrier and effective binding of SARS-CoV-2 to new hosts (Alexander et al. 2020; Li et al. 2005). Accordingly, S protein is the fastest-evolving protein responsible for SARS-CoV transmission from animals to humans (Song et al. 2005).

The transmission routes of SARS-CoV, Middle East respiratory syndrome coronavirus (MERS-CoV), and SARS-CoV-2 encompass a series of wild animals (Tiwari et al. 2020). Among these animals, bats are a broad reservoir of coronavirus (Wong et al. 2020). For instance, Rhinolophus affinis, an Asian bat, is a host for a coronavirus strain having high genetic similarity with SARS-CoV-2. Additionally, betacoronavirus strains, partially similar to SARS-CoV, were detected in neotropical bat species such as A. lituratus, C. perspicillata and D. rotundus (Brandão et al. 2008; Corman et al. 2013; Góes et al. 2016). The betacoronavirus strain was found in Brazilian territory that has 178 bat species (Nogueira et al. 2014), and might favor the spillover of coronavirus to new susceptible host species.

Bats are a potential reservoir for several human pathogenic viruses, including coronavirus (Calisher et al. 2006). However, they exhibit remarkable resilience without manifestation of diseases because of their unique immune system (Brook and Dobson 2015). The high population density of bats favors viral dispersion (Calisher et al. 2006). Several species of bats fly over long distances during seasonal migrations (Popa-Lisseanu and Voigt 2009), and might be responsible for further spread of viruses to geographically distant locations. Furthermore, the use of echolocation for navigation produced by a specialized larynx can result in high dispersion of droplets and aerosols, which also facilitates viral transmission (Calisher et al. 2006).

Although SARS-like-CoV and SARS-like-CoV-2 are widely spread in bats, direct bat-to-human transmission has no evidence (Lu et al. 2015; Wong et al. 2020). Therefore, an intermediate host might be involved in the transmission of coronavirus to humans by improving the affinity of S protein for ACE2 receptor (Wang et al. 2006). In addition, similarity of ACE2 between hosts seems to be an important fator for zoonotic transmission. In a previous study, our group reported that human ACE2 receptor showed low divergence compared with ACE2 receptor from civet or pangolin, both which are known as intermediate hosts for coronavirus outbreaks. However, human ACE2 receptor showed higher divergence than that of bat ACE2 receptor (Lopes et al. 2020). Intermediate divergences reported between human and civet/pangolin ACE2 might contribute to the transmission of SARS-CoV and SARS-CoV-2, respectively.

The increased contact between animals and humans has raised concerns about the emergence of new coronavirus outbreak. Zoonotic transmission of SARS-CoV-2 has been proved, however, the spillover of SARS-CoV-2 from humans to wild animals must be considered. Evidence within mink farms has indicated the transmission of SARS-CoV-2 between humans and mink and back to humans (Munnink et al. 2021). Moreover, feral cats that roam on and beyond the mink farms were determined to be infected by SARS-CoV-2. These cats were likely infected by SARS-CoV-2-infected minks (van Aart et al. 2021). Furthermore, owing to the detection of SARS-CoV-2 in zoo animals, human-to-wild animal transmission has also been discussed (McAloose et al. 2020; Singla et al. 2020). The close contact between human population infected with SARS-CoV-2 and wild mammals might be a potential threat for new coronavirus outbreak. Based on this assumption, we aimed to study ACE2 receptor in Brazilian wild mammals to predict potential hosts for coronavirus using evolutionary bioinformatics analyses.


We performed phylogenetic analysis including 34 ACE2 protein complete sequences, obtained from the NCBI Protein database ( using MrBayes V.3.2 (Ronquist et al. 2012). ACE2 sequences from 27 species of wild mammals found in Brazil, five known coronavirus hosts, Homo sapiens and A. mississippiensis (outgroup) were included in this study. ACE2 sequences from known SARS-CoV, MERS-CoV and SARS-CoV-2 hosts were included to determine the phylogenetic relationship between these animals and Brazilian wild mammals to predict potential hosts among them. The accession numbers of all sequences are listed in Supplementary Table 1. ACE2 protein sequences were aligned using MUSCLE (Edgar 2004). To construct a phylogenetic tree, the Jones-Taylor-Thornton model (Jones et al. 1992) with a gamma distribution for among-site rate variation (JTT + G model) was used, selected by the model test conducted in MEGA X (Kumar et al. 2018). Additional analysis of a phylogenetic tree of cervid species, including 11 mitochondrial cytochrome b amino acid sequences (listed in Supplementary Table 1), was performed using mtREV model (general reversible Markov model-REV for mitochondrial DNA-encoded proteins) (Adachi and Hasegawa 1996), selected by the model test conducted in MEGA X (Kumar et al. 2018). Bayesian phylogenetic trees were searched for one million generations with sampling every 100 generation until the standard deviation from split frequencies was less than 0.01. The parameters and the trees were summarized by wasting 25% of the samples obtained (burn-in). A consensus tree was constructed and was further used to determine the posterior probabilities. Phylogenetic trees were formatted using the FigTree v1.3.1 software ( We further performed evolutionary divergent pairwise analysis based on the number of amino acid substitutions per site, using the MEGA X software (Kumar et al. 2018), and constructed a heatmap matrix to complement the phylogeny. To infer the evolutionary divergence, we used the ACE2 complete sequence-based alignment. We also used a concatenated alignment with four ACE2 regions containing residues involved in binding to S protein of SARS-CoV-2 (Supplementary file 1). ACE2 binding regions that encompass the key residues enabling interactions with SARS-CoV-2 S protein were based on previous studies (Shang et al. 2020; Brown et al. 2021).

ACE2 phylogenetic tree (Fig. 1a), rooted in ACE2 of A. mississippiensis, showed two distinct clades. Clade 1 was clustered into three subclades: bats (A), ungulates (B), and felines/canids (carnivores) together with known SARS-CoV and SARS-CoV-2 hosts (C). Clade 2 is a small and outside clade composed of primates, encompassing humans. D. novemcinctus (armadillo) was placed distantly in the tree.

Fig. 1
figure 1

Evolutionary analysis based on angiotensin-converting enzyme 2 (ACE2). a Phylogenetic tree included 34 amino acid sequences. The species in blue represent the known severe acute respiratory syndrome coronavirus (SARS-CoV)-2 hosts, the species in gray represent the known MERS-CoV host, and the species in red represent the known SARS-CoV hosts. The numbers in the phylogenetic tree represent the posterior probability that are the confidence values from each clade (the higher confidence is 1). Scale bar indicates the number of substitutions/site for the trees. b Bayesian phylogenetic tree was based on ACE2 protein. ACE2 protein sequences were aligned and used to infer the evolutionary divergence values, represented in the matrix heatmap. The heatmap color gradient represents the evolutionary divergence based on the number of amino acid substitutions/site from a pairwise comparison between sequences, from low (red) to high (blue). Evolutionary divergence analyses were conducted using the JTT matrix-based model. The heatmap were constructed using the Microsoft Excel™ software

The phylogenetic tree of ACE2 sequence (Fig. 1a) revealed that clade 1C of SARS-CoV intermediate hosts nested with Brazilian wild carnivores. Paguma larvata (civet) is closely related to Puma concolor (cougar), whereas Nyctereutes procyonoides (raccoon dog) is closely related to C. brachyurus (maned wolf) and Speothos venaticus (bush dog). These close evolutionary relationships were confirmed by low divergence between SARS-CoV hosts and the indigenous carnivores (Fig. 1b and Table 1). Manis javanica (Malayan pangolin), the intermediate host of SARS-CoV-2, was placed together with cougar, maned wolf and bush dog in clade 1C and showed low divergence of ACE2 protein compared with these animals (Fig. 1b and Table 1), although it is not close to the Brazilian carnivores such as the civet and raccoon dog are.

Table 1 Angiotensin-converting enzyme 2 (ACE2) based-evolutionary divergence

ACE2-based analyses showed that humans are phylogenetically distant and evolutionarily divergent from bats. Bats were placed closer to the mammals of clades 1B and 1C than humans from clades 1B and 1C in the phylogenetic tree (Fig. 1a). However, the evolutionary divergence of humans from wild carnivores is lower than that from bats (Fig. 1b and Table 1). Among the indigenous carnivores, cougar showed lowest divergence from human ACE2 receptor (Table 1).

An additional analysis targeting key residues based on the binding region of ACE2 protein, which are important regions in SARS-CoV-2 S protein-ACE2 interactions, revealed similarities between humans and other mammals (Table 2). Sequences from cougars had higher similiarity with those from humans; only three different ACE2 key residues involved in binding to S protein were observed between them. The similarity between cougar and human sequences was even higher than the similarities within the primate group. ACE2 key residues, involved in binding to S protein, from bush dog, maned wolf and white-tailed deer were also highly similar to human ACE2 key residues. Human ACE2 key residues in binding regions were more similar to those of indigenous mammals than to those of known coronavirus hosts, such as camel, civet and pangolin. Evolutionary divergent analysis based on ACE2 regions containing the residues that interface with SARS-CoV-2 S protein of different species also showed low divergence between human and cougar and white-tailed deer (Table 2 and Fig. 2). Moreover, ACE2 binding region from cougar and deer presented a low divergence to those from indigenous bats (Fig. 2 and Supplementary Table 2). These results reinforce the potential of Brazilian mammals to serve as susceptible hosts for SARS-CoV-2. Among Brazilian bats, ACE2 key residues from Anoura geoffrey had higher similarity in comparison with those of humans. However, A. geoffrey is a known host for alphacoronavirus, but there has been no reports of this bat species harbouring betacoronavirus (Corman et al. 2013).

Table 2 Comparison of Angiotensin-converting enzyme 2 (ACE2) key residues involved in binding to S protein of SARS-CoV-2
Fig. 2
figure 2

Heatmap matrix was based on four ACE2 binding regions that encompass the key residues enabling interactions with the SARS-CoV-2 S protein, detailed on Table 2 and Supplementary file 1. The ACE2 binding regions were concatened and used to infer the evolutionary divergence values, represented in the matrix heatmap. The evolutionary divergence values are available in the Supplementary Table 2. The heatmap color gradients represent the evolutionary divergence based on the number of amino acid substitutions/site from a pairwise comparison between sequences, from low (red) to high (blue). Evolutionary divergence analyses were conducted using the JTT matrix-based model. The heatmap were constructed using the Microsoft Excel™ software

Our results, based on ACE2 analysis, revealed a close relationship and low divergence between the ungulates Camel ferus (camel), a known MERS-CoV host, and Odocoileus virginianus (white-tailed deer), a host susceptible to SARS-CoV-2 infection (Fig. 1 and Table 1). Additionally, the key residues in binding region of ACE2 protein of ungulates had few differences in comparison with those of human ACE2 (Table 2 and Fig. 2). MERS-CoV transmission from camels to humans requires close exposure to be effective (Han et al. 2016). In this context, humans also come in close contact with white-tailed deer because of their hunting and use as livestock (McShea 2012). SARS-CoV-2 transmission in farmed animals (American mink) has already been reported (Munnink et al. 2021). Furthermore, deer have high population density and are broadly distributed across all American continents, that are next to urban areas. For controling the population density of deer, their hunting is considered legal in some territories (McShea 2012). In northern Brazil, white-tailed deer are hunted and their meat is consumed by quilombola communities (de Figueiredo et al. 2016).

Based on the close relationship between humans and white-tailed deer, and considering their susceptibility to SARS-CoV-2 infection, we proposed a scenario that represents the transmission chain of SARS-CoV-2, including the potential hosts predicted by our evolutionary bioinformatics analyses and considering the ecological features of the wild mammals (Fig. 3). White-tailed deer have intricate relationships with other wild animals and humans. For instance, white-tailed deer (and other deer species) have been frequently preyed on by cougars (Cooley et al. 2008; McShea 2012). Both the species share a large spatial area on the American continent. In some locations, cougars appear to primarily subsist on white-tailed deer during winter (Cooley et al. 2008). This situation could favor the transmission of SARS-CoV-2 from deer to cougar. In addition, our results showed low ACE2 evolutionary divergence between deer and cougar (Table 1), which may further contribute to potential transmission. SARS-CoV-2-infected wild felines from zoos (McAloose et al. 2020) and domestic cats (Halfmann et al. 2020) may contribute to the SARS-CoV-2 transmission hypothesis of the cougars being infected by SARS-CoV-2. SARS-CoV-2 spillover from humans to cougars is possible because of substantial conflicts between the two in farming frontiers (de Souza et al. 2018; Palmeira et al. 2015).

Fig. 3
figure 3

Diagram displaying the potential transmission chain of severe acute respiratory syndrome coronavirus (SARS-CoV)-2 among humans and wild animals. Known SARS-CoV-2 hosts (horseshoe bat and pangolin), susceptible hosts confirmed by previous studies (white-tailed deer, mink, domestic cat, and zoo felines) and potential SARS-CoV-2 hosts (cougar, maned wolf and bush dog) predicted by bioinformatics analysis were included. (From left to right) Horseshoe bats (Rhinolophus genus), the main animal reservoir of SARS-CoV-2 (SARS-like-CoV) and Malayan pangolin, the SARS-CoV-2 intermediate host. White-tailed deer are susceptible to SARS-CoV-2 infection and are preyed by cougars, a potential SARS-CoV-2 host predicted by this study. SARS-CoV-2 may transmit from humans to cougars, considering that lions and tigers were infected by SARS-CoV-2 in the zoo. Maned Wolf, a potential SARS-CoV-2 host, can prey on other deer species. Bush dog is likely to prey on deer, but evidence is lacking. D. rotundus, a vampire bat, is a SARS-like-CoV host that can feed on white-tailed deer and other deer species. Moreover, D. rotundus could spread coronavirus to other bat species or even to cougar, maned wolf or bush dog, although there is no scientific confirmation

Additional ecological interactions involving white-tailed deer place these animals at the center of potential transmission chain (Fig. 3). A study reported that D. rotundus feed on white-tailed deer (Sánchez-Cordero et al. 2011). The high abundance of deer may constitute an important feed resource for vampire bats. The susceptibility of D. rotundus toward SARS-like-CoV infections indicates that vampire bat might be infected by SARS-CoV-2 and transmit the virus to other animals. D. rotundus was also observed feeding on another cervid species, red brocket deer (Mazama americana) (Zortéa et al. 2018). In addition, these South American cervid are preying upon by maned wolf (Juarez and Marinho-Filho 2002) and by bush dog, according to the observation of individuals living next to wild habitat (Beisiegel and Ades 2002). Although the susceptibility of red brocket deer to SARS-CoV-2 has not yet been confirmed, all cervid species should be considered potential hosts for SARS-CoV-2 because of the close evolutionary relationship between these animals and white-tailed deer (Supplementary Fig. 1).

Taken together, connections between SARS-CoV-2 susceptible hosts with potential hosts comprise a complex network and hence might contribute to SARS-CoV-2 transmission and emergence of new coronavirus strains. The history of SARS shows that coronavirus presents high variability, potential to infect wildlife and rapid evolution to adapt in new susceptible hosts, thus becoming a threat to humans (Gong and Bao 2018). SARS-like-CoV strains, frequently found in wildlife after the end of the SARS outbreak, can be implicated in SARS-CoV-2 origin (Zhang et al. 2020). High mutation rate and worldwide spread of SARS-CoV-2 contribute to the emergence of new variants (Benedetti et al. 2020). Thus, high mutation rates of SARS-CoV-2 may favor virus transmission in susceptible wild hosts.

Considering ACE2 receptor as a transmission barrier that interferes with the coronavirus spillover to humans, a low divergence between ACE2 receptors of susceptible hosts may favor cross-species transmission. The close evolutionary relationship between the known coronavirus intermediate hosts and Brazilian mammals based on ACE2 sequences showed that other hosts might be also be involved in the chain. Our results suggest that cougar, maned wolf, bush dog, and white-tailed deer, as well as their relative species, are potential hosts for SARS-CoV-2 transmission. We focused on white-tailed deer as a susceptible SARS-CoV-2 host because of their increased population, wide distribution, and close contact with humans and other potential hosts such as cougars. In this context, white-tailed deer and cougar (prey and predator) could be involved in coronavirus transmission together with tropical bats susceptible to betacoronavirus. The central role of white-tailed deer during SARS-CoV-2 outbreak, as well camels during MERS outbreak, can make them important coronavirus reservoirs. However, these results must be confirmed through animal experiments and studies to detect coronavirus strains and assess the ability of these animals to serve as susceptible hosts.

In Brazil, SARS-CoV-2 infections have been rapidly spreading, since the beginning of the coronavirus disease 2019 (COVID-19) pandemic, favoring the emergence of new variants (da Silva and Pena 2021). Thus, a high incidence of SARS-CoV-2 cases in populations that live in proximity to wild habitats, which harbor a large diversity of mammals, can be an additional and critical scenario of new emerging coronavirus strains in Brazilian territories. Genomic surveillance should be widely implemented to identify new variants and establish measures to control SARS-CoV-2 transmission. However, in the context of Brazil, SARS-CoV-2 surveillance has been slow (da Silva and Pena 2021). The discovery of potential hosts for coronaviruses is essential for epidemiological surveillance. Moreover, coronavirus transmission chain can be broken using effective intervention if all players involved in the chain are identified. Therefore, the surveillance of SARS-CoV-2 in domestic or wild animals, specially those with low divergence in the ACE2 receptor when compared to that of humans, should be broadly applied. Brazil should join forces to promote multi-sectoral responses, reducing the impact of the SARS-CoV-2 pandemic and consequently preventing the emergence of new coronavirus outbreaks.

Availability of data and materials

All the data obtained from the NCBI Protein and NCBI GenBank databases included in this research study presented accession code/numbers for research or reanalysis.



Angiotensin-converting enzyme 2


Coronavirus disease 2019




Middle East respiratory syndrome coronavirus


Middle East respiratory syndrome coronavirus coronavirus


General reversible Markov model for mitochondrial DNA-encoded proteins


National Center for Biotechnology Information




Severe acute respiratory syndrome


Severe acute respiratory syndrome coronavirus


Severe acute respiratory syndrome coronavirus 2


Download references


The authors thank Universidade Federal de São Paulo (Unifesp) for the support.

Code availability

Not applicable.


Not applicable.

Author information

Authors and Affiliations



LRL conceived the work with the GMC, PBP and, ACSJ contributions. The NCLP and LRL obtained and curated the data. LRL, PBP and, GMC made inferences and results analyses. LRL wrote the draft paper with GMC, PBP, NCLP, SKK and ACSJ contributions. All authors contributed to the writing of the manuscript. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Luciano Rodrigo Lopes.

Ethics declarations

Ethics approval and consent to participate

This research included secondary biological data with no possibility of individual identification. The Unifesp Research Ethics Committee was consulted and it was determined that this study did not require ethical approval.

Consent for publication

Not applicable.

Competing interests

The authors state that there is no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplementary Table 1.

Protein accession numbers.

Additional file 2. 

ACE2 key residues interfacing with SARS-CoV-2 S protein.

Additional file 3: Supplementary Table 2.

Estimates of evolutionary divergence between ACE2 regions that encompass the key residues enabling interactions with the SARS-CoV-2 S protein. The number of amino acid substitutions per site between sequences are shown. Evolutionary analyses were performed using MEGA X software. Analyses were conducted using the JTT matrix-based model. The rate variation among sites was modeled with a gamma distribution. This analysis involved 34 amino acid sequences aligned using MUSCLE. All ambiguous positions were removed for each sequence pair (pairwise deletion option). A total of 40 positions were included in the final dataset.

Additional file 4: Supplementary Figure 1.

Bayesian tree based on 11 cervid mitochondrial cytochrome b amino acid sequences. Phylogeny was performed using MrBayes 3.2.v. and the mtREV+G. Values of posterior probabilities are shown at the nodes of interest. The scale bar indicates the number of substitutions/site for the trees. Phylogenetic tree was formatted using the FigTree v1.3.1 software (

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Lopes, L.R., de Mattos Cardillo, G., de Lucca Pina, N.C. et al. Prediction of SARS-CoV-2 hosts among Brazilian mammals and new coronavirus transmission chain using evolutionary bioinformatics. Animal Diseases 1, 20 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • SARS-CoV-2
  • Angiotensin-converting enzyme 2
  • Coronavirus
  • Brazilian mammals
  • White-tailed deer