is their ability to adapt to changing population of invaders, which leads to selection pressure on the
invaders to evade these defenses, thereby creating arms race dynamics (McLaughlin and Malik, 2017).
There is growing evidence that KZNFs are engaged in such arms races with TEs (Bruno et al., 2019;
Fernandes et al., 2018; Jacobs et al., 2014). KZNFs form the largest ZNF gene subfamily found in
Sarcopterygii, i.e. tetrapods and lobe-finned fish (Imbeault et al., 2017; Bellefroid et al., 1991). The
extensive variation in KZNF copy number across species has long been appreciated (Bellefroid et al.,
1991; Huntley et al., 2006), but the evolutionary forces driving this remained elusive until a breakthrough
study showing that the number of ZNF domains in a given genome is positively correlated to retroelement
copy number across a small but diverse sample of vertebrates, suggesting a coevolutionary relationship
between the two (Thomas and Schneider, 2011). This relationship was later bolstered by ChIP-seq
experiments in humans and mice mapping the genome-wide binding of hundreds of KZNFs, which
revealed that most target specific TE families (Imbeault et al., 2017; Wolf et al., 2020). Furthermore,
KZNF knockouts in mouse and humans lead to upregulation of TE expression (Haring et al., 2021; Wolf
et al., 2020). Mechanistic studies showed that TE transcriptional repression via KZNF proteins is
typically mediated via their KRAB domain, which interacts with KAP1/TRIM28 corepressor to recruit
the H3K9me3 writer, SETDB1, amongst several other chromatin silencing factors (Wolf and Goff, 2009;
Rowe et al., 2010; Matsui et al., 2010; Ecco et al., 2017); in doing so, most KZNFs nucleate the formation
of heterochromatin at their target TE loci. Together, these findings support the idea that TE proliferation
is a driving force behind the diversification of KZNFs.
Much less is known about the factors driving the evolution of other ZNF families. KZNFs represent a
small fraction of all ZNFs, being restricted to some thirty thousand tetrapods, against a background of
millions of animal species (Mora et al., 2011; Sahney et al., 2010), most of which are predicted to harbor
hundreds of individual ZNF genes. One of the few other large families to have been studied are the zinc
finger-associated domain (ZAD)-ZNFs, which are found in many insect species (Chung et al., 2007). The
handful of ZAD-ZNFs characterized in Drosophila melanogaster do not appear to have clear roles in TE
repression, but instead perform a variety of functions related to heterochromatin organization (Kasinathan
et al., 2020). Thus far, only one ZAD-ZNF gene (CG17801) has been reported to directly affect TE
transcript levels, and was identified as part of an earlier screen for piRNA pathway components (Czech et
al., 2013). More recently, the ZAD-ZNF protein Kipferl has been shown to regulate TE activity indirectly,
by targeting the protein Rhino to heterochromatic piRNA clusters in Drosophila melanogaster, thus
licensing piRNA production from these loci (Baumgartner et al., 2022). Despite apparent differences in
the immediate function of most ZAD- and KZNF genes, both families share features that are broadly
characteristic of the majority of ZNF genes thus far studied, namely: rapid turnover and sequence
evolution, evidence of being under positive selection, involvement in establishing or maintaining
heterochromatin, and expression during early embryogenesis (Chung et al., 2007; Kasinathan et al., 2020;
Ecco et al., 2017). Other highly expanded ZNF families, such as those of hemipteran bugs, octopus, and
zebrafish, also possess subsets of these features (Panfilio et al., 2019; Albertin et al., 2015; Howe et al.,
2016).
Here, we investigate the hypothesis that interaction with TEs is a driving force underlying the molecular
evolution and functional diversification of ZNF families across metazoans. Through a survey of all
currently available metazoan genome assemblies, we find that the number of ZNF open reading frames is
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted November 30, 2022. ; https://doi.org/10.1101/2022.11.29.518450doi: bioRxiv preprint