A selenocysteine tRNA and SECIS element in Plasmodium falciparum

  1. TOBIAS MOURIER,
  2. ARNAB PAIN,
  3. BART BARRELL, and
  4. SAM GRIFFITHS-JONES
  1. The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom

Abstract

The molecular machinery for incorporating selenocysteine into proteins is present in both prokaryotes and eukaryotes. Although selenocysteine insertion has been reported in animals, plants, and protozoans, known eukaryotic selenocysteine tRNA sequences and selenocysteine insertion sequences are limited to animals and plants. Here we present clear indications of the presence of selenocysteine-tRNA and a selenocysteine insertion sequence in Plasmodium falciparum. To our knowledge, this is the first report of an identification of protozoan selenocysteine insertion machinery at the sequence level.

Keywords

Systematic incorporation of selenium into protein is facilitated by a tRNA on which the initial serine is converted to selenocysteine (hence designated tRNA[Ser]Sec). In the presence of a selenocysteine insertion sequence (SECIS) in the mRNA, the tRNA[Ser]Sec recognizes UGA that otherwise acts as a stop codon. Selenoproteins are most often involved in redox regulation of metabolic processes; disruption of eukaryotic selenocysteine insertion is in several cases reported to be lethal to the organism (Bosl et al. 1997; Kohrle et al. 2000). Eukaryotic proteins with incorporated selenocysteines have been found in animals and in a single representative from plants (Fu et al. 2002; Novoselov et al. 2002) and protozoans (Osaka et al. 2003), respectively.

From the genome of the apicomplexan malaria parasite Plasmodium falciparum (Gardner et al. 2002) we have identified a putative tRNA[Ser]Sec. This prompted a search for other indications of the presence of selenocysteine insertion machinery, which revealed a SECIS-like sequence, conserved across Plasmodium phylogeny, and with covariation indicative of a conserved secondary structure. The P. falciparum SECIS element is situated downstream of MAL8P1.86, a protein of unknown function, in which the presence of a putative TGA selenocysteine codon has caused gene prediction algorithms to incorrectly identify a 60-bp intron covering the TGA codon.

A possible selenocysteine tRNA was identified by a whole genome analysis of (G + C)-rich regions in P. falciparum. This was supported by tRNAscan-SE prediction (Lowe and Eddy 1997) using the covariance model analysis only for maximum sensitivity. Running tRNAscan-SE at the default setting did not result in any tRNA predictions, indicating a divergent primary structure. Homologs of the putative tRNA[Ser]Sec were detectable in a range of other Plasmodium genomes (Figs. 1, 2). We also detected Sec-tRNAs in Toxoplasma gondii and Eimeria tenella (Figs. 1, 2), but not in Theileria annulata or Cryptosporidium parvum. The putative apicomplexan Sec-tRNAs display the atypically long variable arm and a resulting length of around 90 nt previously seen in other eukaryotic tRNA[Ser]Sec (Hatfield and Gladyshev 2002). The P. falciparum tRNA[Ser]Sec is flanked by a Met-tRNA and the gene encoding PF14_0361 on chromosome 14. Although data are limited by the unfinished nature of the genomes, this region appears to have conserved synteny in Plasmodium species (data not shown).

In a previous study, Lambert and coworkers published a list of confirmed and putative SECIS elements identified using the ERPIN program (Gautheret and Lambert 2001; Lambert et al. 2002). This computational prediction included three low-scoring SECIS elements from P. yoelii and P. vivax. WU-BLAST (http://blast.wustl.edu/) searches with the P. yoelii sequence returned significant matches from a range of other Plasmodium genomes (Fig. 1). An alignment of the putative SECIS elements is shown in Figure 3. The elements display three helices, hence conforming to the Form 2 SECIS element structure (for review, see Krol 2002). A conventional eukaryotic non-Watson–Crick quartet (UGAN-NGAN, highlighted in Fig. 3) preceded by adenosine is conserved in all candidate sequences. Also, the invariant AA pair in the internal loop 2 is conserved (Korotkov et al. 2002).

The putative P. falciparum SECIS element is situated 134 bp downstream of the MAL8P1.86 gene. The translated product of this gene has no annotated function and does not contain any known protein domains (Pfam15: Bateman et al. 2004). Microarray analysis has shown up-regulation of MAL8P1.86 expression during the P. falciparum trophozoite stage in human red blood cells (Bozdech et al. 2003). The MAL8P1.86 gene is annotated as having a 60-bp intron in which the TGA codon resides (GeneDB, http://www.genedb.org/). The TGA codon is conserved in Plasmodium genomes as is the coding potential of the entire intron (Fig. 4). The possibility of a highly conserved intron is rendered unlikely by the finding that the splice signals are not detected outside P. falciparum (Fig. 4). Hence, rather than being an intron, this sequence most likely represents the part of the coding sequence containing a selenocysteine codon. The presence of selenocysteine insertion in P. falciparum is further supported by the fact that the P. falciparum genome encodes a putative selenocysteine-specific elongation factor and a putative selenophosphate synthetase (PFA0495c and PFI0505, respectively).

The SECIS element from P. berghei resides in the very proximal end of a contig, and the absence of a MAL8P1.86 homolog is most likely due to low genome coverage (Fig. 1). Given the close relationship between P. falciparum and P. reichenowi, the fact that neither the SECIS element nor any MAL8P1.86 homolog is detectable in the latter (Fig. 1) is most easily explained by low coverage (consistent with the ∼1× coverage of the P. reichenowi genome). However, we cannot rule out the possibility that these features have been recently lost (within the last ∼10 million years; Rich and Ayala 2000) and that the P. reichenowi Sec-tRNA is conserved for the processing of another selenoprotein (see below).

Searching the complete P. falciparum genome with covariance models built from alignments of both animal SECIS elements and the Plasmodium sequences identified here did not suggest further convincing SECIS candidates (Infernal: http://www.genetics.wustl.edu/eddy/infernal/, Eddy 2002; Griffiths-Jones et al. 2003). Hence, if P. falciparum hosts further SECIS elements, these will be divergent in sequence and secondary structure. The human genome is reported to contain 25 selenoproteins (Kryukov et al. 2003), but as Caenorhabditis elegans appears to only encode a single selenoprotein (Gladyshev et al. 1999), this would not be unprecedented in eukaryotic genomes.

Given the involvement of selenoproteins in redox regulation and the up-regulation of MAL8P1.86 during a Plasmodium stage in human blood cells, it is tempting to speculate that the Plasmodium selenoprotein may have a role in metabolic processes related to redox regulation within infected red blood cells. It may not be coincidental that MAL8P1.86 is located next to a superoxide dismutase (SOD) gene. The Plasmodium selenoprotein—which unlike the majority of known eukaryotic selenoproteins is not widely conserved—may play a part in a novel and vital SOD-related pathway. Identification of this malarial selenoprotein may lead to new areas of malaria biology yet to be explored. However, until more is known about the exact function of MAL8P1.86 this remains purely conjectural.

As ab initio gene prediction algorithms will interpret any TGA selenocysteine codon as a stop codon, selenoproteins are likely to be missed in genome annotations. Therefore, selenoprotein detection requires more detailed analysis, including direct experimental detection of incorporated selenium (Gladyshev et al. 1999; Novoselov et al. 2002; Osaka et al. 2003), structural analysis of SECIS structures (Lambert et al. 2002; Kryukov et al. 2003), or—as in the present study—a comparative analysis of related genomes. With the above in mind, it is likely that reports of protozoan selenoproteins will continue to appear, perhaps rendering systematic selenocysteine insertion more the rule than the exception in the tree of life.

FIGURE 1.

Presence of detectable components in apicomplexan genomes. Evolutionary relationship is indicated on the left (Escalante and Ayala 1994; Barta et al. 2001). (nd) Not detectable. Sequence sources: Published genomes of P. falciparum, P. yoelii, and C. parvum appear in Gardner et al. (2002), Carlton et al. (2002), and Abrahamsen et al. (2004), respectively; unfinished genomes of P. berghei, P. chabaudi, P. gallinaceum, P. knowlesi, P. reichenowi, E. tenella, and T. annulata are provided by The Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/) and GeneDB (http://www.genedb.org/); draft genome sequence of T. gondii was searched at ToxoDB (http://toxodb.org/).


FIGURE 2.

Alignment of putative Sec-tRNAs. The predicted secondary structure consensus is shown below as nested parentheses. Only residues differing from the P. falciparum sequence are shown. Identical residues indicated as dots, gaps as dashes. The UCA anticodon and the long variable arm are highlighted.


FIGURE 3.

(A) Alignment of putative SECIS elements. Only residues differing from the P. falciparum sequence are shown. Gray-shaded residues are able to base pair in accordance with the predicted consensus secondary structure shown below. (B) Predicted secondary structure for the putative P. falciparum SECIS element. The non-Watson–Crick quartet and the invariant AA pair (Korotkov et al. 2002; Krol 2002) are denoted by black boxes and bold letters in alignment and structure, respectively.


FIGURE 4.

Alignment of genomic regions with a predicted selenocysteine codon. Genomic DNA (top) from five Plasmodium species. Only residues differing from the P. falciparum sequence are shown. Annotated intron in P. falciparum is in lower case. Note that the suggested donor and acceptor sites are not conserved. Translated protein sequences are shown below. The TGA codon (highlighted) is translated as selenocysteine (U).


Acknowledgments

T.M. was supported by a grant from The Carlsberg Foundation. Work at The Sanger Institute is funded by The Wellcome Trust.

Footnotes

REFERENCES

« Previous | Next Article »Table of Contents