The TMEM106A protein has a molecular weight of 28.9 kDa. It has 262 amino acids, 240 of which are in the domain of function.[5] The protein has a transmembrane region.[7] There is evidence for a secondary transmembrane region in humans but this region is not conserved in related orthologs.[8] The protein does not contain a signal peptide.[9] The protein structure contains a similar proportion of alpha-helix and beta-strand secondary structures (this does not include transmembrane structures).[10][11]
TMEM106A protein with beta-sheets (red), alpha-helices (blue), and transmembrane region (grey)
There are several areas for post-translational modification for TMEM106A including:
The TMEM106A gene has two paralogs: TMEM106B and TMEM106C. These paralogs belong to the gene family pfam07092, which belongs to the DUF1356 superfamily. This family consists of several mammalian proteins that are around 250 amino acids in length.[15] TMEM106B and TMEM106C are conserved in invertebrates to mammals.
The TMEM106A gene has been found in only the Chordate phylum.[20] Of the three subphyla, TMEM106A is most commonly found in Vertebrata and has also been found in select members of Tunicata, which are invertebrate marine filter feeders. This phylum split occurred 722.5 million years ago.[21]TMEM106A has not been seen in bacteria, plants, or fungi.
TMEM106A is expressed in several human tissues. The tissues with highest expression are uterus, kidneys, small intestine, and stomach.[16][22] EST profiles for orthologs show expression is conserved with greatest expression in kidneys and lesser expression in several other areas.[23] Some tissues never show expression including: muscle, adipose tissue, and bone.
Gene neighborhood
In Homo sapiens, TMEM106A is located next to NBR1, a gene identified as an ovarian tumor antigen monitored in ovarian cancer.[24] It is also located near BRCA1, a breast cancertumor suppressor gene.[25] The first 140 amino acids of the TMEM106A protein, including portions of DUF1356 and a transmembrane region, are deleted along with BRCA1 during early-onset breast cancer.[26]
^Garnier J, Osguthorpe DJ, Robson B (March 1978). "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins". J. Mol. Biol. 120 (1): 97–120. doi:10.1016/0022-2836(78)90297-8. PMID642007.
^Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". J. Mol. Biol. 294 (5): 1351–62. doi:10.1006/jmbi.1999.3310. PMID10600390.