Pathway: PKMTs methylate histone lysines
Reactions in pathway: PKMTs methylate histone lysines :
PKMTs methylate histone lysines
Lysine methyltransferases (KMTs) and arginine methyltransferases (RMTs) have a common mechanism of catalysis. Both families transfer a methyl group from a common donor, S-adenosyl-L-methionine (SAM), to the nitrogen atom on the epsilon-amino group of lysine or arginine (Smith & Denu 2009) using a bimolecular nucleophillic substitution (SN2) methyl transfer mechanism (Smith & Denu 2009, Zhang & Bruice 2008). All human KMTs except DOT1L (KMT4) (Feng et al. 2002, van Leeuwen et al. 2002, Lacoste et al. 2002) have a ~130 amino acid catalytic domain referred to as the SET domain (Del Rizzo & Trievel 2011, Dillon et al. 2005, Herz et al. 2013).
Some KMTs selectively methylate a particular lysine residue on a specific histone type. The extent of this methylation (mono-, di- or tri-methylation) also can be stringent (Herz et al. 2013, Copeland et al. 2009). Many KMTs also have non-histone substrates (Herz et al 2013), which are not discussed in this module.
The coordinates of post-translational modifications represented and described here follow UniProt standard practice whereby coordinates refer to the translated protein before any processing. Histone literature typically refers to specific residues by numbers which are determined after the initiating methionine has been removed. Therefore the coordinates of post-translated residues in the Reactome database and described here are frequently +1 when compared to the histone literature.
SET domain-containing proteins are classified in one of 7 families (Dillon et al. 2005). First to be discovered were the SUV39 family named after founding member SUV39H1 (KMT1A), which selectively methylates lysine-10 of histone H3 (H3K9) (Rea et al. 2000). Family member EHMT2 (KMT1C, G9A) is the predominant H3K9 methyltransferase in mammals (Tachibana et al. 2002). SETDB1 (KMT1E, ESET) also predominantly methylates H3K9, most effectively when complexed with ATF7IP (MCAF, hAM) (Wang et al. 2003).
SETD2 (KMT3A, HYPB), a member of the SET2 family, specifically methylates histone H3 lysine-37 (H3K36) (Sun et al. 2005). WHSC1 (KMT3G, NSD2, MMSET) a member of the same family, targets H3K36 when provided with nucleosome substrates but also can methylate histone H4 lysine-45 when octameric native or recombinant nucleosome substrates are provided (Li et al. 2009); dimethylation of histone H3 at lysine-37 (H3K36me2) is thought to be the principal chromatin-regulatory activity of WHSC1 (Kuo et al. 2011). Relatives NSD1 (KMT3B) and WHSC1L1 (KMT3F, NSD3) also methylate nucleosomal H3K36. NSD1 is active on unmethylated or a mimetic monomethylated H3K36, but not di- or trimethylated H3K36 mimetics (Li et al. 2009). Human SETD7 (KMT7, SET7/9), not classified within the 7 SET-domain containing families, mono-methylates lysine-5 of histone H3 (H3K4) (Xiao et al. 2003).
Some KMTs selectively methylate a particular lysine residue on a specific histone type. The extent of this methylation (mono-, di- or tri-methylation) also can be stringent (Herz et al. 2013, Copeland et al. 2009). Many KMTs also have non-histone substrates (Herz et al 2013), which are not discussed in this module.
The coordinates of post-translational modifications represented and described here follow UniProt standard practice whereby coordinates refer to the translated protein before any processing. Histone literature typically refers to specific residues by numbers which are determined after the initiating methionine has been removed. Therefore the coordinates of post-translated residues in the Reactome database and described here are frequently +1 when compared to the histone literature.
SET domain-containing proteins are classified in one of 7 families (Dillon et al. 2005). First to be discovered were the SUV39 family named after founding member SUV39H1 (KMT1A), which selectively methylates lysine-10 of histone H3 (H3K9) (Rea et al. 2000). Family member EHMT2 (KMT1C, G9A) is the predominant H3K9 methyltransferase in mammals (Tachibana et al. 2002). SETDB1 (KMT1E, ESET) also predominantly methylates H3K9, most effectively when complexed with ATF7IP (MCAF, hAM) (Wang et al. 2003).
SETD2 (KMT3A, HYPB), a member of the SET2 family, specifically methylates histone H3 lysine-37 (H3K36) (Sun et al. 2005). WHSC1 (KMT3G, NSD2, MMSET) a member of the same family, targets H3K36 when provided with nucleosome substrates but also can methylate histone H4 lysine-45 when octameric native or recombinant nucleosome substrates are provided (Li et al. 2009); dimethylation of histone H3 at lysine-37 (H3K36me2) is thought to be the principal chromatin-regulatory activity of WHSC1 (Kuo et al. 2011). Relatives NSD1 (KMT3B) and WHSC1L1 (KMT3F, NSD3) also methylate nucleosomal H3K36. NSD1 is active on unmethylated or a mimetic monomethylated H3K36, but not di- or trimethylated H3K36 mimetics (Li et al. 2009). Human SETD7 (KMT7, SET7/9), not classified within the 7 SET-domain containing families, mono-methylates lysine-5 of histone H3 (H3K4) (Xiao et al. 2003).
Eukaryotic DNA is associated with histone proteins and organized into a complex nucleoprotein structure called chromatin. This structure decreases the accessibility of DNA but also helps to protect it from damage. Access to DNA is achieved by highly regulated local chromatin decondensation.
The 'building block' of chromatin is the nucleosome. This contains ~150 bp of DNA wrapped around a histone octamer which consists of two each of the core histones H2A, H2B, H3 and H4 in a 1.65 left-handed superhelical turn (Luger et al. 1997, Andrews & Luger 2011).
Most organisms have multiple genes encoding the major histone proteins. The replication-dependent genes for the five histone proteins are clustered together in the genome in all metazoans. Human replication-dependent histones occur in a large cluster on chromosome 6 termed HIST1, a smaller cluster HIST2 on chromosome 1q21, and a third small cluster HIST3 on chromosome 1q42 (Marzluff et al. 2002). Histone genes are named systematically according to their cluster and location within the cluster.
The 'major' histone genes are expressed primarily during the S phase of the cell cycle and code for the bulk of cellular histones. Histone variants are usually present as single-copy genes that are not restricted in their expression to S phase, contain introns and are often polyadenylated (Old & Woodland 1984). Some variants have significant differences in primary sequence and distinct biophysical characteristics that are thought to alter the properties of nucleosomes. Others localize to specific regions of the genome. Some variants can exchange with pre-existing major histones during development and differentiation, referred to as replacement histones (Kamakaka & Biggins 2005). These variants can become the predominant species in differentiated cells (Pina & Suau 1987, Wunsch et al. 1991). Histone variants may have specialized functions in regulating chromatin dynamics.
The H2A histone family has the highest sequence divergence and largest number of variants. H2A.Z and H2A.XH2A are considered 'universal variants', found in almost all organisms (Talbert & Henikoff 2010). Variants differ mostly in the C-terminus, including the docking domain, implicated in interactions with the (H3-H4)x2 tetramer within the nucleosome, and in the L1 loop, which is the interaction interface of H2A-H2B dimers (Bonisch & Hake 2012). Canonical H2A proteins are expressed almost exclusively during S-phase. There are several nearly identical variants (Marzluff et al. 2002). No functional specialization of these canonical H2A isoforms has been demonstrated (Bonisch & Hake 2012). Reversible histone modifications such as acetylation and methylation regulate transcription from genomic DNA, defining the 'readability' of genes in specific tissues (Kouzarides 2007, Marmorstein & Trievel 2009, Butler et al. 2012).
N.B. The coordinates of post-translational modifications represented here follow Reactome standardized naming, which includes the UniProt standard practice whereby coordinates refer to the translated protein before any further processing. Histone literature typically refers to coordinates of the protein after the initiating methionine has been removed; therefore the coordinates of post-translated histone residues described here are frequently +1 when compared with the literature. For more information on Reactome's standards for naming pathway events, the molecules that participate in them and representation of post-translational modifications, please refer to Naming Conventions on the Reactome Wiki or Jupe et al. 2014.
The 'building block' of chromatin is the nucleosome. This contains ~150 bp of DNA wrapped around a histone octamer which consists of two each of the core histones H2A, H2B, H3 and H4 in a 1.65 left-handed superhelical turn (Luger et al. 1997, Andrews & Luger 2011).
Most organisms have multiple genes encoding the major histone proteins. The replication-dependent genes for the five histone proteins are clustered together in the genome in all metazoans. Human replication-dependent histones occur in a large cluster on chromosome 6 termed HIST1, a smaller cluster HIST2 on chromosome 1q21, and a third small cluster HIST3 on chromosome 1q42 (Marzluff et al. 2002). Histone genes are named systematically according to their cluster and location within the cluster.
The 'major' histone genes are expressed primarily during the S phase of the cell cycle and code for the bulk of cellular histones. Histone variants are usually present as single-copy genes that are not restricted in their expression to S phase, contain introns and are often polyadenylated (Old & Woodland 1984). Some variants have significant differences in primary sequence and distinct biophysical characteristics that are thought to alter the properties of nucleosomes. Others localize to specific regions of the genome. Some variants can exchange with pre-existing major histones during development and differentiation, referred to as replacement histones (Kamakaka & Biggins 2005). These variants can become the predominant species in differentiated cells (Pina & Suau 1987, Wunsch et al. 1991). Histone variants may have specialized functions in regulating chromatin dynamics.
The H2A histone family has the highest sequence divergence and largest number of variants. H2A.Z and H2A.XH2A are considered 'universal variants', found in almost all organisms (Talbert & Henikoff 2010). Variants differ mostly in the C-terminus, including the docking domain, implicated in interactions with the (H3-H4)x2 tetramer within the nucleosome, and in the L1 loop, which is the interaction interface of H2A-H2B dimers (Bonisch & Hake 2012). Canonical H2A proteins are expressed almost exclusively during S-phase. There are several nearly identical variants (Marzluff et al. 2002). No functional specialization of these canonical H2A isoforms has been demonstrated (Bonisch & Hake 2012). Reversible histone modifications such as acetylation and methylation regulate transcription from genomic DNA, defining the 'readability' of genes in specific tissues (Kouzarides 2007, Marmorstein & Trievel 2009, Butler et al. 2012).
N.B. The coordinates of post-translational modifications represented here follow Reactome standardized naming, which includes the UniProt standard practice whereby coordinates refer to the translated protein before any further processing. Histone literature typically refers to coordinates of the protein after the initiating methionine has been removed; therefore the coordinates of post-translated histone residues described here are frequently +1 when compared with the literature. For more information on Reactome's standards for naming pathway events, the molecules that participate in them and representation of post-translational modifications, please refer to Naming Conventions on the Reactome Wiki or Jupe et al. 2014.
Chromatin organization refers to the composition and conformation of complexes between DNA, protein and RNA. It is determined by processes that result in the specification, formation or maintenance of the physical structure of eukaryotic chromatin. These processes include histone modification, DNA modification, and transcription. The modifications are bound by specific proteins that alter the conformation of chromatin.