FGW_matrices ========================== This module provides methods to generate difference dictionaries and data lists for the fused Gromov-Wasserstein (FGW) methods in ``GWProt.GW_protein.run_FGW_dict`` and ``GWProt.GW_protein.run_FGW_data_lists``. These difference matrices are used to incorporate biochemical features into FGW alignments and to compute local geometric distortion (LGD) at the residue level. .. module:: GWProt.FGW_matrices .. autofunction:: GWProt.FGW_matrices.split_data_list ---------------------- **Hydrophobicity-Based Differences** ---------------------------------------------- .. autofunction:: GWProt.FGW_matrices.get_hydrophobicity_dict .. autofunction:: GWProt.FGW_matrices.get_hydrophobicity_list These methods generate difference matrices using hydrophobicity values from `Eisenberg, Schwarz, Komaromy, and Wall `_. These can be used as the feature space for FGW, allowing LGD to reflect hydrophobicity differences between residues. ---------------------------- **BLOSUM-Based Differences** -------------------------------------- .. autofunction:: GWProt.FGW_matrices.get_BLOSUM_dict This method generates difference matrices based on the `BLOSUM matrices `_. For a pair of amino acids, it computes :math:`e^{-b}` (where :math:`b` is the BLOSUM entry), then normalizes the result to ensure the distances satisfy the triangle inequality. These matrices can be used in FGW to compute LGD based on sequence similarity. ---------------------------- **Grantham-Based Differences** ---------------------------------------- .. autofunction:: GWProt.FGW_matrices.get_Grantham_dict This method generates difference matrices using the `Grantham `_ difference scores, which reflect physicochemical differences between amino acids. These can be used in FGW to compute LGD based on these properties. -------------------- **Isoelectric Point Differences** -------------------------------------------- .. autofunction:: GWProt.FGW_matrices.get_pI_dict .. autofunction:: GWProt.FGW_matrices.get_pI_list These methods generate difference matrices using Solomon isoelectric point values. For more advanced handling of isoelectric points, use the ``GWProt.GW_pI`` module, which offers greater functionality for FGW and LGD computations. **References** --------------------------- - Solomons, T.W.G. and Fryhle, C.B. and Snyder, S.A. (2016) Organic Chemistry 12th Ed. Wiley. - Grantham, R. (1974) Amino Acid Difference Formula to Help Explain Protein Evolution. Science, New Series, Vol. 185, No. 4154. 862-864. - Eisenberg, D.; Schwarz, E.; Komaromy, M.; and Wall, R. (1984) Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J Mol Biol. Oct 15;179(1):125-42.