GW_protein_pI ========================== .. module:: GWProt.GW_protein_pI .. autoclass:: GWProt.GW_protein_pI.GW_protein_pI **Downsampling** ------------------------- Unlike ``GW_protein`` , this stores the isoelectric point values with each protein [1]_. Most of the methods are the same as those of ``GW_protein``. The key difference is that when downsampling, we can combine the isoelectric points of adjacent residues which are grouped together, giving an estimated isoelectric point of the segments. Thus unlike the FGW methods in ``GW_protein`` , we can run FGW on downsampled proteins in a meaningful way. This is done using an algorithm based on the one in the `Sequence Manipulation Suite `_ based on the Henderson-Hasselbach equation. For uniform downsampling, this is done automatically in the ``downsample_n`` method. .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.downsample_n For downsampling to specified indices, we first need to ' smooth out ` the isoelectic values, so that when we downsample, we will be accounting for the isoelectric points of nearby discarded residues. .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.convolve_pIs Then we can downsample: .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.downsample_by_indices **Computing FGW and LGD** ------------------------- As ``GW_protein_pI`` objects only use isoelectric points, the FGW methods are streamlined: .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.run_FGW .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.FGW_lgd .. autofunction:: GWProt.GW_protein_pI.GW_protein_pI.run_FGW_seq_aln The local geometric distortion (LGD) quantifies the contribution of each residue to the FGW distance, providing a residue-level measure of structural conservation or flexibility based on isoelectric point differences. .. [1] We note that these are rather naive estimates of the isoelectric points. More sophisticated ones can be computed by other software packages using the 3-dimensional structure of a protein. This could then be used with ``GW_protein.GW_protein.run_FGW_data_lists`` .