AbstractComplex
Overview
AbstractComplex
is the base class for molecular complex representations. All other polyatomic complexes inherit from this class.
get_bonds()
→ Returns bond information within the molecule.get_laplacians()
→ Returns Laplacian matrices.get_atomic_structure()
→ Returns atomic structure details.get_all_cell_coadj()
→ Returns all cell coadjacencies.get_incidence()
→ Returns incidence matrices.get_skeleta()
→ Returns skeletal molecular structure.get_adjacencies()
→ Returns adjacency matrices for molecular structures.get_spectral_k_chains()
→ Returns k-chains of complex with additional spectral features.get_raw_k_chains()
→ Returns k-chains of complex.abstract_mol.k_chains_formal_sum()
→ Represents k-chains symbolically (as a formal sum).
Usage Example
Methods Explained
get_bonds()
Returns all chemical bonds between pairs of atoms. The return type is List[Tuple]
where each tuple is of length 3
.
The first two values being the atoms and the third value is a list contaning the kind of chemical bond and its numerical equivalent.
For example:
get_laplacians()
In this context the method returns the Hodge Laplacian. The Hodge Laplacian is a generalization of the graph Laplacian.
More formally for a CW complex \(X\) let \(C_{k}(X)\) denote the set of all \(k\)-chains on \(X\). \(C_{k}(X)\) has the algebraic structure
of a free Abelian group with basis \(\{e_{\alpha}^{k}\}_{\alpha = 1}^{N}\). Then let \(d_{k}: C^{k}(X) \to C^{k+1}(X)\) be the coboundary operator, and \(d_{k}^{*}\) its adjoint. Then the Hodge Laplacian is:
$$
\Delta_{k} := d_{k-1} \circ d_{k-1}^{\ast} + d_{k}^{\ast} \circ d_{k}
$$.
This method will return the Hodge Laplacians for all \(k\).
The return type is a Dict
with a single key "molecule_laplacians"
whose value is a List[Tuple]
wherein one can find the laplacians, a numpy.matrix
, for each \(k\). Note that the dictionary returned contains all laplacians at the atomic level for every atom. One can retrieve the laplacians at the electronic structure level, however we leave such functionality for the get_atomic_structure()
method.
get_atomic_structure()
This method enables one to iterate over the entire molecular structure. Essentially one can iterate over every atom and its corresponding protons, neutrons, and electrons. This explicitly describes the entire structure along with the laplacians, incidence and even weights at every level up to electronic structure. Note that there are byte arrays corresponding to approximate sampled locations, one will have to use np.frombuffer(value, dtype=np.uint8)
in order to convert these into matrix form. Note that value
here is a proton, neutron, or electron. Additionally note that all dirac matricies are stored as scipy
COOrdinate sparse matricies of dtype int64
.
get_all_cell_coadj()
This returns the standard coadjacency matrix of all cells with respect to all nodes. The return type is a defaultdict
with key "molecule_all_cell_coadj"
.
get_incidence()
This method returns a defaultdict
with key molecule_incidence
that contains the incidence relations across all levels, up to electronic structure. The information returned is conceptually equivalent to that of incidence matricies in a graph, but for CW complexes.
get_skeleta()
This method returns a defaultdict
enabling one to iterate over the skeleton of the molecule. The only key is 'molecule_skeleta'
.
get_adjacencies()
This method will return the adjacency matricies for the entire molecule. An adjacency matrix in this context can be thought of as equivalent to the usual definition for graphs.
get_spectral_k_chains()
This method will return all \(k\)-chains for the complex with additional spectral features computed aka scaling, persistence. Note that we don't provide a \(0\)-chain due to reliance on higher \(k\) to compute the additional features. To use \(k=0\) rely on get_raw_k_chains()
.
get_raw_k_chains()
This method will return all \(k\)-chains for the complex. Note that we provide the \(0\) chains explicitly.
abstract_mol.k_chains_formal_sum()
This method will return the symbolic computation for get_raw_k_chains()
aka represent the computation of all \(k\) chains as a formal sum.