cheat/examples/train_lgnn at main · catalyticmaterials/cheat

Training and testing the lean graph neural network (lGNN)

This folder contains an example of how to train and test the lGNN model.

dft2graphs.py reads the supplied trajectory files in the gpaw folder and constructs graph features from the relaxed slabs with adsorbates, forming the training, validation and test sets. Adsorption energies, $\Delta E_{ads}$, are calculated as:

$$\Delta E_{ads} = E_{slab+ads} - E_{slab} - E_{ads}$$

where $E_{slab+ads}$ and $E_{slab}$ are the energies of the slab with and without the adsorbate, respectively, and $E_{ads}$ is the gas-phase reference energy of the adsorbate.
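The formula above can be sketched in a few lines of Python. The function name and the energies below are illustrative placeholders, not values from the supplied DFT data.

```python
def adsorption_energy(e_slab_ads, e_slab, e_ads):
    """Return Delta E_ads = E_slab+ads - E_slab - E_ads (all energies in the same unit)."""
    return e_slab_ads - e_slab - e_ads


# Made-up total energies, chosen only to show the arithmetic.
delta_e = adsorption_energy(e_slab_ads=-250.0, e_slab=-240.0, e_ads=-9.0)
print(delta_e)  # -1.0
```

With this sign convention, a negative value means that the adsorbed state is lower in energy than the clean slab plus the gas-phase reference.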

The graph includes the adsorbate and the nearest neighboring atoms to the adsorbing atom(s) (ensemble) as well as the next nearest neighbors in the third surface layer.

The graph node features are one-hot encoded to denote the element. In addition, the layer tag and an AtomOfInterest (AoI) feature, which tracks important atomic positions with favourable long-range interactions, are included. Because the nodes carry no positional information and the edges only denote connectivity, the resulting graphs retain equivariant properties.
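As a rough illustration of such a node feature vector, the sketch below concatenates a one-hot element encoding with the layer tag and the AoI flag. The function name, the element list and the ordering of the features are assumptions made for this example; dft2graphs.py may arrange them differently.

```python
def node_features(symbol, tag, aoi, elements):
    """Build one node feature vector: one-hot element, then layer tag, then AoI flag."""
    onehot = [1.0 if symbol == el else 0.0 for el in elements]
    if sum(onehot) != 1.0:
        raise ValueError(f"unknown element: {symbol}")
    return onehot + [float(tag), float(aoi)]


# Hypothetical element list, used only for this example.
elements = ["Ag", "Au", "Cu", "Pd", "Pt", "O", "H"]
features = node_features("Pd", tag=1, aoi=0, elements=elements)
print(features)  # [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0]
```

Note that no coordinates enter the vector, which is the point made above: the node description depends only on element, layer and the AoI flag.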

The graphs are PyTorch Geometric data objects from which the following information can be accessed:

- 'x': Node features
- 'y': Adsorption energy
- 'edge_index': Edge pairs
- 'onehot_labels': Element list used for one-hot encoding (does not include the tag or AoI feature)
- 'ads': Adsorbate
- 'gIds': "Graph IDs" used for translating a 5x5x3 sized surface to a graph (used in conjunction with templates and the surrogate surface; see the surface_simulation folder for further info)
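To show how these attributes might be used together, here is a minimal sketch that counts graphs per adsorbate and reports the number of nodes and edge pairs in each graph. It uses plain stand-in objects with made-up values so that it runs without PyTorch Geometric, and it assumes that 'ads' is stored as a plain string; the summarize helper is hypothetical and not part of the repository.

```python
from collections import Counter
from types import SimpleNamespace


def summarize(graphs):
    """Count graphs per adsorbate and list (n_nodes, n_edge_pairs) for each graph."""
    per_adsorbate = Counter(g.ads for g in graphs)
    sizes = [(len(g.x), len(g.edge_index[0])) for g in graphs]
    return per_adsorbate, sizes


# Stand-in objects with made-up values; real graphs would be the
# PyTorch Geometric data objects produced by dft2graphs.py.
graphs = [
    SimpleNamespace(x=[[1, 0, 0, 0], [0, 1, 1, 0]],
                    edge_index=[[0, 1], [1, 0]], y=-0.5, ads="OH"),
    SimpleNamespace(x=[[1, 0, 0, 0], [0, 1, 1, 0], [0, 1, 2, 1]],
                    edge_index=[[0, 1, 1, 2], [1, 0, 2, 1]], y=-1.2, ads="O"),
    SimpleNamespace(x=[[0, 1, 1, 0]], edge_index=[[], []], y=0.1, ads="OH"),
]
per_adsorbate, sizes = summarize(graphs)
print(per_adsorbate, sizes)
```

The same attribute access (g.x, g.edge_index, g.ads) should carry over to the real data objects, where 'x' and 'edge_index' are tensors rather than lists.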

The lGNN model is trained by running train.py, where you will also find a few adjustable parameters for the GNN architecture and training. The architecture is saved in the .state file, so after training the model can be loaded with:

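```python
import pickle

# filename: path to the saved state, without the .state extension
with open(f'{filename}.state', 'rb') as f:
    model_state = pickle.load(f)
model = lGNN(trained_state=model_state)  # lGNN class from this repository
```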

The lGNN class supports two methods. model.predict(graphlist) returns the predicted adsorption energies for a list of graphs. model.test(data_loader, batch_size) returns additional information:

```python
pred, true, ads = model.test(data_loader, batch_size)
```

with pred and true being the predicted and true energies, respectively, and ads denoting the adsorbate for easy categorization.
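One way to use the adsorbate labels for categorization is to compute the mean absolute error per adsorbate. The sketch below assumes that pred, true and ads are equal-length sequences with one entry per graph; the helper name and the numbers are made up for illustration and are not results from the model.

```python
from collections import defaultdict


def mae_per_adsorbate(pred, true, ads):
    """Return the mean absolute error of the predictions for each adsorbate label."""
    errors = defaultdict(list)
    for p, t, a in zip(pred, true, ads):
        errors[a].append(abs(float(p) - float(t)))
    return {a: sum(e) / len(e) for a, e in errors.items()}


# Made-up values standing in for the output of model.test(...).
pred = [-0.50, -1.00, 0.25, -0.75]
true = [-0.75, -1.25, 0.25, -0.25]
ads = ["OH", "O", "OH", "O"]
mae = mae_per_adsorbate(pred, true, ads)
print(mae)  # {'OH': 0.125, 'O': 0.375}
```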

Running test.py will create a .results file. Pass it as an argument to plot_parity.py to obtain a parity plot of the test results.
