Custom Linear CUDA kernel #78

mariogeiger · 2025-01-28T12:16:04Z

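import torch
import cuequivariance_torch as cuet
import cuequivariance as cue

# Equivariant linear map from "12x0e + 32x1o" to "32x0e + 48x1o"
e = cue.descriptors.linear(
    cue.Irreps(cue.O3, "12x0e + 32x1o"),
    cue.Irreps(cue.O3, "32x0e + 48x1o"),
).flatten_modes("i")
d = e.d  # underlying segmented tensor product

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
x0 = torch.randn(3, d.operands[0].size, device=device)  # 3 sets of weights
x1 = torch.randn(100, d.operands[1].size, device=device)  # 100 input rows
# one weight-set index in [0, 3) per input row
i = torch.randint(0, 3, (100,), device=device, dtype=torch.int32)

# Note: index_first_input was added in c635b32 and removed again in 2e2b612,
# so this argument may not apply to the final state of the PR.
m = cuet.EquivariantTensorProduct(
    e,
    device=device,
    math_dtype=torch.float32,
    layout=cue.ir_mul,
    index_first_input=True,
)
y2 = m(x0, x1, indices=i)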
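
For readers unfamiliar with indexed weights, here is a minimal NumPy sketch of the gather-then-multiply pattern that the snippet above appears to rely on. It is not the kernel from this PR: it assumes a plain dense linear layer, ignores the irreps structure and any normalization that cuequivariance applies, and all names and sizes (`weights`, `idx`, `d_in`, `d_out`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes, unrelated to the irreps used in the PR.
num_weight_sets, batch, d_in, d_out = 3, 100, 4, 5

weights = rng.standard_normal((num_weight_sets, d_in, d_out))
x = rng.standard_normal((batch, d_in))
idx = rng.integers(0, num_weight_sets, size=batch)  # one weight set per row

# Gather the weight matrix selected by each row, then do a batched matmul.
y = np.einsum("bi,bio->bo", x, weights[idx])

# Loop reference: row b is multiplied by weight set idx[b].
y_ref = np.stack([x[b] @ weights[idx[b]] for b in range(batch)])
```

The gather `weights[idx]` materializes one weight matrix per row, which is the memory cost a fused kernel would presumably avoid.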

mariogeiger · 2025-02-06T15:35:05Z

import cuequivariance as cue
import numpy as np

# new formulation: linear descriptor
e = cue.descriptors.linear(
    cue.Irreps(cue.O3, "12x0e + 32x1o"),
    cue.Irreps(cue.O3, "32x0e + 48x1o"),
).flatten_modes("i")
d = e.d

# old formulation: fully connected tensor product with a "3x0e" first input
e0 = cue.descriptors.fully_connected_tensor_product(
    cue.Irreps(cue.O3, "3x0e"),
    cue.Irreps(cue.O3, "12x0e + 32x1o"),
    cue.Irreps(cue.O3, "32x0e + 48x1o"),
)

print(f"old ordering: {e0.d.operands[0].segments}")
print(f"new ordering: 3x ({e.d.operands[0].segments})")

# convert old weights (w0) to new weights (w)
w0 = np.random.randn(e0.inputs[0].dim)
num_elements = e0.inputs[1].dim
w = []
for s in e0.d.operands[0].segment_slices():
    # view each old segment as (num_elements, weights_per_element)
    w.append(w0[s].reshape(num_elements, -1))
# place the segments side by side per element, then flatten
w = np.concatenate(w, axis=1).flatten()
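
To make the reordering above easier to check by hand, here is a self-contained NumPy sketch of the same reshape-and-concatenate step on toy data. It does not use cuequivariance: `convert_weights`, `old_segment_sizes`, and the sizes are hypothetical stand-ins for `e0.d.operands[0].segment_slices()`, and it assumes, as the conversion code does, that each old segment stores the element index as its leading dimension.

```python
import numpy as np


def convert_weights(w0, old_segment_sizes, num_elements):
    """Reorder flat weights from segment-major to element-major layout."""
    blocks = []
    start = 0
    for size in old_segment_sizes:
        segment = w0[start:start + size]
        # View the segment as (num_elements, weights_per_element).
        blocks.append(segment.reshape(num_elements, -1))
        start += size
    # Place all segments side by side for each element, then flatten.
    return np.concatenate(blocks, axis=1).flatten()


num_elements = 3
# Two toy segments holding 2 and 4 weights per element (assumed sizes).
old_segment_sizes = [num_elements * 2, num_elements * 4]
w0 = np.arange(sum(old_segment_sizes))

w = convert_weights(w0, old_segment_sizes, num_elements)
w_per_element = w.reshape(num_elements, -1)
print(w_per_element[0])  # weights of element 0: [0 1 6 7 8 9]
```

After the conversion, each row of `w_per_element` holds the complete weights of one element, which matches the "3x (segments)" ordering printed above.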


mariogeiger added 5 commits January 28, 2025 04:15

draft

9dfc977

support alternative subscripts (wip)

772a424

BatchLinear ready

efd337c

fix

511c48e

add index_first_input to EquivariantTensorProduct

c635b32

mariogeiger changed the title ~~[wip] Indexed Weights Linear~~ Indexed Weights Linear Jan 30, 2025

mariogeiger added 3 commits January 30, 2025 04:18

Merge branch 'main' into batched_linear

e36646f

format

74eec00

put back extra_repr

f15a19c

mariogeiger added 2 commits February 6, 2025 13:02

Remove index_first_input parameter and related assertions from EquivariantTensorProduct; update forward method in _BatchLinear to handle optional indices.

2e2b612

fix

a2d1f6d

mariogeiger changed the title ~~Indexed Weights Linear~~ Custom Linear CUDA kernel Feb 6, 2025
