Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Internal tensorizer error: BirCodeGenLoop #1073

Closed
JmeanJmy opened this issue Dec 27, 2024 · 2 comments
Closed

Internal tensorizer error: BirCodeGenLoop #1073

JmeanJmy opened this issue Dec 27, 2024 · 2 comments

Comments

@JmeanJmy
Copy link

Hi team, we are currently testing migrating our MQTransformer model from nvidia to trainium instance, and we are ancountering error Internal tensorizer error: BirCodeGenLoop during the training process. Could you help look into what happened?

2024-12-27 22:53:48.000781:  1268479  ERROR ||NEURON_CC_WRAPPER||: Failed compilation with ['neuronx-cc', 'compile', '--framework=XLA', '/tmp/ubuntu/neuroncc_compile_workdir/398c613e-505d-4256-8f66-7c1634efa75b/model.MODULE_796301862831790987+e30acd3a.hlo_module.pb', '--output', '/tmp/ubuntu/neuroncc_compile_workdir/398c613e-505d-4256-8f66-7c1634efa75b/model.MODULE_796301862831790987+e30acd3a.neff', '--target=trn1', '--verbose=35']: 2024-12-27T22:53:48Z [TEN404] (aten__scatter_scatter.134) Internal tensorizer error: BirCodeGenLoop: - Please open a support ticket at https://github.com/aws-neuron/aws-neuron-sdk/issues/new. You may also be able to obtain more information using the 'XLA_IR_DEBUG' and 'XLA_HLO_DEBUG' environment variables.

We use trn1.32xlarge and NeuronXLAStrategy /

@jluntamazon
Copy link
Contributor

Hi @JmeanJmy,

Thank you for reaching out! This could be a variety of issues (appears related to scatter) but it is difficult to tell exactly what is happening from the error message alone. Could you provide a minimal example which reproduces the error? This is important since it allows us to ensure that we are producing the exact same behavior you see without making assumptions about the code & model usage

@karthickgopalswamy
Copy link
Contributor

Hi @JmeanJmy
closing this now, feel free to reopen with info requested above.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants