Could you share the environment you got this running in? #6

Open
ChengEast opened this issue Jun 15, 2024 · 14 comments

Comments

@ChengEast

Could you share the environment files from your successful run? For example: Ubuntu version, Python version, Anaconda version, the yml file, and so on. Thanks!

@ChengEast
Author

I installed Ubuntu 24.04 and 22.04, but both hit package version conflicts:
The conflict is caused by:
paxml 1.4.0 depends on seqio-nightly==0.0.17.dev20231010
paxml 1.3.1 depends on seqio-nightly
paxml 1.3.0 depends on seqio-nightly
paxml 1.2.0 depends on seqio-nightly
paxml 1.1.0 depends on tfds-nightly==4.8.3.dev202303280045
The user requested praxis
paxml 1.0.0 depends on praxis==1.0.0
paxml 0.4.0 depends on tfds-nightly==4.8.3.dev202303280045
paxml 0.2.1 depends on seqio-nightly
paxml 0.2.0 depends on seqio-nightly
paxml 0.1 depends on seqio-nightly

To fix this you could try to:

  1. loosen the range of package versions you've specified
  2. remove package versions to allow pip attempt to solve the dependency conflict

Pip subprocess error:
ERROR: Cannot install -r /home/zd/timesfm/condaenv.8g628zq5.requirements.txt (line 11) and praxis because these package versions have conflicting dependencies.
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts

failed

@lhw828
Owner

lhw828 commented Jun 17, 2024

1. I tried exporting a Docker image, but that failed.
2. Ubuntu version:
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 24.04 LTS
Release: 24.04
Codename: noble
3. conda --version: conda 24.1.2
4. yml file:
environment0.zip
5. If needed, I can package an Ubuntu virtual machine image.

@ChengEast
Author

ChengEast commented Jun 17, 2024 via email

@ChengEast
Author

ChengEast commented Jun 17, 2024 via email

@lhw828
Owner

lhw828 commented Jun 18, 2024

Try feeding the log to an AI and see what it suggests.

@ChengEast
Author

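> Try feeding the log to an AI and see what it suggests.

The AI also says the packages being installed conflict with the Python version. I see that the dependencies section of your environment0.yml specifies Python 3.10.14, but Ubuntu 24.04 ships with 3.11.07. Did you install on Ubuntu 22.04?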

@lhw828
Owner

lhw828 commented Jun 20, 2024

I run this project inside a "conda" environment.

@lhw828
Owner

lhw828 commented Jun 20, 2024

First switch into the conda environment, install the required versions there, and then run the demo from inside it.

@ChengEast
Author

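Thanks a lot! I set python3.10 as the default version and then installed from the yml file you sent. There are no more errors and the environment installed successfully. Thank you! I'll carry on with the next steps.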
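
For anyone following the same path, the two conditions discussed above (an activated conda environment and a Python 3.10 interpreter, as pinned in environment0.yml according to this thread) can be checked with a short script. This is only a sketch: `describe_environment` is a hypothetical helper name, and it relies on the `CONDA_DEFAULT_ENV` variable that `conda activate` sets.

```python
import os
import sys


def describe_environment(environ=None, version_info=None):
    """Return (active_conda_env_or_None, python_is_3_10).

    Both arguments default to the live process values; they are
    parameters only so the check can be exercised with fake inputs.
    """
    environ = os.environ if environ is None else environ
    version_info = sys.version_info if version_info is None else version_info
    # `conda activate` exports CONDA_DEFAULT_ENV with the environment name.
    env_name = environ.get("CONDA_DEFAULT_ENV")
    # Assumption from this thread: the yml file pins Python 3.10.x.
    is_310 = tuple(version_info[:2]) == (3, 10)
    return env_name, is_310


if __name__ == "__main__":
    env_name, is_310 = describe_environment()
    print("active conda env:", env_name or "none detected")
    print("python version:", sys.version.split()[0])
    print("matches 3.10:", is_310)
```

Running it inside and outside the activated environment should make it obvious which interpreter pip and the demo are actually using.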

@LouisLee1983

WARNING:absl:No registered CheckpointArgs found for handler type: <class 'paxml.checkpoints.FlaxCheckpointHandler'>
WARNING:absl:Configured CheckpointManager using deprecated legacy API. Please follow the instructions at https://orbax.readthedocs.io/en/latest/api_refactor.html to migrate by May 1st, 2024.
WARNING:absl:train_state_unpadded_shape_dtype_struct is not provided. We assume train_state is unpadded.
ERROR:absl:For checkpoint version > 1.0, we require users to provide
train_state_unpadded_shape_dtype_struct during checkpoint
saving/restoring, to avoid potential silent bugs when loading
checkpoints to incompatible unpadded shapes of TrainState.
Restored checkpoint in 0.75 seconds.
Jitting decoding.
Killed

What is going wrong here? My orbax-checkpoint is version 0.5.9.

@lhw828
Owner

lhw828 commented Jul 1, 2024

> WARNING:absl:No registered CheckpointArgs found for handler type: <class 'paxml.checkpoints.FlaxCheckpointHandler'>
> WARNING:absl:Configured CheckpointManager using deprecated legacy API. Please follow the instructions at https://orbax.readthedocs.io/en/latest/api_refactor.html to migrate by May 1st, 2024.
> WARNING:absl:train_state_unpadded_shape_dtype_struct is not provided. We assume train_state is unpadded.
> ERROR:absl:For checkpoint version > 1.0, we require users to provide
> train_state_unpadded_shape_dtype_struct during checkpoint
> saving/restoring, to avoid potential silent bugs when loading
> checkpoints to incompatible unpadded shapes of TrainState.
> Restored checkpoint in 0.75 seconds.
> Jitting decoding.
> Killed
>
> What is going wrong here? My orbax-checkpoint is version 0.5.9.

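These warnings and errors come from a machine learning script that uses the Pax and Orbax libraries. Pax is a deep learning library developed by Google, and Orbax is a Python library used in machine learning work to manage the data and model state saved during training.
The warnings indicate the following:

  1. No registered CheckpointArgs found for handler type: when loading or saving a checkpoint, no registered arguments were found for the given handler type. Checkpoint handlers serialize and deserialize model and optimizer state.
  2. Configured CheckpointManager using deprecated legacy API: the code uses a legacy Orbax API that has been deprecated. Developers are asked to migrate to the new API by following the official Orbax documentation, to stay compatible with newer versions.
  3. train_state_unpadded_shape_dtype_struct is not provided: no train_state_unpadded_shape_dtype_struct was supplied when loading or saving the checkpoint. This structure holds the shape and dtype information of the training state (TrainState) and is used to make sure the training state's shapes are compatible with those stored in the checkpoint.

The error message indicates:

  • For checkpoint versions above 1.0, users must provide train_state_unpadded_shape_dtype_struct when saving or restoring a checkpoint, to avoid silent bugs caused by loading a checkpoint into a TrainState with incompatible unpadded shapes.

Possible ways to address these issues include:

  • Make sure your code uses the latest Orbax API and has been migrated according to the official documentation.
  • Make sure train_state_unpadded_shape_dtype_struct is provided when saving and loading checkpoints.
  • Consult the official Pax and Orbax documentation on how to configure and operate checkpoints correctly.

If you need more specific help, you may need to provide more code context or error logs so that more precise guidance can be given.

That is the explanation the AI gave. Hopefully it offers some direction.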

@LouisLee1983

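Thanks, brother. Reporting back: the cause was that WSL did not have enough memory allocated. Something like 12g+8g works best. In "C:\Users\xxx\.wslconfig", enter: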
[wsl2]
memory=12GB
swap=16GB
localhostForwarding=true

Then restart WSL.
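
To confirm that the new limits took effect after the restart, the totals the Linux kernel sees can be read from /proc/meminfo inside WSL. The following is a minimal sketch, assuming a Linux environment; `parse_meminfo` is a hypothetical helper name, and the values are converted from the kB units that /proc/meminfo reports.

```python
import os


def parse_meminfo(text):
    """Return {'MemTotal': GiB, 'SwapTotal': GiB} parsed from /proc/meminfo text."""
    wanted = {"MemTotal", "SwapTotal"}
    result = {}
    for line in text.splitlines():
        name, _, rest = line.partition(":")
        if name in wanted:
            kib = int(rest.split()[0])  # /proc/meminfo reports sizes in kB
            result[name] = kib / (1024 * 1024)
    return result


# Only read the live file where it exists (Linux, including WSL).
if __name__ == "__main__" and os.path.exists("/proc/meminfo"):
    with open("/proc/meminfo") as f:
        for name, gib in parse_meminfo(f.read()).items():
            print(f"{name}: {gib:.1f} GiB")
```

If MemTotal is still well below the configured value, the .wslconfig change has probably not been picked up yet.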

@zhouzjj

zhouzjj commented Aug 14, 2024

> I installed Ubuntu 24.04 and 22.04, but both hit package version conflicts:
> The conflict is caused by:
> paxml 1.4.0 depends on seqio-nightly==0.0.17.dev20231010
> paxml 1.3.1 depends on seqio-nightly
> paxml 1.3.0 depends on seqio-nightly
> paxml 1.2.0 depends on seqio-nightly
> paxml 1.1.0 depends on tfds-nightly==4.8.3.dev202303280045
> The user requested praxis
> paxml 1.0.0 depends on praxis==1.0.0
> paxml 0.4.0 depends on tfds-nightly==4.8.3.dev202303280045
> paxml 0.2.1 depends on seqio-nightly
> paxml 0.2.0 depends on seqio-nightly
> paxml 0.1 depends on seqio-nightly
>
> To fix this you could try to:
>
>   1. loosen the range of package versions you've specified
>   2. remove package versions to allow pip attempt to solve the dependency conflict
>
> Pip subprocess error:
> ERROR: Cannot install -r /home/zd/timesfm/condaenv.8g628zq5.requirements.txt (line 11) and praxis because these package versions have conflicting dependencies.
> ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
>
> failed

How did you end up solving this problem? I still can't get past it.

@ham114

ham114 commented Sep 13, 2024

Pip subprocess error:
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ConnectTimeoutError(<pip._vendor.urllib3.connection.HTTPSConnection object at 0x7fb2c58138e0>, 'Connection to pypi.tuna.tsinghua.edu.cn timed out. (connect timeout=15)')': /simple/absl-py/
ERROR: Ignored the following versions that require a different python version: 1.8.0 Requires-Python >=3.11; 1.9.0 Requires-Python >=3.11; 1.9.1 Requires-Python >=3.11; 1.9.2 Requires-Python >=3.11; 1.9.3 Requires-Python >=3.11
ERROR: Could not find a version that satisfies the requirement python-graphviz>=0.20.1 (from versions: none)
ERROR: No matching distribution found for python-graphviz>=0.20.1

failed

Hello, how can this problem be solved? I am running on Ubuntu 22.04.
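
One part of the log above can be read mechanically: the "Ignored the following versions that require a different python version" line lists releases whose Requires-Python metadata (here `>=3.11`) excludes the interpreter pip is running under. The sketch below illustrates that comparison for the simple `>=X.Y` form only. `min_python_satisfied` is a hypothetical helper, not part of pip, and pip itself applies the full version specifier rules.

```python
import re
import sys


def min_python_satisfied(requires_python, version_info=None):
    """Check a single-clause '>=X.Y[.Z]' Requires-Python string.

    Any other specifier form raises ValueError, since this sketch
    deliberately does not implement the full specifier grammar.
    """
    version_info = sys.version_info if version_info is None else version_info
    match = re.fullmatch(r"\s*>=\s*(\d+(?:\.\d+)*)\s*", requires_python)
    if match is None:
        raise ValueError(f"unsupported specifier: {requires_python!r}")
    minimum = tuple(int(part) for part in match.group(1).split("."))
    # Compare only as many components as the specifier names.
    return tuple(version_info[:len(minimum)]) >= minimum


if __name__ == "__main__":
    print("running interpreter satisfies >=3.11:", min_python_satisfied(">=3.11"))
```

Under a Python 3.10 interpreter the check returns False for `>=3.11`, which is consistent with those releases being skipped in the log.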
