Skip to content
View mansicer's full-sized avatar
🎆
coding
🎆
coding
  • Nanjing University

Highlights

  • Pro

Organizations

@LAMDA-RL

Block or report mansicer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. LAMDA-RL/ODIS LAMDA-RL/ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 38 5

  2. MAIC MAIC Public

    The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".

    Python 50 10

  3. Q-Adapter Q-Adapter Public

    Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    Python 13

  4. LAMDA-RL/ReDA LAMDA-RL/ReDA Public

    The implementation of the AAMAS'24 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"

    Python 3