Feat/sharegpt multirole #1

Open

wants to merge 60 commits into main from feat/sharegpt_multirole
Conversation

teknium1

Description

Motivation and Context

How has this been tested?

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

@NanoCode012 force-pushed the feat/sharegpt_multirole branch from 13f31d8 to 33bcf57 on February 22, 2024 at 13:13
NanoCode012 and others added 26 commits February 22, 2024 22:31
* make mlflow optional

* fix xformers

don't patch swiglu if xformers not working
fix the check for xformers swiglu

* fix install of xformers with extra index url for docker builds

* fix docker build arg quoting
* WIP conversion to use pydantic for config validation

* wip, more fields, add capabilities

* wip

* update pydantic validation to match existing tests

* tweak requirements

* set up deprecated params pydantic model

* more validations

* wrap up rest of the validations

* flesh out the rest of the options from the readme into pydantic

* fix model validators as class methods

remember to return in validator
missing return
add missing relora attributes
fix test for DictDefault change
fix sys template for mistral from fastchat change in PR 2872
fix test for batch size warning

* more missing attributes for cfg

* updates from PR feedback

* fix validation for datasets and pretrain datasets

* fix test for lora check
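
The pydantic commits above move config checking into typed models. As a rough illustration of the pattern (not axolotl's actual schema; the class and field names below are made up), a field validator and an "after" model validator might look like this:

```python
# Illustrative only: a tiny pydantic model in the spirit of the validation
# commits above; field names are hypothetical, not axolotl's real schema.
from typing import Optional
from pydantic import BaseModel, field_validator, model_validator


class TrainConfigSketch(BaseModel):
    base_model: str
    micro_batch_size: int = 1
    gradient_accumulation_steps: int = 1
    learning_rate: float = 2e-5
    adapter: Optional[str] = None  # e.g. "lora" or "qlora"

    @field_validator("learning_rate")
    @classmethod
    def lr_positive(cls, v: float) -> float:
        if v <= 0:
            raise ValueError("learning_rate must be positive")
        return v  # validators must return the (possibly modified) value

    @model_validator(mode="after")
    def check_adapter(self):
        if self.adapter not in (None, "lora", "qlora"):
            raise ValueError(f"unknown adapter: {self.adapter}")
        return self  # model validators must also return the model


cfg = TrainConfigSketch(base_model="mistralai/Mistral-7B-v0.1", adapter="lora")
```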
* Add checkpoint logging to mlflow artifact registry

* clean up

* Update README.md

Co-authored-by: NanoCode012 <[email protected]>

* update pydantic config from rebase

---------

Co-authored-by: NanoCode012 <[email protected]>
Co-authored-by: Wing Lian <[email protected]>
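
The checkpoint-logging commit above pushes saved checkpoints to the MLflow artifact registry. A minimal sketch using the standard mlflow client API, with an example output path:

```python
# Illustrative: logging a saved checkpoint directory to MLflow as an artifact.
# The path is an example; point it at a real checkpoint directory.
import mlflow

with mlflow.start_run():
    mlflow.log_param("learning_rate", 2e-5)
    # Uploads the whole checkpoint directory to the run's artifact store.
    mlflow.log_artifacts("outputs/checkpoint-500", artifact_path="checkpoints")
```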
* Add StableLM examples and configurations

* Add FFT and LORA configuration files and modify readme with usage
* add lion-pytorch optimizer

* update pydantic to support lion optimizer

---------

Co-authored-by: Wing Lian <[email protected]>
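
For the lion-pytorch commits, this is roughly how the optimizer is used standalone; in practice it would be selected through the training config rather than constructed by hand:

```python
# Illustrative: using the Lion optimizer from the lion-pytorch package directly.
import torch
from lion_pytorch import Lion

model = torch.nn.Linear(10, 2)
# Lion typically pairs a smaller lr with a larger weight decay than AdamW.
optimizer = Lion(model.parameters(), lr=1e-4, weight_decay=1e-2)

loss = model(torch.randn(4, 10)).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```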
* support user-defined prompt processing strategies for dpo

* interpret dict dataset types as user-defined

* fix lint errors

* setup pydantic config for validation of User defined DPO

---------

Co-authored-by: Wing Lian <[email protected]>
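
For the user-defined DPO strategy commits, the essence is mapping arbitrary dataset columns into the prompt/chosen/rejected triple that DPO training expects. A sketch with hypothetical column names:

```python
# Illustrative: the shape of a user-defined DPO transform. The input column
# names ("question", "good_answer", "bad_answer") are hypothetical; the output
# keys follow the usual DPO convention of prompt/chosen/rejected.
def my_dpo_transform(sample: dict) -> dict:
    return {
        "prompt": f"[INST] {sample['question']} [/INST]",
        "chosen": sample["good_answer"],
        "rejected": sample["bad_answer"],
    }


row = {
    "question": "What is 2 + 2?",
    "good_answer": "4",
    "bad_answer": "5",
}
print(my_dpo_transform(row))
```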
* Lora example for Mistral on MPS backend

* Add some MPS documentation

* Update examples/mistral/lora-mps.yml

Co-authored-by: NanoCode012 <[email protected]>

* Update examples/mistral/lora-mps.yml

Co-authored-by: NanoCode012 <[email protected]>

* Update README.md

---------

Co-authored-by: NanoCode012 <[email protected]>
Co-authored-by: Wing Lian <[email protected]>
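
The MPS example above targets Apple-silicon machines. Device selection in PyTorch is a one-liner:

```python
# Illustrative: pick the Apple MPS backend when available, otherwise fall back to CPU.
import torch

device = "mps" if torch.backends.mps.is_available() else "cpu"
x = torch.randn(2, 3, device=device)
print(device, x.device)
```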
* add gemma instruct chat template

* support for chat template strategy too
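
The chat template commits lean on the tokenizer-provided templates in transformers. A small sketch (the Gemma checkpoint name is just an example and is gated on the Hub; any model shipping a chat_template works the same way):

```python
# Illustrative: rendering a conversation with a tokenizer's built-in chat template.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")  # example model
messages = [
    {"role": "user", "content": "Write a haiku about fine-tuning."},
]
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(text)  # Gemma-style turns, e.g. <start_of_turn>user ... <end_of_turn>
```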
* add missing evals_per_epoch setting

* more pydantic fixes

* more fixes

* move test from normalization to validation

* increase eval size for sample packing tests
winglian and others added 30 commits February 28, 2024 15:07
* run tests again on Modal

* make sure to run the full suite of tests on modal

* run cicd steps via shell script

* run tests in different runs

* increase timeout

* split tests into steps on modal

* increase workflow timeout

* retry doing this with only a single script

* fix yml launch for modal ci

* reorder tests to run on modal

* skip dpo tests on modal

* run on L4s, A10G takes too long

* increase CPU and RAM for modal test

* run modal tests on A100s

* skip phi test on modal

* env not arg in modal dockerfile

* upgrade pydantic and fastapi for modal tests

* cleanup stray character

* use A10s instead of A100 for modal
* plain input/output prompt strategy w/o chat templates

* disable duplicate code check

* make sure to add an eos/eot token to the end of the output so it will stop

* multi-turn segment support and test
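
The eos/eot commit above boils down to appending the tokenizer's end-of-sequence token to the completion so generation learns to stop. A tiny sketch with an example tokenizer:

```python
# Illustrative: append the EOS token id to a tokenized completion.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # example tokenizer
prompt, output = "Translate to French: cat", " chat"
ids = tokenizer(prompt + output)["input_ids"] + [tokenizer.eos_token_id]
```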
* lora+ support

* optimizer should default to None

* include mit license
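
LoRA+ trains the LoRA B matrices with a larger learning rate than the A matrices. A hedged sketch using optimizer parameter groups, assuming PEFT's usual lora_A/lora_B parameter naming:

```python
# Illustrative sketch of the LoRA+ idea: separate parameter groups so the
# "B" matrices get a higher learning rate. Name matching assumes PEFT's
# lora_A / lora_B conventions; lr and lr_ratio values are examples.
import torch

def loraplus_param_groups(model, lr=1e-4, lr_ratio=16.0):
    groups_a, groups_b = [], []
    for name, param in model.named_parameters():
        if not param.requires_grad:
            continue
        (groups_b if "lora_B" in name else groups_a).append(param)
    return [
        {"params": groups_a, "lr": lr},
        {"params": groups_b, "lr": lr * lr_ratio},
    ]

# optimizer = torch.optim.AdamW(loraplus_param_groups(peft_model))
```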
Allow the sharegpt handler to better handle datasets destined for openai finetuning (#1361)

* allow the sharegpt handler to also better handle datasets destined for openai finetuning

* make sure to support system role
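
The handler change above is about moving between ShareGPT-style from/value turns (including a system role) and OpenAI-style role/content messages. A minimal mapping sketch:

```python
# Illustrative: map a ShareGPT-style record to OpenAI-style chat messages.
ROLE_MAP = {"system": "system", "human": "user", "gpt": "assistant"}

sharegpt_record = {
    "conversations": [
        {"from": "system", "value": "You are a helpful assistant."},
        {"from": "human", "value": "Hello!"},
        {"from": "gpt", "value": "Hi! How can I help?"},
    ]
}

openai_messages = [
    {"role": ROLE_MAP[turn["from"]], "content": turn["value"]}
    for turn in sharegpt_record["conversations"]
]
print(openai_messages)
```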
* add starcoder2

* Apply suggestions from code review

Co-authored-by: NanoCode012 <[email protected]>

* chore: lint

* Apply suggestions from code review

Co-authored-by: NanoCode012 <[email protected]>

---------

Co-authored-by: Wing Lian <[email protected]>
Co-authored-by: NanoCode012 <[email protected]>
* add docs

* add docs

* run linter
* add Jarvis cloud gpu and sponsorship

* whitespace
* wip qlora + fsdp fixes

* more fixes

* make sure to load the lora 🤦

* only set up quantized meta on non-zero rank

* only run setup_quantized_peft_meta_for_training for qlora+fsdp

* more fixes for qlora+fsdp

* chore: lint

* add example yml

* support mistral too

* fix for model_type and add mixtral support too

* set cpu_offload: false to reduce vram, constrain new accelerator logic to qlora + fsdp

* refactor for duplicate code
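
The QLoRA + FSDP commits combine 4-bit loading with sharded training. Only the quantization half is sketched here, using the transformers BitsAndBytesConfig API (needs a CUDA GPU with bitsandbytes installed); the FSDP wrapping and cpu_offload settings live in the launcher/config and are not shown:

```python
# Illustrative: the 4-bit (QLoRA-style) loading side of the setup.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",  # example model, per the mistral commit above
    quantization_config=bnb_config,
)
```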
* validation for fsdp and deepspeed

* make sure to return data
* Fix pydantic configuration for the max_memory input

* chore: lint

---------

Co-authored-by: Wing Lian <[email protected]>
* Add Glaive conversation format support

* fix black formatting errors

* Fix black and pylint formatting errors

* only set role_key_tool if provided in the dataset constructor

* Update src/axolotl/prompt_strategies/sharegpt.py

Co-authored-by: Wing Lian <[email protected]>

* sharegpt test

* tokenizer test

* fix formatting

---------

Co-authored-by: Wing Lian <[email protected]>
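
The Glaive commits add a conversation format with a tool role on top of the multi-role ShareGPT handling this PR is about. An example record (role names are illustrative; the actual role keys are configured per dataset, e.g. via the role_key_tool mentioned above):

```python
# Illustrative: a multi-role ShareGPT-style conversation including a tool turn.
conversation = {
    "conversations": [
        {"from": "system", "value": "You can call a weather tool."},
        {"from": "human", "value": "What's the weather in Paris?"},
        {"from": "gpt", "value": '{"tool": "get_weather", "city": "Paris"}'},
        {"from": "tool", "value": '{"temp_c": 18, "condition": "cloudy"}'},
        {"from": "gpt", "value": "It is 18°C and cloudy in Paris."},
    ]
}
```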