AMindToThink
/

ppo_with_value15

Generated from Trainer

Model card Files Files and versions Community

ppo_with_value15 / policy_model

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

AMindToThink's picture

Model save

f7c9767 verified 20 days ago

config.json

757 Bytes

Model save 20 days ago
generation_config.json

90 Bytes

Model save 20 days ago
model.safetensors

649 MB
LFS

Model save 20 days ago
special_tokens_map.json

579 Bytes

Model save 20 days ago
tokenizer.json

3.56 MB

Model save 20 days ago
tokenizer_config.json

5.26 kB

Model save 20 days ago
training_args.bin
Detected Pickle imports (10)
- "trl.trainer.ppo_config.PPOConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.SaveStrategy",
- "accelerate.state.PartialState",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.IntervalStrategy",
- "torch.device",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy"
How to fix it?
6.2 kB
LFS

Model save 20 days ago