Fine tuning with RL

Hi folks

Is there a recommended way to load a pretrained model for RL without loading the entire checkpoint?

I'm training with TD3 and want to refine the model with new data and a modified reward function. If I call `torchfort_rl_off_policy_create_system("td3_system",...` and then `torchfort_load_model("td3_system",...` with the filename of the `policy/model.pt` file from a saved checkpoint, I get an `Invalid usage. (Invalid model name provided.)` error.

I guess one approach would be to load the entire checkpoint and then discard the history, but I don't know how to do that either.

How do I set up for fine tuning?

Ciao
Terry
