Releases¶

0.9.3¶

The new version supports training policies using the SAC algorithm;
The new version supports custom node loss functions;
The new version supports custom network nodes;
The new version supports introducing expert constraints;
The new version optimizes the control strategy for multiple control nodes;
The new version supports configuring multiple discriminators;
The new version fixes some bugs and optimizes training stability;
The documentation has been updated in the new version.

0.9.2¶

Updated RNN Backbone, improved training performance when using LSTM and GRU as Backbone;
Optimized asynchronous decision flowchart loading, compatible with more loading mechanisms;
Optimized time step nodes, empty node mechanisms, and supports more rich visual content;
Fixed some bugs, improved training stability;
Updated documentation content, providing more explanations for added functionality, and improved the organization of tutorial examples.

0.9.1¶

Support loading training completion environment model as initialization model;
Support the use of multi head networks;
Support freezing model parameters of some nodes in training environment;
Support multiple matchers configuration;
Add response curve drawing function;
Support the use of random, grid, zoom and Bayes algorithms;
New function cases are added to the document;
New task examples (IB) have been added to the document;
Fixed some errors;
Some new test cases have been added.

0.9.0¶

Add causal discovery algorithm to assist in generating decision flow graph;
Support for training multi-environment adaptive policy;
Support for training environments and training policy using different decision flow graphs;
Support defining nodes without data binding;
Support for calling individual node models for inference through virtual environment models;
The optimization policy can explain the style of the tree;
Fixed some bugs.

0.8.0¶

Upgrade dependency package version;
Support the action for multi-steps in env;
Delete nohup cmd in pyarmor register;
Support plot image in dataset_val;
Support append step node;
Support miss some data point;
Fixed some bugs.

0.7.3¶

Rename venv_algo（revive_p, revive_t）;
Decision diagram supporting training stateless transfer;
Update revive_p algorithm;
Append gym_env example script in examples;
Fixed some bugs.

0.7.2¶

Support automatic generation of multi-time frame data nodes.
Support license verification by accesskey.
Fixed some other bugs.

0.7.1¶

Add refrigerator task example.
Update revive algorithm, training is more stable and better;
Update model rollout mechanism.
Fixed some other bugs.

0.7.0¶

Support Online Identity Verification;
Support for training multi agent policy;
Update revive algorithm, training is more stable and better;
Fixed some other bugs.

0.6.0¶

Update function node import mechanism.
Update value test for the expert functions.
Update ray version==1.8.
Add license register script.
Fix some bug.