Class BaseExperiment
Classical (monitored) experiment class.
Author: Alexis BRENON <alexis.brenon@imag.fr>
Data Types
ClassificationTable
List of classification metrics computed.
See some explanations about these metrics at:
http://blog.revolutionanalytics.com/2016/03/com_class_eval_metrics_r.html

Dump
Dump.
InitArgument
Argument used for instantiation.
Todo: extract some arguments to Dump.
Fields:
- environment ({train=environment.BaseEnvironment, test=environment.BaseEnvironment}): table with train and test environments
- agent (agent.BaseAgent): the agent to use
- output (OutputArgument): output options
- steps (number): total number of steps to do (excluding evaluation steps) (default: math.huge)
- eval_freq (number): number of steps between two evaluations
- eval_steps (number): number of evaluation steps
- save_at (number): next iteration at which to save the experiment (optional)
- loop ({train=table, test=table}): results of the last iteration (optional)
- metrics ({train=table, test=table}): table of saved metrics (optional)
- step (number): current step (optional)
InteractionsResult
Result of a set of interactions.
Fields:
- num_it (integer): actual number of interactions done
- time ({real=number, sys=number, user=number}): time elapsed during the interactions
- num_rewards (integer): number of non-zero rewards received
- num_episodes (integer): number of episodes (terminal states)
- total_reward (number): sum of the rewards obtained
- confusion_matrix (torch.Tensor): confusion matrix
- interactions (InteractionsTable): details of the interactions
- classification_metrics (ClassificationTable): summed-up metrics
- inputs ({torch.Tensor, ...}): inputs of the interactions
InteractionsTable
A list of interactions.
Fields:
- expected_actions (torch.Tensor): actions expected by the environment
- actions (torch.Tensor): actions executed
- rewards (torch.Tensor): rewards obtained
- terminals (torch.Tensor): terminal signal of the input state
OutputArgument
Arguments to describe the output.
Fields
- agent (agent.BaseAgent): the agent to use
- environment ({train=environment.BaseEnvironment, test=environment.BaseEnvironment}): table with train and test environments
- eval_freq (number): number of steps between two evaluations
- eval_steps (number): number of evaluation steps
- loop ({train=table, test=table}): results of the last iteration
- metrics ({train=table, test=table}): table of saved metrics
- metrics_file (torch.DiskFile): file used as output
- output (OutputArgument): output options
- save_at (number): next iteration at which to save the experiment
- step (number): current step
- steps (number): total number of steps to do (excluding evaluation steps)
- timer (torch.Timer): a Timer used to time the experiment
Metamethods

__init ( args )
Default constructor.
Parameters:
- args (InitArgument)
Public Methods

report ()
Report loop results.
Returns: self

run ()
Start the experiment.
This function runs a total of steps learning interactions, interleaved with eval_steps evaluation interactions after every eval_freq learning interactions.

save ()
Save the current experiment and its dependencies.
Returns: self

setSteps ( steps )
Update the total number of steps to execute.
Use this function to continue a previous experiment.
Parameters:
- steps (number): new number of steps to do
Returns: self
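The training/evaluation interleaving performed by run can be sketched as follows. This is a minimal Python illustration of the schedule only, not the actual Lua/Torch implementation; the train and evaluate callbacks are hypothetical stand-ins for the class's _train and _test methods.

```python
def run(steps, eval_freq, eval_steps, train, evaluate, start_step=0):
    """Perform `steps` learning interactions in chunks of `eval_freq`,
    running `eval_steps` evaluation interactions after each chunk.
    Evaluation interactions do not count towards `steps`."""
    step = start_step
    while step < steps:
        chunk = min(eval_freq, steps - step)  # last chunk may be shorter
        train(chunk)                          # learning interactions
        step += chunk
        evaluate(eval_steps)                  # evaluation interactions
    return step

# Example: 10 learning steps, evaluating for 2 steps after every 4
log = []
run(10, 4, 2,
    train=lambda n: log.append(("train", n)),
    evaluate=lambda n: log.append(("eval", n)))
# log: train 4, eval 2, train 4, eval 2, train 2, eval 2
```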
Private Methods

_graphical_report ()
Save some information graphically.
This function plots graphs, saves images, and the like, about the elements of the experiment.
Returns: self

_inputs_report ()
Save some inputs.
This can be used for checks and/or post-mortem debugging.
Returns: self

_interact ( environment, steps )
Actually do the interactions.
This function does the interactions without checking anything beforehand (agent mode, environment state, etc.). It resets the environment during the interactions if necessary.
Parameters:
- environment (environment.BaseEnvironment): environment with which to interact
- steps (number): number of interactions to execute
Returns: InteractionsResult; only a subset of its fields (num_it, interactions, confusion_matrix, inputs) are defined
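The reset-if-necessary behaviour described for _interact can be sketched as follows. This is an illustrative Python sketch, not the Lua/Torch implementation; the reset/step/act interfaces are assumptions, not the actual BaseEnvironment/BaseAgent API.

```python
def interact(env, agent, steps):
    """Do `steps` interactions, resetting the environment whenever a
    terminal state is reached. The reset/step/act interfaces here are
    illustrative assumptions, not the BaseEnvironment/BaseAgent API."""
    result = {"num_it": 0, "num_episodes": 0, "total_reward": 0.0}
    state = env.reset()
    for _ in range(steps):
        action = agent.act(state)
        state, reward, terminal = env.step(action)
        result["num_it"] += 1
        result["total_reward"] += reward
        if terminal:                 # episode finished: reset mid-run
            result["num_episodes"] += 1
            state = env.reset()
    return result

# Toy environment whose episodes last exactly 3 steps, reward 1 per step
class ToyEnv:
    def __init__(self):
        self.t = 0
    def reset(self):
        self.t = 0
        return self.t
    def step(self, action):
        self.t += 1
        return self.t, 1.0, self.t >= 3

class ToyAgent:
    def act(self, state):
        return 0

result = interact(ToyEnv(), ToyAgent(), 7)
# 7 interactions span 2 complete episodes
```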
_metrics_report ()
Build and save the metrics string.
Returns: self

_plot_confusion_matrix ()
Save a graphical version of the confusion matrix.
Returns: self

_plot_f1_score ()
Plot the evolution of the F1-score.
Returns: self

_plot_network_filters ()
Save a graphical representation of the agent's network filters.
Returns: self

_plot_reward_per_ep ()
Plot the evolution of the reward per episode.
Returns: self

_test ( steps )
Do some testing/evaluation interactions.
Parameters:
- steps (number): number of interactions to do
Returns: InteractionsResult (result of the interactions)

_text_report ()
Build and print a quick textual report.
Returns: self

_torch_report ()
Save Torch components.
Todo: save NN weights.
Returns: self

_train ( steps )
Do some training interactions.
Parameters:
- steps (number): number of interactions to do
Returns: InteractionsResult (result of the interactions)

draw_filters ( output_path, network )
Dump images of the filters of the convolutional network.
Parameters:
- output_path (string): base output path for images
- network (nn.Container): network to dump
Static Functions

_compute_classification_metrics ( confusion_matrix )
Compute classification metrics from a multi-class confusion matrix.
See some explanations about these metrics at:
http://blog.revolutionanalytics.com/2016/03/com_class_eval_metrics_r.html
Parameters:
- confusion_matrix (torch.Tensor): a 2D matrix
Returns: ClassificationTable (the classification metrics)
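The per-class metrics typically derived from such a confusion matrix (precision, recall, F1-score) can be sketched in plain Python as follows. This is an illustrative sketch only, not the actual Lua/Torch implementation, and the row/column convention (rows: expected class, columns: executed class) is an assumption.

```python
def classification_metrics(confusion):
    """Per-class precision, recall, and F1-score from a square
    confusion matrix (list of lists). Assumed convention:
    confusion[i][j] counts interactions with expected class i
    and executed class j."""
    n = len(confusion)
    metrics = []
    for c in range(n):
        tp = confusion[c][c]
        col = sum(confusion[i][c] for i in range(n))  # all classified as c
        row = sum(confusion[c])                       # all expected to be c
        precision = tp / col if col else 0.0
        recall = tp / row if row else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        metrics.append({"precision": precision, "recall": recall, "f1": f1})
    return metrics

# Two-class example: class 0 is misclassified twice, class 1 once
m = classification_metrics([[8, 2],
                            [1, 9]])
# m[0]["recall"] == 0.8, m[0]["precision"] == 8/9
```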