PIPELINE OBJECT PARAMETERS
help (bool, default: False)
Prints the help information for the package, including an overview and parameter descriptions.
seed (int | None, default: None)
Seed for random number generation. If None, no seed is set.
FIT MODE PARAMETERS
help (bool, default: False)
Prints the help information for the package, including an overview and parameter descriptions.
mode (str, required)
Mode of operation, either 'fit' or 'validation'. In FIT mode, models are fitted to empirical data. In VALIDATION mode, parameter recovery or model recovery is performed, depending on the recovery parameter. This parameter is required, so there is no default value.
models (list[str] | None, default: None)
List of models to fit. Supported models: QLearning, ActorCritic, Relative, Advantage, Hybrid2012, Hybrid2021, StandardHybrid2012, StandardHybrid2021. The standard models are as described in the reference that introduces each model, with the addition of counterfactual learning rates. The Hybrid models have alternative versions without counterfactual learning rates: StandardHybrid2012 and StandardHybrid2021.
- QLearning: Standard Q-Learning Model
- ActorCritic: Standard Actor-Critic Model
- Relative: Standard Relative Model (Palminteri et al., 2015)
- Advantage: Simplified Relative Model (Williams et al., 2025)
- Hybrid2012+bias: Hybrid 2012 Model (Gold et al., 2012)
- Hybrid2021+bias+decay: Hybrid 2021 Model (Geana et al., 2021)
You can add optional parameters to models by appending them to the model name with a + sign:
- +bias: Adds a valence bias to the model (e.g., Hybrid2012+bias); only usable with Hybrid models
- +novel: Adds a free parameter for the novel stimulus (e.g., QLearning+novel); usable with all models
- +decay: Adds a decay parameter to the model (e.g., QLearning+decay); usable with all models
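Model specifications are plain strings, with optional components appended via +. A minimal sketch of how such a spec breaks down; the helper below is illustrative, not a function provided by the package:

```python
# Hypothetical helper showing how optional components such as '+novel'
# extend a base model name; not part of the package itself.
def parse_model_spec(spec):
    """Split e.g. 'QLearning+novel+decay' into ('QLearning', ['novel', 'decay'])."""
    base, *options = spec.split("+")
    return base, options

models = ["QLearning+novel", "Hybrid2012+bias", "Relative"]
for spec in models:
    print(parse_model_spec(spec))
```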
learning_filename (str | None, default: None)
Filename (and path) of the learning task data.
transfer_filename (str | None, default: None)
Filename (and path) of the transfer task data.
random_params (bool | str, default: False)
Mode of determining parameter starting points. Can be 'normal', 'random', or False. If 'normal', starting parameter values are drawn from a normal distribution whose means are defined in the fixed parameter; the values are cut off at the bounds defined in the bounds parameter. If 'random', the parameters are drawn from a uniform distribution between the bounds. If no fixed or bounds parameters are provided, the default values are used (found in the RLModel class).
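The three random_params modes can be sketched as follows; the mean, standard deviation, and bounds here are made-up illustration values, not the package defaults (those live in the RLModel class):

```python
import random

def draw_start(mode, mean=0.5, sd=0.2, bounds=(0.05, 0.95)):
    """Illustration values only: mean/sd/bounds are not the package defaults."""
    lo, hi = bounds
    if mode == "normal":
        # Normal draw around the 'fixed' mean, cut off at the bounds.
        return min(max(random.gauss(mean, sd), lo), hi)
    if mode == "random":
        # Uniform between the bounds; the 'fixed' mean is ignored.
        return random.uniform(lo, hi)
    return mean  # mode is False: use the fixed value directly

random.seed(1)
print(draw_start("normal"), draw_start("random"), draw_start(False))
```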
fixed (dict | None, default: None)
Optional fixed parameter values. In FIT mode, these values are used as the model parameters if random_params = False. If random_params = 'normal', they are used as the means of the normal distribution. If random_params = 'random', they are ignored. In VALIDATION mode, these values are ignored if the parameters or fit_filename parameter is used; otherwise, they are used in the same way as in FIT mode.
bounds (dict | None, default: None)
Bounds that each parameter is cut off at, applied both when fitting and when simulating data. It is a nested dictionary of the form {model: {parameter: tuple}}, where the tuple holds two floats representing the lower and upper bounds, e.g., bounds['QLearning']['factual_lr'] = (0.1, 0.99).
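A minimal sketch of a bounds dictionary in the {model: {parameter: (low, high)}} form described above; 'counterfactual_lr' is a hypothetical parameter name used only for illustration:

```python
# {model: {parameter: (low, high)}} nested structure.
bounds = {
    "QLearning": {
        "factual_lr": (0.1, 0.99),
        "counterfactual_lr": (0.1, 0.99),  # hypothetical parameter name
    },
}

low, high = bounds["QLearning"]["factual_lr"]
print(low, high)
```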
number_of_runs (int, default: 1)
How many times to fit each model per participant. Fitting produces two outputs: the fit_data.pkl and full_fit_data.pkl files. Multiple runs work well with randomized starting points (random_params = 'random' or random_params = 'normal'), because each run then has a different set of starting parameters, which helps find the best-fitting parameters. The fit_data.pkl file contains the best run for each model and participant, while the full_fit_data.pkl file contains all runs.
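Best-run selection can be pictured as keeping the minimum over runs; using negative log-likelihood as the ranking criterion is an assumption here, since the document does not state how runs are compared:

```python
# Several runs of the same model/participant, ranked by a fit criterion.
# Negative log-likelihood ('nll') is an assumed criterion for illustration.
runs = [
    {"run": 1, "nll": 152.3},
    {"run": 2, "nll": 149.8},
    {"run": 3, "nll": 151.1},
]

best = min(runs, key=lambda r: r["nll"])  # the run fit_data.pkl would keep
print(best["run"])
```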
generated (bool, default: False)
Use generated data instead of empirical data.
multiprocessing (bool, default: False)
Use multiprocessing for parallel computation.
training (str, default: 'scipy')
Training backend to use ('scipy' or 'torch'). The pytorch backend is in beta testing: it works, but performs worse than the scipy backend, and why this is the case has not yet been investigated. If the torch backend is used, the training_epochs and optimizer_lr parameters apply; they are ignored with the scipy backend.
training_epochs (int, default: 1000)
If using the torch backend (training = 'torch'), this determines the number of training epochs.
optimizer_lr (float, default: 0.01)
If using the torch backend (training = 'torch'), this is the learning rate for the Adam optimizer (the only optimizer implemented at this time).
number_of_participants (int, default: 0)
TEST PARAMETER. Cuts the provided dataset down to the first N participants, where N is the number you provide. If 0 (the default), all participants are kept. This parameter is designed mostly for testing.
VALIDATE MODE PARAMETERS
help (bool, default: False)
Prints the help information for the package, including an overview and parameter descriptions.
mode (str, required)
Mode of operation, either 'fit' or 'validation'. In FIT mode, models are fitted to empirical data. In VALIDATION mode, parameter recovery or model recovery is performed, depending on the recovery parameter. This parameter is required, so there is no default value.
recovery (str, default: 'parameter')
Sets the recovery mode; can be 'parameter' or 'model'. Parameter recovery generates data with known parameters for a given model and then fits that data with the same model, to determine whether the parameters are recoverable. It can still use a list of models, but each model only recovers its own generated data. Model recovery generates data with known parameters for all given models; each model is then fitted to all generated data (regardless of which model generated it), to test which model best fits the data from every model. Ideally, the model that generated the data should be the best fit.
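The difference between the two recovery modes can be sketched as a pair of loops; generate() and fit_score() below are dummy stand-ins, not actual package functions:

```python
import random

models = ["QLearning", "ActorCritic"]

def generate(model):
    # Dummy data generator standing in for model simulation.
    rng = random.Random(len(model))
    return [rng.random() for _ in range(10)]

def fit_score(model, data):
    # Dummy goodness-of-fit stand-in for the real fitting routine.
    return round(sum(data), 3)

# recovery='parameter': each model fits only its own generated data.
parameter_recovery = {m: fit_score(m, generate(m)) for m in models}

# recovery='model': every model is fitted to every model's generated data.
model_recovery = {
    gen: {fit: fit_score(fit, generate(gen)) for fit in models}
    for gen in models
}
print(parameter_recovery)
print(model_recovery)
```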
models (list[str] | None, default: None)
List of models to fit. Supported models: QLearning, ActorCritic, Relative, Advantage, Hybrid2012, Hybrid2021, StandardHybrid2012, StandardHybrid2021.
- QLearning: Standard Q-Learning Model
- ActorCritic: Standard Actor-Critic Model
- Relative: Standard Relative Model (Palminteri et al., 2015)
- Advantage: Simplified Relative Model (Williams et al., in prep)
- Hybrid2012+bias: Standard Hybrid 2012 Model (Gold et al., 2012)
- Hybrid2021+bias+decay: Standard Hybrid 2021 Model (Geana et al., 2021)
You can add optional parameters to models by appending them to the model name with a + sign:
- +bias: Adds a valence bias to the model (e.g., Hybrid2012+bias); only usable with Hybrid2012 and Hybrid2021
- +novel: Adds a free parameter for the novel stimulus (e.g., QLearning+novel); usable with all models
- +decay: Adds a decay parameter to the model (e.g., QLearning+decay); usable with all models
learning_filename (str | None, default: None)
Filename (and path) of the learning task data.
transfer_filename (str | None, default: None)
Filename (and path) of the transfer task data.
random_params (bool | str, default: False)
Mode of determining parameter starting points. Can be 'normal', 'random', or False. If 'normal', starting parameter values are drawn from a normal distribution whose means are defined in the fixed parameter; the values are cut off at the bounds defined in the bounds parameter. If 'random', the parameters are drawn from a uniform distribution between the bounds. If no fixed or bounds parameters are provided, the default values are used (found in the RLModel class).
fixed (dict | None, default: None)
Optional fixed parameter values. In FIT mode, these values are used as the model parameters if random_params = False. If random_params = 'normal', they are used as the means of the normal distribution. If random_params = 'random', they are ignored. In VALIDATION mode, these values are ignored if the parameters or fit_filename parameter is used; otherwise, they are used in the same way as in FIT mode.
bounds (dict | None, default: None)
Bounds that each parameter is cut off at, applied both when fitting and when simulating data. It is a nested dictionary of the form {model: {parameter: tuple}}, where the tuple holds two floats representing the lower and upper bounds, e.g., bounds['QLearning']['factual_lr'] = (0.1, 0.99).
parameters (dict | list[dict] | None, default: None)
A nested dictionary where the first level is the model name, which maps to a dictionary of that model's parameters and their values. These values are used to generate data with the model, and the random_params variable is ignored. This parameter conflicts with the fit_filename parameter; if both are provided, you will receive an error. The intended use is to run a specific model with set parameters. Note that this is not the same as the fixed parameter, which is used as priors (the means) when random_params = 'normal'. However, if this parameter is not provided and random_params = False, the fixed parameter is used as the model parameters.
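A minimal sketch of a parameters dictionary in the {model: {parameter: value}} form described above; 'temperature' is a hypothetical parameter name used only for illustration:

```python
# {model: {parameter: value}} nested structure for data generation.
parameters = {
    "QLearning": {
        "factual_lr": 0.3,
        "temperature": 2.0,  # hypothetical parameter name
    },
}

# One top-level entry per model; each maps parameter names to values.
for model, values in parameters.items():
    print(model, values)
```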
fit_filename (str | None, default: None)
Filename (and path) of pre-fitted model parameters, saved as fit_data.pkl when running FIT mode. This parameter overrides the parameters parameter, so all information provided there also applies here. The file uses participant IDs to determine the model parameters for each participant, so it is important that the same data is used in recovery as was used in FIT mode.
task_design (dict | None, default: None)
Defines the task parameters (e.g., trials) to be run. This is an alternative to the learning_filename and transfer_filename parameters, which instead use the predefined task designs for each participant. It is a nested dictionary with the following structure: the top level must have 'learning_phase' and 'transfer_phase' as keys. The 'learning_phase' key holds a dict with 'number_of_trials' and 'number_of_blocks' as keys. The 'transfer_phase' key holds a dict with either 'number_of_trials' or 'times_repeated' as a key; these two are mutually exclusive, and if both are provided, 'number_of_trials' is used. In the transfer phase, 'number_of_trials' defines the number of trials, while 'times_repeated' instead defines how many times all pairs of stimuli are repeated (there are 36 pairs).
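A sketch of a task_design dictionary following the structure described above; the numbers are made-up illustration values:

```python
task_design = {
    "learning_phase": {
        "number_of_trials": 160,  # illustration value
        "number_of_blocks": 4,    # illustration value
    },
    "transfer_phase": {
        # 'times_repeated': each of the 36 stimulus pairs shown twice
        "times_repeated": 2,
    },
}

# With times_repeated = 2 and 36 pairs, the transfer phase has 72 trials.
print(36 * task_design["transfer_phase"]["times_repeated"])
```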
datasets_to_generate (int, default: 1)
The number of participants to generate when using the task_design parameter. This is overridden by the number of participants in your data if you use the learning_filename and transfer_filename parameters.
number_of_runs (int, default: 1)
How many times to fit each model per participant. Fitting produces two outputs: the fit_data.pkl and full_fit_data.pkl files. Multiple runs work well with randomized starting points (random_params = 'random' or random_params = 'normal'), because each run then has a different set of starting parameters, which helps find the best-fitting parameters. The fit_data.pkl file contains the best run for each model and participant, while the full_fit_data.pkl file contains all runs.
number_of_participants (int, default: 0)
TEST PARAMETER. Cuts the provided dataset down to the first N participants, where N is the number you provide. If 0 (the default), all participants are kept. This parameter is designed mostly for testing.
generate_data (bool, default: True)
Determines whether new data is generated using the models. The intended use is to toggle off data generation when you have already generated data: for example, if you generated data when validating with recovery = 'parameter' (i.e., generate_data = True) and now want to run recovery = 'model' on the same generated data (i.e., generate_data = False).
clear_data (bool, default: True)
Whether to clear previously simulated data before generating new data. This is ignored if generate_data = False, so as not to delete the data you are planning to use.
multiprocessing (bool, default: False)
Use multiprocessing for parallel computation.
training (str, default: 'scipy')
Training backend to use ('scipy' or 'torch'). The pytorch backend is in beta testing: it works, but performs worse than the scipy backend, and why this is the case has not yet been investigated. If the torch backend is used, the training_epochs and optimizer_lr parameters apply; they are ignored with the scipy backend.
training_epochs (int, default: 1000)
If using the torch backend (training = 'torch'), this determines the number of training epochs.
optimizer_lr (float, default: 0.01)
If using the torch backend (training = 'torch'), this is the learning rate for the Adam optimizer (the only optimizer implemented at this time).