I have just started dabbling in machine learning. I would like to use reinforcement learning to do the following: I have a "core" function that takes 5 input numbers and returns 1 output number. I would like to use a reinforcement learning algorithm to input the 5 numbers into my function and try to maximize the 1 output number. Does my function qualify as "environment"? I am feeling a bit lost in this topic. Could someone point me in the right direction as to which toolbox to use and maybe the functions I should look at in the relevant toolbox?
Thanks a lot!