-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Random policy example without TF1 dependency #8
Comments
What do you mean by a random policy to collect data? Also, what kind of data do you want to collect? For instance, the easiest way to generate random policy is to generate trajectories or change the control API parameters (there are at least 3-4 different APIs to control parameters like position, velocity, pitch, roll, yaw, etc. This doesn't require you to use TF/Keras. The examples you see in the |
Thanks @srivatsankrishnan
|
@srivatsankrishnan any ideas on if it is possible to collect rollouts with a random policy? |
Definitely possible! You need to set it up something like this:
You can write your own function to randomly generate actions and pass it to the foo_random method. Note, this is just illustration of how you can do it. You might need to include the right includes and make sure it compiles. The step function should return you the env, rewards, dones (status), and info. |
Thanks @srivatsankrishnan. I have the following script to test out the idea:
But I notice after establishing connection with AirSim/Unreal it keeps on printing CONNECTED (and some other text) and does not execute |
Hi, |
No it does not really work.
Script opens up a new window but the drone stays stationary. |
@srivatsankrishnan I would really appreciate any ideas on how to resolve this issue. |
Have these issues been resolved now, @aqibsaeed? |
nope! |
@aqibsaeed I haven't even reached the steps in the picture you posted, and I've been forced to stop due to other issues! I don't know how you got to this point. Envious! |
Hi @srivatsankrishnan,
Is there any example of a random policy to collect data without any dependency on Tensorflow/Keras etc.? As all these frameworks are sort of outdated and there is very little support for them.
I am looking for a way to simply instantiate an environment and collect data with a random policy and afterwards do RL in TF2. Could you please direct me to such an example ? The examples I see under
runt_time
andtest_suites
all require TF1/Keras.Thanks in advance.
The text was updated successfully, but these errors were encountered: