Original Reddit post

Balloon Popping Challenge is a 6-DoF rocket guidance, navigation, and control (GNC) simulation environment built using Gymnasium . This project is based on ActiveRocketPy , a fork of open-source software RocketPy . The environment is designed to simulate an active controlled rocket to pop balloons scattered in the sky. The simulator incorporates realistic physics, including atmospheric conditions and rocket dynamics, to provide a challenging platform for developing and testing GNC algorithms. Examples are provided for training and evaluation Te The actions, observations, info, rewards in this environment are: actions: launch : a binary command to launch the rocket. launch_inclination_heading : a 2-element array [inclination, heading] representing the launch inclination (0-90 degrees from horizontal) and heading angles (0-360 degrees from north). tvc : a 2-element array [TVC_x, TVC_y] representing the thrust vector control (TVC) gimbal angles (deg). Polarity: positive gimbal angles provide positive torques. throttle : a scalar representing the throttle ratio between 0 and 1. roll : a scalar representing the roll torque command in N-m. observations: simulation_time : the current simulation time in seconds. balloon_status : a n-element array representing the status of each balloon (0: on the ground, 1: released, 2: popped). n is the number of balloons in the scenario. balloon_states : a n x 6 array representing the position (posX, posY, posZ) and velocity (velX, velY, velZ) of each balloon. Position is the center of the balloon in the launch frame (relative to launch origin) in meters. Velocity is the center of the balloon in the launch frame (relative to launch origin) in m/s. rocket_sensors : a 12-element array representing the rocket’s sensor measurements (gyroX, gyroY, gyroZ, accX, accY, accZ, posX, posY, posZ, velX, velY, velZ). Orientation of inertial sensors matches body frame. The measurements will be nan before launch action. Gyroscopes measure the angular velocity (rad/s) in the rocket body frame. Accelerometers measure the linear acceleration (m/s²) in the rocket body frame. Gravity is included in the accelerometer measurements. GNSS sensors measure the position (m) and velocity (m/s) in the launch frame (relative to launch origin). Note that the rocket’s true states (e.g., attitude, angular velocity) are not directly observed by the agent, and the agent needs to infer them from the sensor measurements. info: rocket_states : a 13-element array representing the rocket’s true states. These states are not observed and should not be used by the agent but can be used for development and debugging. The states are [posX, posY, posZ, velX, velY, velZ, e0, e1, e2, e3, wX, wY, wZ]: pos: center of dry mass position (m) in the launch frame (relative to launch origin). vel: center of dry mass velocity (m/s) in the launch frame (relative to launch origin). e: quaternion representing the attitude of the rocket (e0, e1, e2, e3) relative to the launch frame. w: angular velocity (rad/s) in the rocket body frame. rewards: The reward is calculated based on the total number of balloons popped at each time step. submitted by /u/thomasya

Originally posted by u/thomasya on r/ArtificialInteligence