JISE

Continuous dynamic in vehicle movement creates a complex topology environment and connectivity relationship in Vehicular Ad-hoc Networks (VANETs). Consequently, overcoming longer transmission connections and reduced transmission reliability has been a significant problem. During the development of routing methods, fundamental research focuses on identifying and selecting paths with short distances or pulverization of information in environments with high traffic density to transmit packets successfully. As a result, transmission losses are minimized, and transport conditions are avoided; however, these techniques struggle to achieve more stable path routes in real-time environments because of sudden increases in traffic levels. Consequently, a model that effectively monitors the distance between vehicles and assists VANETs in selecting paths automatically and more accurately is proposed. To approximate and extend the lifetime of the communication path, we use an OpenAI Gym environment where a Reinforcement Learning (RL) agent can learn the route using the distance between cars as a model and define policies with higher lifetime rewards. The algorithm used a trained agent with the Proximal Policy Optimization (PPO) and Advanced Actor-Critic (A2C) algorithms in our environment. Applying machine learning on VANETs results in improved network efficiency, which offers real-time routing information in mobile environments. The simulation results show that RL-based routing extends the useful life of the route among the investigated methods between the origin and destination hosts.