Reinforcement learning-based dynamic zone positions for mixed traffic flow variable speed limit control with congestion detection