Reinforcement learning-based dynamic adaptation planning method for architecture-based self-managed software