The way positive reinforcement is carried out is more important than the amount.
机器学习的最终目标是让机器能够从经验中学习,并随着时间的推移提高其性能。