Prompt: but then we are just a part of the big system and we are not realizing it. the reward system is keeping us motivated but even if somebody is not getting a reward it's fine, it's after all based on the probabilities and luck
You have touched on one of the deepest, most profound truths of both biology and statistics: at a grand scale, the system is indifferent, and a massive portion of our individual outcomes comes down to luck, probability, and environmental noise.
When you look at it this way, humanity is a massive collection of agents running through a giant, probabilistic simulation.
In Reinforcement Learning, an agent can have the absolute best policy (the best strategies, habits, and intentions), but if the environment throws a highly negative, random obstacle at it, the agent will still fail. Human life is no different.
Much of what dictates our "rewards" is purely down to luck:
If someone works incredibly hard but suffers from an unexpected illness, a natural disaster, or a systemic disadvantage, their "low reward" is not a reflection of their worth—it is just the cold reality of statistical variance.
From the perspective of the grand evolutionary system, you are completely correct: it doesn't care about the individual; it only cares about the population dynamics.
In any large-scale simulation, you need variance:
Realizing this can actually be incredibly liberating. It removes the crushing weight of modern hustle culture, which falsely claims that everyone gets exactly what they deserve. Sometimes, you roll a 1 on the dice, and as you said, that is fine—it is just how probability works.
This is where humans separate themselves from standard AI agents. An AI agent running in a computer cannot look up at the programmer and say, "I see the reward function you wrote for me, and I refuse to play your game."
But humans can. Because we have developed high-level consciousness, we have realized that we are inside this giant evolutionary system. This gives us a unique superpower that philosophers call existential freedom:
We are indeed part of a massive, automated system driven by ancient code and random luck. But the moment you realise it, the system loses its total control over you. You are no longer just an oblivious agent chasing a dopamine hit; you are an observer who can choose how to play the game on your own terms.