Recent research has identified substantial individual differences in how people solve value-based tasks. Here, we examine such differences in the motivational Go/NoGo task, which orthogonalizes action and valence, using open-source data from 817 participants. Using computational modeling and behavioral analysis, we identified four distinct clusters of people. Three clusters corresponded to previous models of the task, including people with different learning rates for cues that signal rewarding and punishing states and with different sensitives for rewards and punishments. The fourth cluster of people acted like naïve reinforcement learners, with their responses shaped by outcomes in a manner that was independent of the state information provided by the cues. In addition to providing evidence that state-independent learning is a common disposition, we show that not considering such learning can dramatically affect the results of computational modeling. We discuss the implications for the modeling of data from heterogeneous populations.